BLASTX nr result

ID: Glycyrrhiza32_contig00031359 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza32_contig00031359
         (877 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU38148.1 hypothetical protein TSUD_395930 [Trifolium subterran...   201   1e-57
GAU50434.1 hypothetical protein TSUD_134890, partial [Trifolium ...   202   3e-56
KHN28363.1 Putative ribonuclease H protein, partial [Glycine soja]    194   8e-56
KHN20323.1 Putative ribonuclease H protein, partial [Glycine soja]    192   4e-55
KHN30886.1 Putative ribonuclease H protein, partial [Glycine soja]    188   6e-54
XP_019447203.1 PREDICTED: uncharacterized protein LOC109350421 [...   187   9e-53
XP_019418409.1 PREDICTED: uncharacterized protein LOC109329191 [...   194   1e-52
GAU50085.1 hypothetical protein TSUD_371690 [Trifolium subterran...   184   2e-51
GAU29820.1 hypothetical protein TSUD_223660 [Trifolium subterran...   188   3e-51
GAU40143.1 hypothetical protein TSUD_163120 [Trifolium subterran...   182   5e-51
GAU43007.1 hypothetical protein TSUD_187280 [Trifolium subterran...   189   9e-51
KYP32706.1 Transposon TX1 uncharacterized [Cajanus cajan]             189   1e-50
GAU49581.1 hypothetical protein TSUD_139980 [Trifolium subterran...   180   2e-50
KYP69313.1 hypothetical protein KK1_008502 [Cajanus cajan]            176   3e-49
GAU29496.1 hypothetical protein TSUD_360410 [Trifolium subterran...   183   1e-48
GAU43110.1 hypothetical protein TSUD_373050 [Trifolium subterran...   182   2e-48
GAU35675.1 hypothetical protein TSUD_162470 [Trifolium subterran...   179   2e-48
KYP44529.1 Putative ribonuclease H protein At1g65750 family [Caj...   178   3e-48
GAU44350.1 hypothetical protein TSUD_129240 [Trifolium subterran...   174   4e-48
KYP44023.1 Putative ribonuclease H protein At1g65750 family [Caj...   181   5e-48

>GAU38148.1 hypothetical protein TSUD_395930 [Trifolium subterraneum]
          Length = 503

 Score =  201 bits (512), Expect = 1e-57
 Identities = 111/290 (38%), Positives = 161/290 (55%), Gaps = 3/290 (1%)
 Frame = +1

Query: 10  CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYG-DGIAKCGTY-LQI 183
           C P+ +GGLGV+DL+L N SLLAK RW+L T +  +W+E++  RYG D I K     + +
Sbjct: 123 CKPKKEGGLGVRDLRLVNISLLAKWRWKLLTTECEVWKEVVGARYGRDVIGKVNLGDIDV 182

Query: 184 RQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMF 363
            + GS W +DL LL+ D      WFS+A+ +++G G S  FW+E W G   L+  FPR+F
Sbjct: 183 TRTGSCWWRDLCLLDSDVR----WFSSAVGKRVGRGDSTMFWNEIWIGDQPLRQRFPRLF 238

Query: 364 SLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWS 543
            +ST +   + +MG     +W W+L WRR+FF WE      F + + QF+P+   +D W 
Sbjct: 239 GMSTQQNEVICNMGSLVNGLWHWELQWRRNFFTWEEDQYNHFLDIIVQFAPTVQ-QDRWL 297

Query: 544 WKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSL-DPRVSKALGVLWQTNVPSNIQIFSWR 720
           W  D               AY  ++   ++ S+ DP       +LW+   PS +  FSW+
Sbjct: 298 WLGD------GVQGYTANSAYSLVVNKLVTPSVCDPINDLVFKILWKCGAPSKVSAFSWQ 351

Query: 721 LFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870
           L LDRL T++ LM+R II + HG +CVFC    ES  HLF  C     +W
Sbjct: 352 LMLDRLQTKDNLMKRRIIQAHHG-NCVFCNLAQESASHLFLHCDRVAKVW 400


>GAU50434.1 hypothetical protein TSUD_134890, partial [Trifolium subterraneum]
          Length = 712

 Score =  202 bits (513), Expect = 3e-56
 Identities = 107/290 (36%), Positives = 154/290 (53%), Gaps = 3/290 (1%)
 Frame = +1

Query: 10   CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYG---DGIAKCGTYLQ 180
            CLP++KGGLG+K+L  FN +LL K +WR   D + +W +L+  RYG   D   +  T   
Sbjct: 387  CLPKDKGGLGIKNLNCFNQALLCKWKWRGLCDHNTLWTKLLEHRYGSLADNFLR-DTTRD 445

Query: 181  IRQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRM 360
            ++ Q S+W +D+ ++       D WF   +   LGNG+ I+FW E W+GP CLKDLFP++
Sbjct: 446  VKGQ-SLWWRDIMMIGGIEN--DAWFRFNVRNVLGNGTCIRFWHETWHGPVCLKDLFPQL 502

Query: 361  FSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSW 540
            +  S   E  + D+G+     W W L W  +  + E     E    L+   PS    D  
Sbjct: 503  YCKSPQAEAIIYDVGKWVNQQWVWNLQWSTNLTSTEHDAACELANLLTGIQPSLECADRR 562

Query: 541  SWKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWR 720
             W L  +              Y+ L   ++  +++  V KAL +LW  +VPS + IF WR
Sbjct: 563  RWGLTQTGMFSVKS------TYEFLQSREVVVAIEDNVVKALQLLWLNDVPSKVSIFGWR 616

Query: 721  LFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870
            L L RLPTR  L R+NII + H +SC+FC  E E   HL F+C FS  +W
Sbjct: 617  LLLSRLPTRMALARKNIIVNLHELSCIFCGEEQEELSHLLFNCPFSQELW 666


>KHN28363.1 Putative ribonuclease H protein, partial [Glycine soja]
          Length = 417

 Score =  194 bits (494), Expect = 8e-56
 Identities = 107/289 (37%), Positives = 150/289 (51%), Gaps = 2/289 (0%)
 Frame = +1

Query: 10  CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAK--CGTYLQI 183
           C P+ +GGLGVK+L++FN SLLAK +WR   D +A+WR+L+ FRYG+ IAK  C      
Sbjct: 70  CKPKKEGGLGVKNLEVFNISLLAKWKWRCIHDHNALWRDLLAFRYGNLIAKQTCSLDRSW 129

Query: 184 RQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMF 363
             + SIW +DL LLE D      +F  A+   +G+G SI FW  +W G   LKD FP +F
Sbjct: 130 GTKDSIWWRDLMLLEKDLSQNQNFFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELF 189

Query: 364 SLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWS 543
           ++S+ +   VG+ G  R D W W L W+R     E   L      L          D W 
Sbjct: 190 AISSQQLVSVGNAGSWRRDQWTWGLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWK 249

Query: 544 WKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWRL 723
           W L  S              Y   + +   + ++  +   L ++W+  VPS + +F WRL
Sbjct: 250 WSLHNSK------LFTVSSCYSFAMSLVNQTQMNSDILDILSIVWKVPVPSKVALFCWRL 303

Query: 724 FLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870
            LDRLPT++ L+RRN++   +   C  C    E+  HLFF C FS  IW
Sbjct: 304 LLDRLPTKDNLIRRNVVI--NNSRCSLCDSCDENVVHLFFHCDFSNCIW 350


>KHN20323.1 Putative ribonuclease H protein, partial [Glycine soja]
          Length = 417

 Score =  192 bits (489), Expect = 4e-55
 Identities = 106/289 (36%), Positives = 149/289 (51%), Gaps = 2/289 (0%)
 Frame = +1

Query: 10  CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAK--CGTYLQI 183
           C P+ +GGLGVK+L++FN SLLAK +WR   D +A+WR+L+ FRYG+ IAK  C      
Sbjct: 70  CKPKKEGGLGVKNLEVFNISLLAKWKWRCIHDHNALWRDLLAFRYGNLIAKQTCSLDRSW 129

Query: 184 RQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMF 363
             + SIW +DL LLE D      +F  A+   +G+G SI FW  +W G   LKD FP +F
Sbjct: 130 GTKDSIWWRDLMLLEKDLSQNQNFFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELF 189

Query: 364 SLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWS 543
           ++S+ +   VG+    R D W W L W+R     E   L      L          D W 
Sbjct: 190 AISSQQLESVGNASSWRRDQWTWGLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWK 249

Query: 544 WKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWRL 723
           W L  S              Y   + +   + ++  +   L ++W+  VPS + +F WRL
Sbjct: 250 WSLHNSK------LFTVSSCYSFAMSLVNQTQMNSDILDILSIVWKVPVPSKVALFCWRL 303

Query: 724 FLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870
            LDRLPT++ L+RRN++   +   C  C    E+  HLFF C FS  IW
Sbjct: 304 LLDRLPTKDNLIRRNVVI--NNSRCSLCDSCDENVVHLFFHCDFSKCIW 350


>KHN30886.1 Putative ribonuclease H protein, partial [Glycine soja]
          Length = 373

 Score =  188 bits (478), Expect = 6e-54
 Identities = 105/283 (37%), Positives = 146/283 (51%), Gaps = 2/283 (0%)
 Frame = +1

Query: 28  GGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAK--CGTYLQIRQQGSI 201
           GGLGVK+L++FN SLLAK +WR   D +A+WR+L+ FRYG+ IAK  C        + SI
Sbjct: 1   GGLGVKNLEVFNISLLAKWKWRCIHDHNALWRDLLAFRYGNLIAKQTCSLDRSWGTKDSI 60

Query: 202 WMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMFSLSTSK 381
           W +DL LLE D      +F  A+   +G+G SI FW  +W G   LKD FP +F++S+ +
Sbjct: 61  WWRDLMLLEKDLSQNQNFFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELFAISSQQ 120

Query: 382 EGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWSWKLDPS 561
              VG+ G  R D W W L W+R     E   L      L          D W W L  S
Sbjct: 121 LVSVGNAGSWRRDQWTWDLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWKWSLHNS 180

Query: 562 NTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWRLFLDRLP 741
                         Y   + +   + ++  +   L ++W+  VPS + +F WRL LDRLP
Sbjct: 181 K------LFTVSSCYSFAMSLVNQTQMNSDILDILSIVWKVPVPSKVALFCWRLLLDRLP 234

Query: 742 TRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870
           T++ L+RRN++   +   C  C    E+  HLFF C FS  IW
Sbjct: 235 TKDNLIRRNVVI--NNSRCSLCDSCDENVVHLFFHCDFSKCIW 275


>XP_019447203.1 PREDICTED: uncharacterized protein LOC109350421 [Lupinus
           angustifolius]
          Length = 456

 Score =  187 bits (476), Expect = 9e-53
 Identities = 102/284 (35%), Positives = 149/284 (52%), Gaps = 4/284 (1%)
 Frame = +1

Query: 31  GLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQIR--QQGSIW 204
           G GVK+L LFN +LL K RWR+ +   ++W +++   YG      G  + +   ++GS W
Sbjct: 12  GFGVKNLGLFNLALLGKWRWRMLSSSESLWVKVLRSIYGVEAVVRGGLVDVECFKKGSSW 71

Query: 205 MKDL-FLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMFSLSTSK 381
            +DL  +   D G    WF+  + R++G+G S  FW + W G  CLK+ F R+F ++ +K
Sbjct: 72  WRDLGCVCNRDNGFNKGWFNEGVRRRVGSGQSTLFWRDIWVGGECLKNCFERLFQVTLNK 131

Query: 382 EGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWSWKLDPS 561
           + C+  M E R  VWCW L WRR  F WE   + +    + +    +  ED W W  D +
Sbjct: 132 DACISSMDEWRNGVWCWLLNWRRSLFLWEQDEVNDLLNKVEEVRLVQGNEDGWLWVHDKN 191

Query: 562 NTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWRLFLDRL 738
            T           AYK L  EV+  + L  +       LW + VPS ++ F+WRLF+  +
Sbjct: 192 GT------YSVRNAYKVLQNEVRNDNYLHYK------RLWASKVPSKLKCFAWRLFVGGV 239

Query: 739 PTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870
           PTR  L RR II S     C FC    ES +HLFF+C+ SY +W
Sbjct: 240 PTRMNLARRGIIGSLPSTLCAFCGELEESSDHLFFTCSLSYSVW 283


>XP_019418409.1 PREDICTED: uncharacterized protein LOC109329191 [Lupinus
            angustifolius]
          Length = 953

 Score =  194 bits (493), Expect = 1e-52
 Identities = 105/293 (35%), Positives = 154/293 (52%), Gaps = 4/293 (1%)
 Frame = +1

Query: 4    KACLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQI 183
            + C  + +GGLGVK+L LFN +LL K RW + +   ++W +++   YG      G  + +
Sbjct: 562  EVCRSKEEGGLGVKNLGLFNLALLGKWRWHMLSSSESLWVKVLRSIYGVEAVVRGGLVDV 621

Query: 184  R--QQGSIWMKDL-FLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFP 354
               ++GS W +DL  L   D G    WF+  + R++G+G S  FW + W G  CLK+ F 
Sbjct: 622  ECFKKGSSWWRDLGCLCNRDNGFNKGWFNEGVRRRVGSGQSTLFWRDIWVGGECLKNCFE 681

Query: 355  RMFSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIED 534
            R+F ++ +K+ C+  MGE R  VWCW L WRR  F WE   + +    + +    +  ED
Sbjct: 682  RLFQVTLNKDACISSMGEWRNGVWCWLLNWRRSLFLWEQDEVNDLLNKVEEVRLVQGNED 741

Query: 535  SWSWKLDPSNTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQIF 711
             W W  D + T           AYK L  EV+  + L  +       LW + VPS ++ F
Sbjct: 742  GWLWVHDKNGT------YSVRNAYKVLQNEVRNDNYLHYK------RLWASKVPSKLKCF 789

Query: 712  SWRLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870
            +WRLF+  +PT   L RR II S     C FC    ES +HLFF+C+ SY +W
Sbjct: 790  AWRLFVGGVPTWMNLARRGIIGSLPSTLCAFCGELEESSDHLFFTCSLSYSVW 842


>GAU50085.1 hypothetical protein TSUD_371690 [Trifolium subterraneum]
          Length = 438

 Score =  184 bits (466), Expect = 2e-51
 Identities = 106/291 (36%), Positives = 149/291 (51%), Gaps = 4/291 (1%)
 Frame = +1

Query: 10  CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQIR- 186
           C  ++KGGLGV+D++L N SLLAK RWRL    + +W+E+++ +YG+ I     +  IR 
Sbjct: 54  CKEKSKGGLGVRDVRLANLSLLAKWRWRLLLPGNPLWKEVLVAKYGNHILNRVDWRDIRI 113

Query: 187 -QQGSIWMKDLFLLEHDRGVPDL-WFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRM 360
               S W KD+  L  D+ V +  W + ++ RK+GNG+S  FW   W G   L   FP +
Sbjct: 114 PTLASKWWKDICTL--DKVVDNHNWLAESMIRKVGNGTSTSFWCSNWIGEAPLSVTFPLL 171

Query: 361 FSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSW 540
           FSLS  K G V +  +  G+ W W   WRRD F WE  ++    E L     S  +ED W
Sbjct: 172 FSLSNHKNGMVRNFCDHVGENWRWSFSWRRDLFQWEEDLVVRLREILEPVVLSL-VEDFW 230

Query: 541 SWKLDPSNTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQIFSW 717
           SWKLDP              AY  L+ E+     L+  ++     +W +  PS +  FSW
Sbjct: 231 SWKLDPEG------KFSVKSAYTFLVEELTRDDDLEEAMATVFDQIWDSPAPSKVIAFSW 284

Query: 718 RLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870
           +L  DR+PTR  L  R ++       CV C   VES  HLF  C  +  +W
Sbjct: 285 QLLSDRIPTRRNLEIRGLLGLDMPWECVGCVGRVESTTHLFLHCPSAMMVW 335


>GAU29820.1 hypothetical protein TSUD_223660 [Trifolium subterraneum]
          Length = 672

 Score =  188 bits (477), Expect = 3e-51
 Identities = 104/292 (35%), Positives = 159/292 (54%), Gaps = 3/292 (1%)
 Frame = +1

Query: 10   CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGD---GIAKCGTYLQ 180
            C P+ +GGLG+++L+L N SLL K RWRL + +  +W+++++ +YG+   G A+    + 
Sbjct: 292  CKPKKEGGLGIRNLRLVNLSLLTKWRWRLLSGEGEVWKDIIVAKYGERVMGNARLDNIVY 351

Query: 181  IRQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRM 360
            + Q GS W +DL  L+ D G    WF+  + +K+G G+SI FW + W G   L+  FPR+
Sbjct: 352  L-QFGSAWWRDLCNLDKDEG----WFNQVVLKKVGMGNSILFWKDVWAGDQSLEHRFPRL 406

Query: 361  FSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSW 540
            F +S  +   V +MG      W W+L WRR FF WE  ++RE  E L+ F  S+ + D W
Sbjct: 407  FGISIQQNEVVRNMGSWVNVEWRWELLWRRQFFVWENELVRELGEVLNIFPLSEEV-DRW 465

Query: 541  SWKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWR 720
             WK  P+                TL+   I   L P  + +   +W+  VPS +   +W+
Sbjct: 466  VWK--PNEAEGFSVKSLYDWLDSTLVTRAI---LTPLEAFSFCSIWKCVVPSKVSALAWQ 520

Query: 721  LFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIWSA 876
            LFLDR+PT++ L RR II S   + C  C    E+  H+F  C F+  +W A
Sbjct: 521  LFLDRIPTKDNLCRRRIIRSEDAV-CDMCGGVSETSRHVFMHCDFAAQVWYA 571


>GAU40143.1 hypothetical protein TSUD_163120 [Trifolium subterraneum]
          Length = 419

 Score =  182 bits (462), Expect = 5e-51
 Identities = 104/294 (35%), Positives = 155/294 (52%), Gaps = 3/294 (1%)
 Frame = +1

Query: 4   KACLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYL-- 177
           + C PR++GGLGV+D+   N SLL K RWRL     A W+E+++ +YG  +A+   +   
Sbjct: 34  EVCRPRSEGGLGVRDVAKVNLSLLIKWRWRLLQSGYAFWKEVLVAKYGI-MARFKVHWIG 92

Query: 178 -QIRQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFP 354
             +  + S+W KD+  +  D      WF+  +CRKLGNG+S +FW +RW G   L D FP
Sbjct: 93  HALPNRVSLWWKDICGI--DIREDGSWFARNMCRKLGNGNSTRFWLDRWIGSLPLSDQFP 150

Query: 355 RMFSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIED 534
           R+FSLS +++G V +  + RG    W + WRR  F WE  +L+   + L    P    ED
Sbjct: 151 RLFSLSLNQQGMVREFRDVRGGEDGWVMRWRRRLFVWEEELLQRLQDLLPVDVPWSEAED 210

Query: 535 SWSWKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFS 714
            WSW+L+   +           +         +SS + +     G +W++ VPS +  F+
Sbjct: 211 RWSWRLEEDGSFSVSSMYWYLGSV-----FSQASSFNAQELWVFGKIWKSPVPSKVIAFT 265

Query: 715 WRLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIWSA 876
           W+L  +R+PTR  L  R  I    G+ CV C    ES  HLF  C F+  IW+A
Sbjct: 266 WKLLRNRIPTRCNLASRG-IQLIGGLDCVHCVGREESGTHLFMFCDFAGQIWNA 318


>GAU43007.1 hypothetical protein TSUD_187280 [Trifolium subterraneum]
          Length = 1892

 Score =  189 bits (481), Expect = 9e-51
 Identities = 104/293 (35%), Positives = 151/293 (51%), Gaps = 5/293 (1%)
 Frame = +1

Query: 10   CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTY--LQI 183
            C P+ + GLG++DL++ N SLLAK RW+L + Q  +W+E+++ +YG  I   G    + I
Sbjct: 1564 CKPKKEAGLGIRDLRVVNISLLAKWRWKLLSHQREVWKEVVIAKYGQYIIGNGNLGNVTI 1623

Query: 184  RQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMF 363
             +  S W KD+  L+ D      WF+ A+ + +GNG    FWS+ W G   L+  FPRM+
Sbjct: 1624 PRVASTWWKDICSLDKDSN----WFAEAVEQSVGNGHLTSFWSDIWIGDQSLQQRFPRMY 1679

Query: 364  SLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWS 543
            S+S  K+  + +MG   GD W W   WRR+ FAWE  +  E  + L+QF PS   ED W 
Sbjct: 1680 SISNQKDSSIFNMGRWDGDRWRWDFNWRRNLFAWEEPMKLELMDVLNQFRPSDR-EDRWL 1738

Query: 544  W---KLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFS 714
            W   K D  +              + +LE        P        LW+   P+ +  FS
Sbjct: 1739 WSENKEDGFSVKTCYDRLQYMFCERRVLE--------PSEEFVFAKLWKCGAPTKVCAFS 1790

Query: 715  WRLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIWS 873
            W+L  DRL T+E L +R I+     M CV C   VE+  HLF  C F+  +W+
Sbjct: 1791 WQLLWDRLQTKENLYKRRILQQQQTM-CVLCNAAVETNRHLFLHCDFAAKVWN 1842


>KYP32706.1 Transposon TX1 uncharacterized [Cajanus cajan]
          Length = 1025

 Score =  189 bits (480), Expect = 1e-50
 Identities = 102/284 (35%), Positives = 149/284 (52%), Gaps = 1/284 (0%)
 Frame = +1

Query: 4    KACLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYG-DGIAKCGTYLQ 180
            K   P+ +GGLG+K++  FN +LLAK RW L  +  +MW  ++L +YG D    C +Y +
Sbjct: 735  KVTRPKEEGGLGIKNIATFNVALLAKWRWNLFHNPDSMWARVLLSKYGVDRPNLCTSYNK 794

Query: 181  IRQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRM 360
             +   SIW +D+ L        D WF  +   K+G G    FW +RW G  CL  L+PR+
Sbjct: 795  TK--ASIWWRDV-LKACGADNEDKWFDKSKDWKMGEGKQTLFWLDRWTGEECLAVLYPRL 851

Query: 361  FSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSW 540
            F +S  K+  V  MG+   D W W+  WRR+ F WE   +   ++ L+ FS  K   D W
Sbjct: 852  FLISEQKQDTVHKMGQWVDDTWVWEFRWRRERFDWEANQILTLHQILNTFSMKKLKNDYW 911

Query: 541  SWKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWR 720
             WKL+PS             AYK L   + ++       K    +W+ +VP  + +F WR
Sbjct: 912  YWKLEPSG------EFSVKSAYKFLTSQRSTNER----QKLFVCMWKLHVPLKVSLFVWR 961

Query: 721  LFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCA 852
            L ++ LPT+E L+RRNI   P    CVFC   +E+  HLF +C+
Sbjct: 962  LLINALPTKENLLRRNIQLEPQNRLCVFCRASLETASHLFCTCS 1005


>GAU49581.1 hypothetical protein TSUD_139980 [Trifolium subterraneum]
          Length = 407

 Score =  180 bits (457), Expect = 2e-50
 Identities = 107/291 (36%), Positives = 148/291 (50%), Gaps = 3/291 (1%)
 Frame = +1

Query: 10  CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQIRQ 189
           C P+ +GGLGVKDLK FN SLL K RWRL  +  ++W+ ++  +YG+ + +    L    
Sbjct: 26  CRPKEEGGLGVKDLKWFNISLLTKWRWRLLLEHGSLWKLVLEAKYGN-VERVKLVLPRGN 84

Query: 190 QGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMFSL 369
           + S+W KDL  L    GV D W  +   +KLG G S +FW +RW G   L + F R+F +
Sbjct: 85  KFSLWWKDLVGLGVTNGVEDDWNQHVFLKKLGCGGSTRFWLDRWVGLAPLCETFSRIFKV 144

Query: 370 STSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWSWK 549
           S   E  + D+GE   D W W+L WRR FF  E        E ++    +K  EDSWS+ 
Sbjct: 145 SLHPECVIKDLGEWVNDTWVWRLAWRRSFFIREEESYNNLMEIITPVPITKE-EDSWSF- 202

Query: 550 LDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVL---WQTNVPSNIQIFSWR 720
               +             YK  L         P V  ++GV+   W++  P  + +FSW+
Sbjct: 203 ---IDRGMFTVRYMYSYLYKKFLPP------SPLVLCSVGVIARVWESWAPLKVIVFSWQ 253

Query: 721 LFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIWS 873
             L RLPTR  L+RR II       CVFC    ES  HLF SC  ++ +WS
Sbjct: 254 ALLGRLPTRGNLVRRRIIIDGEASFCVFCNGARESENHLFSSCGTAWLVWS 304


>KYP69313.1 hypothetical protein KK1_008502 [Cajanus cajan]
          Length = 375

 Score =  176 bits (447), Expect = 3e-49
 Identities = 97/286 (33%), Positives = 147/286 (51%), Gaps = 5/286 (1%)
 Frame = +1

Query: 28  GGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQIRQQGSIWM 207
           GGLGVK++  FN +LLAK RW L     ++W  ++  RYG G   C      R   SIW 
Sbjct: 2   GGLGVKNITRFNMALLAKWRWSLFHQNDSLWARVLYSRYGGGTNLCAQSSSRRD--SIWW 59

Query: 208 KDLFL----LEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMFSLST 375
           +DL +    LE D      WF   +  ++G+G+  +FW + W GP CL  +FPR+F++S 
Sbjct: 60  RDLLMVCGGLEQDN-----WFERKIKWRIGSGARARFWLDNWTGPICLASVFPRLFTISE 114

Query: 376 SKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWSWKLD 555
            +   + DMG      W W+L WRR+ F WE  + ++  + L + SP     DSW W  +
Sbjct: 115 QQNHFIQDMGSWTDSSWVWQLQWRRERFEWEIQLEQQLMQQLLECSPRAEQVDSWWWLGE 174

Query: 556 PSNTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWRLFLD 732
           PS T           AY  +  EV + S+     + A  ++W    P  ++IF+WR+   
Sbjct: 175 PSGT------YTVRSAYSAITSEVVVGSN-----NGAPNIIWSIPAPPKVKIFAWRMMSR 223

Query: 733 RLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870
            LPT + L  R+I  S +   CVFC +++E+  HLF +C+    +W
Sbjct: 224 GLPTVDNLASRSITISDNDALCVFCKQDIETDYHLFCTCSVVDKVW 269


>GAU29496.1 hypothetical protein TSUD_360410 [Trifolium subterraneum]
          Length = 1301

 Score =  183 bits (465), Expect = 1e-48
 Identities = 100/290 (34%), Positives = 152/290 (52%), Gaps = 3/290 (1%)
 Frame = +1

Query: 10   CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQIR- 186
            C  + KGGLGV+D++L N SLL+K RWRL      +W+E+++ +YG+ I     +  +R 
Sbjct: 945  CKDKAKGGLGVRDIRLVNISLLSKWRWRLLQPGRPLWKEVLVAKYGEFILNKVDWSGVRI 1004

Query: 187  -QQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMF 363
                S+W +D+  ++      D WF+ ++ RK+GNG+S  FWS  W G   L  +FPR+F
Sbjct: 1005 PSTASMWWRDISSIDKVVSSKD-WFAESIVRKVGNGNSTSFWSTIWIGDDPLSVVFPRLF 1063

Query: 364  SLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWS 543
            SLS + +  V D GE R   W W   WRRD F WE  ++ +  E L     S   ED W 
Sbjct: 1064 SLSNNNDRMVKDFGEYREGRWIWSFSWRRDLFQWEEDLVAQLRELLDPVVLSLE-EDWWR 1122

Query: 544  WKLDPSNTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWR 720
            W+ + +             +YK L+ E++    L+       G +W +  PS +  FSW+
Sbjct: 1123 WRPETNGV------FSVNSSYKLLVDELESEEVLEEAEITVFGQIWDSPAPSKVIAFSWQ 1176

Query: 721  LFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870
            L  D++PTR+ L  R+++ +     CV C   VES  HLF  C  +  +W
Sbjct: 1177 LLYDQIPTRKNLEARDMVLADMPWECVGCVGNVESSLHLFLHCPSAMLVW 1226


>GAU43110.1 hypothetical protein TSUD_373050 [Trifolium subterraneum]
          Length = 1099

 Score =  182 bits (463), Expect = 2e-48
 Identities = 105/291 (36%), Positives = 148/291 (50%), Gaps = 4/291 (1%)
 Frame = +1

Query: 10   CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQIR- 186
            C  +NKGGLGV+D+++ N SLLAK RWRL      +W+E+++ +YG+ I     +   R 
Sbjct: 715  CRAKNKGGLGVRDVRIVNLSLLAKWRWRLLLPGRPLWKEILVAKYGEHILHRVDWSDYRI 774

Query: 187  -QQGSIWMKDLFLLEHDRGVPDL-WFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRM 360
                S W KD+  +  D+ V D  W    + RK+GNG+S  FWS +W G   L  +FPR+
Sbjct: 775  PSSASKWWKDICSI--DKVVEDKNWLVEEVGRKVGNGNSTSFWSTKWIGDAPLSVIFPRL 832

Query: 361  FSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSW 540
            FSLS  K+  V D  E  GD   W+  WRR+ F WE   L    E L  F  S   +DSW
Sbjct: 833  FSLSNHKDCMVRDFYEDDGDNERWRFSWRRELFQWEVDRLTRLKELLVSFVFSSD-DDSW 891

Query: 541  SWKLDPSNTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQIFSW 717
             W+ DP              AY  L+ E++    L+   +     +W++  PS +  FSW
Sbjct: 892  IWRPDPDGV------FSVKSAYNLLIEELRSGEELEEEAALIFEQIWESPAPSKVIAFSW 945

Query: 718  RLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870
            +L  DR+PTR  L  R ++       CV C   VE+  HLF  C  +  +W
Sbjct: 946  QLLYDRIPTRRNLEVRGLLGLDSPWECVGCVGSVETTTHLFLHCPSALMVW 996


>GAU35675.1 hypothetical protein TSUD_162470 [Trifolium subterraneum]
          Length = 587

 Score =  179 bits (453), Expect = 2e-48
 Identities = 99/288 (34%), Positives = 154/288 (53%), Gaps = 2/288 (0%)
 Frame = +1

Query: 4    KACLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAK--CGTYL 177
            + CLP++KGGLGV+DL+LFN +LL K +WR  TD+ A+W  L+ +RYG    K  C   +
Sbjct: 282  QVCLPKDKGGLGVRDLELFNLALLCKCKWRCITDKHALWNALLQYRYGPLSFKLLCRETI 341

Query: 178  QIRQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPR 357
              R + S+W +D+  +   +G  D WF   +   LGNG+SI FW E+W+G   L++LFP 
Sbjct: 342  VTRPKDSLWWRDVVGVG-GKG-EDCWFPTQVSSVLGNGNSISFWKEKWHGVVPLRELFPL 399

Query: 358  MFSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDS 537
            ++     K+  V ++     ++  W   W R   + E A   +    L   + +  + D 
Sbjct: 400  LYEKEIHKDCVVSELFLPGSNLLNWNREWLRSLSSSELAEKADLEILLVGLTLNSDVADH 459

Query: 538  WSWKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSW 717
            W W   P N+            Y  L      + LD  +  AL  LW+ ++PS + +F W
Sbjct: 460  WRWV--PENSGLFSVKS----VYIFLQSSLELNPLDSDLLYALSKLWKNDIPSKVGVFGW 513

Query: 718  RLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSY 861
            RL LD+LPTR  L+ + I+S+ + +SC+FC  +VE   H+FFS A  +
Sbjct: 514  RLLLDKLPTRAALVSKGILSNSNDVSCIFCSMDVEDSNHIFFSDATKF 561


>KYP44529.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 567

 Score =  178 bits (451), Expect = 3e-48
 Identities = 99/294 (33%), Positives = 149/294 (50%), Gaps = 5/294 (1%)
 Frame = +1

Query: 4    KACLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQI 183
            +  LP+  GGLGVK++  FN +LLAK RW L     ++W  ++  RYG G   C      
Sbjct: 189  RVTLPKKMGGLGVKNIIRFNMALLAKWRWSLFHQNDSLWARVLYSRYGGGTNLCAQSSSR 248

Query: 184  RQQGSIWMKDLFL----LEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLF 351
            R   S+W +DL +    LE D      WF   +   +G+GS ++FW ++W GP CL  LF
Sbjct: 249  RD--SLWWRDLVVVCGGLEQDN-----WFDRKVKWSIGSGSRVRFWLDKWIGPICLASLF 301

Query: 352  PRMFSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIE 531
            PR+F++S  +   + DMG   G  W W+L WRR+ F WE  + ++  + L + +P     
Sbjct: 302  PRLFTISEQQNQFIQDMGYWTGHRWAWQLHWRRERFEWEIPLEQQLMQRLLECNPRARQV 361

Query: 532  DSWSWKLDPSNTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQI 708
            DSW W  +PS T           AY  +  E  + S++      A   +W    P   +I
Sbjct: 362  DSWWWLGEPSGT------YTVRSAYSAITSEADVGSNIG-----APSSVWSIPAPPKAKI 410

Query: 709  FSWRLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870
            F+WR+    LPT + L  R+I+ S +   CVFC  ++E+  HLF +C     +W
Sbjct: 411  FAWRMMSRGLPTVDNLASRSIVLSENDALCVFCKSDIETDYHLFCTCPVVDKVW 464


>GAU44350.1 hypothetical protein TSUD_129240 [Trifolium subterraneum]
          Length = 388

 Score =  174 bits (440), Expect = 4e-48
 Identities = 100/291 (34%), Positives = 143/291 (49%), Gaps = 4/291 (1%)
 Frame = +1

Query: 10  CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQIR- 186
           C  +  GGLGV+D+K+ N SLLAK RWRL    + +W+++++ +YG+ I     +  IR 
Sbjct: 28  CKAKRMGGLGVRDIKIVNLSLLAKWRWRLLLPGNPLWKQVLVAKYGNHILNRVIWSDIRI 87

Query: 187 -QQGSIWMKDLFLLEHDRGVPDL-WFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRM 360
               S W KD+  L  D+ V    W   ++ RK+GNG S  FWS  W G   L ++FPR+
Sbjct: 88  PSLASKWWKDVCSL--DKVVESKNWLGESIVRKVGNGFSTYFWSSNWIGEAPLLEVFPRL 145

Query: 361 FSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSW 540
           +SLS  K+  V D     G  W W   WRR+ F WE  ++    E +    P    ED W
Sbjct: 146 YSLSIHKDSMVRDFYVQEGGGWRWSFSWRRNLFQWEEDLVTRLREMVEPV-PLSLEEDYW 204

Query: 541 SWKLDPSNTXXXXXXXXXXXAYKTL-LEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSW 717
            W  DP              AY  L  E+++   L+  V+     +W +  PS +  FSW
Sbjct: 205 VWSPDPEG------KFSVKSAYNFLGDELRVGEDLEEEVALVFDNIWGSPAPSKVIAFSW 258

Query: 718 RLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870
           +L  DR+P+R  L  R ++       CV C   VES  HLF  C  +  +W
Sbjct: 259 QLLYDRIPSRRNLEARGLLCLDMPWECVGCVGSVESTTHLFLHCPSAMKVW 309


>KYP44023.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1034

 Score =  181 bits (460), Expect = 5e-48
 Identities = 99/294 (33%), Positives = 151/294 (51%), Gaps = 5/294 (1%)
 Frame = +1

Query: 4    KACLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQI 183
            +  LP+  GGLGVK++  FN +LLAK RW L     ++W  ++  RYG G   C      
Sbjct: 653  RVTLPKKMGGLGVKNITRFNMALLAKWRWSLFHQNDSLWARVLYSRYGGGTNLCAQSSSR 712

Query: 184  RQQGSIWMKDLFL----LEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLF 351
            R   SIW +DL +    LE D      WF   +  ++G+G+  +FW + W GP CL  +F
Sbjct: 713  RD--SIWWRDLLMVCGGLEQDN-----WFERKIKWRIGSGARARFWLDNWTGPICLASVF 765

Query: 352  PRMFSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIE 531
            PR+F++S  +   + DMG      W W+L WRR+ F WE  + ++  + L + SP     
Sbjct: 766  PRLFTISEQQNHFIQDMGSWTDSSWVWQLQWRRERFEWEIQLEQQLMQQLLECSPRAEQV 825

Query: 532  DSWSWKLDPSNTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQI 708
            DSW W  +PS T           AY  +  EV + S+     + A  ++W    P  ++I
Sbjct: 826  DSWWWLGEPSGT------YTVRSAYSAITSEVVVGSN-----NGAPNIIWSIPAPPKVKI 874

Query: 709  FSWRLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870
            F+WR+    LPT + L  R+I  S +   CVFC +++E+  HLF +C+    +W
Sbjct: 875  FAWRMMSRGLPTVDNLASRSITISDNDALCVFCKQDIETDYHLFCTCSVVDKVW 928


Top