BLASTX nr result

ID: Astragalus23_contig00016007 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00016007
         (920 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU45069.1| hypothetical protein TSUD_243350 [Trifolium subt...   201   1e-58
gb|PNX59756.1| retrovirus-related Pol polyprotein from transposo...   199   2e-58
dbj|GAU49830.1| hypothetical protein TSUD_293850 [Trifolium subt...   198   4e-57
gb|KYP51705.1| hypothetical protein KK1_026473 [Cajanus cajan]        194   8e-57
dbj|GAU13166.1| hypothetical protein TSUD_179090 [Trifolium subt...   196   8e-57
gb|PNY13856.1| hypothetical protein L195_g010524 [Trifolium prat...   197   4e-56
gb|KYP35344.1| hypothetical protein KK1_043625 [Cajanus cajan]        191   2e-55
dbj|GAU41486.1| hypothetical protein TSUD_239620 [Trifolium subt...   201   2e-55
dbj|GAU10619.1| hypothetical protein TSUD_418290 [Trifolium subt...   191   7e-55
gb|PNX54609.1| hypothetical protein L195_g048229, partial [Trifo...   188   2e-54
gb|PNX76620.1| hypothetical protein L195_g032574 [Trifolium prat...   191   2e-54
dbj|GAU45259.1| hypothetical protein TSUD_291430 [Trifolium subt...   191   2e-54
dbj|GAU51775.1| hypothetical protein TSUD_415620 [Trifolium subt...   200   3e-54
gb|KHN08218.1| hypothetical protein glysoja_017696, partial [Gly...   186   4e-54
gb|PNY05212.1| flavonol sulfotransferase-like protein [Trifolium...   191   5e-54
gb|PNX80244.1| hypothetical protein L195_g036241 [Trifolium prat...   189   5e-54
gb|PNX93130.1| retrovirus-related Pol polyprotein from transposo...   189   6e-54
dbj|GAU31202.1| hypothetical protein TSUD_210590 [Trifolium subt...   199   9e-54
dbj|GAU43924.1| hypothetical protein TSUD_28660 [Trifolium subte...   197   1e-53
dbj|GAU41868.1| hypothetical protein TSUD_366150 [Trifolium subt...   196   2e-53

>dbj|GAU45069.1| hypothetical protein TSUD_243350 [Trifolium subterraneum]
          Length = 358

 Score =  201 bits (511), Expect = 1e-58
 Identities = 101/224 (45%), Positives = 145/224 (64%), Gaps = 7/224 (3%)
 Frame = -2

Query: 919 ERFDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCE 740
           ++FDPSF AW RCN L  SW++NSVS SIAQSLI+ E+A ++W DL+ER+S   L+R+ E
Sbjct: 69  DQFDPSFRAWNRCNMLVHSWILNSVSESIAQSLIFMENAIDVWNDLRERFSQSDLVRISE 128

Query: 739 LLQQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCD---AADRVMKAYKNEDDVF 569
           L Q++ + KQ S +VTE+++ LK++W EL+   PIP CSC      + + +A  N   ++
Sbjct: 129 LQQEIYAMKQESRTVTEFYSDLKLLWEELEIYLPIPNCSCRIRCTCEAMRQARTNHTLLY 188

Query: 568 --RFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNL--CDKRKEIDKGNVNS 401
             RFL GLNE FS+V+  ILLMD LPP+++VFS+ LQHER  N    D+ K +     + 
Sbjct: 189 IVRFLTGLNEHFSMVKSQILLMDPLPPMNKVFSLVLQHERQGNFYPTDESKALLNAAKSK 248

Query: 400 SAKAKQCTFCGRVGHRVETCYRKNGFPSR*KKGVSVSTINNTAI 269
               + CT+CG+  H VE C++K G P   +     ST+NN A+
Sbjct: 249 GFPPRICTYCGKDNHIVENCFKKYGLPPHLRP--KPSTVNNAAL 290


>gb|PNX59756.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 309

 Score =  199 bits (505), Expect = 2e-58
 Identities = 97/223 (43%), Positives = 143/223 (64%), Gaps = 13/223 (5%)
 Frame = -2

Query: 913 FDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCELL 734
           FDP++ AW RCN+L  SW++NS+SPSIAQS+++ E+A +IW DL+ER+S   LIR+ EL 
Sbjct: 25  FDPAYRAWHRCNQLISSWILNSISPSIAQSVVFMENAIDIWNDLRERFSQGDLIRISELQ 84

Query: 733 QQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCD---AADRVMKAYKNE--DDVF 569
           Q++ S KQ++ SVT++F++LK +W EL+   PIP C+C    A + +  A KN       
Sbjct: 85  QEIYSLKQDNRSVTDFFSELKTLWEELELYLPIPSCTCRQRCACEAMRSARKNHLLLHTV 144

Query: 568 RFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNLCD--------KRKEIDKG 413
           RFL GLNE FS VR  IL+M+ LPP+++VFS+ +QHER  N  +           +  K 
Sbjct: 145 RFLTGLNENFSTVRSQILIMEPLPPINKVFSLVIQHERQGNFAEVDDSKILVNAAKSAKP 204

Query: 412 NVNSSAKAKQCTFCGRVGHRVETCYRKNGFPSR*KKGVSVSTI 284
           + +S +  + C++CG+  H VE C++KNG P   KK  S   +
Sbjct: 205 SSSSKSSTRNCSYCGKDNHVVENCFKKNGVPPHMKKFSSAHNV 247


>dbj|GAU49830.1| hypothetical protein TSUD_293850 [Trifolium subterraneum]
          Length = 410

 Score =  198 bits (504), Expect = 4e-57
 Identities = 104/229 (45%), Positives = 146/229 (63%), Gaps = 12/229 (5%)
 Frame = -2

Query: 919 ERFDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCE 740
           ++FDPSF AW RCN L  SW++NSVS SIAQS+++ E+A ++W DLKER+S   LIR+ E
Sbjct: 69  DQFDPSFRAWNRCNMLVHSWIMNSVSESIAQSIVFMENAIDVWNDLKERFSQADLIRIAE 128

Query: 739 LLQQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCD-----AADRVMKAYKNEDD 575
           L Q+L++ +Q+S SVTE+++ LK+IW EL+   P+P CSC       A R  +A      
Sbjct: 129 LQQELHALQQDSRSVTEFYSDLKLIWEELEIYLPMPNCSCRNRCTCEAMRSARANHALLY 188

Query: 574 VFRFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNLC---DKRKEID----K 416
           + RFL GLNE F+VV+  ILLMD LPP+++VFS+ LQH+R +N     D +  ++    K
Sbjct: 189 IIRFLTGLNEHFAVVKSQILLMDPLPPMNKVFSLVLQHQRQSNFSPSEDSKALLNAAKSK 248

Query: 415 GNVNSSAKAKQCTFCGRVGHRVETCYRKNGFPSR*KKGVSVSTINNTAI 269
           G+  S    + CTFCG+  H V  C++K G P   +K    S  NN  I
Sbjct: 249 GSFPSKNPVRICTFCGKDNHIVANCFKKYGLPPHFRKN---SQANNAEI 294


>gb|KYP51705.1| hypothetical protein KK1_026473 [Cajanus cajan]
          Length = 278

 Score =  194 bits (492), Expect = 8e-57
 Identities = 100/231 (43%), Positives = 141/231 (61%), Gaps = 19/231 (8%)
 Frame = -2

Query: 913 FDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCELL 734
           FDPS+ +W RCN +  SW++NSV  SI QS+I+ E+  ++W DLKER+S   LIR+ EL 
Sbjct: 37  FDPSYKSWNRCNMIIHSWIVNSVVKSIGQSIIFLENVVDVWNDLKERFSQGDLIRISELQ 96

Query: 733 QQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCD-----AADRVMKAYKNEDDVF 569
           Q++   KQ S  VTE++++LK++W EL+   PIP C+C       A R  + +   +   
Sbjct: 97  QEIYGIKQGSLFVTEFYSELKILWEELETYMPIPCCACPVKCTCVAMRNARQFHTLNHFI 156

Query: 568 RFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNLC--DKRKEI--------- 422
           RFL GLNE FSVV+  ILLMD +P ++++F M +QHER  N    D+ K +         
Sbjct: 157 RFLTGLNENFSVVKSQILLMDLVPSMNQIFYMVIQHERQGNFIVNDESKALINAIDYKRS 216

Query: 421 ---DKGNVNSSAKAKQCTFCGRVGHRVETCYRKNGFPSR*KKGVSVSTINN 278
               KG   +S   K CT+ G+ GH +ETCYRK+GFP   +KG S S +NN
Sbjct: 217 QGRGKGFAQNSGPKKICTYYGKTGHTIETCYRKHGFPPHFQKGNS-SMVNN 266


>dbj|GAU13166.1| hypothetical protein TSUD_179090 [Trifolium subterraneum]
          Length = 353

 Score =  196 bits (498), Expect = 8e-57
 Identities = 102/230 (44%), Positives = 145/230 (63%), Gaps = 13/230 (5%)
 Frame = -2

Query: 919 ERFDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCE 740
           + FDPSF +W RCN    SW++NSVS SIAQS+++ E+A  +W DLKER++   L+R+ E
Sbjct: 72  DHFDPSFRSWNRCNMPVHSWIMNSVSESIAQSIVFMENALAVWNDLKERFAQSDLVRIAE 131

Query: 739 LLQQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIP------ECSCDAADRVMKAYKNED 578
           L Q++ + KQ+S +VTE+++ LK++W EL+   P+P       CSC+A     +A     
Sbjct: 132 LQQEIYALKQDSRTVTEFYSDLKLLWEELEIYLPMPNCSSRVRCSCEAMHSA-RANHTLL 190

Query: 577 DVFRFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNL--CDKRKEI-----D 419
           ++ RFL GLN+ FSVV+  +LLMD LPPL++VFSM LQHE   N    D+ K +      
Sbjct: 191 NIVRFLIGLNDHFSVVKSQVLLMDPLPPLNKVFSMVLQHEGQGNFYPTDESKALLNAAKS 250

Query: 418 KGNVNSSAKAKQCTFCGRVGHRVETCYRKNGFPSR*KKGVSVSTINNTAI 269
           KG+ N  +  + CTFCG+  H VE C++K G P   KK    ST +N AI
Sbjct: 251 KGHFNPKSTVRICTFCGKDNHIVENCFKKYGIPPHMKKN---STAHNAAI 297


>gb|PNY13856.1| hypothetical protein L195_g010524 [Trifolium pratense]
          Length = 448

 Score =  197 bits (500), Expect = 4e-56
 Identities = 104/229 (45%), Positives = 149/229 (65%), Gaps = 12/229 (5%)
 Frame = -2

Query: 919 ERFDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCE 740
           ++FDPSF AW RCN L  SW++NSVS SIAQS+++ E+A ++W DLKER+S   LIR+ E
Sbjct: 69  DQFDPSFCAWNRCNMLVHSWIMNSVSESIAQSIVFIENAIDVWNDLKERFSQADLIRIAE 128

Query: 739 LLQQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCD-----AADRVMKAYKNEDD 575
           L Q+L++ KQ+S++V E+++ LK+IW EL+   P+P CSC       A R  +A      
Sbjct: 129 LQQELHALKQDSHTVNEFYSDLKLIWEELEIYLPMPNCSCRNCCTCEAMRSARANHTLLY 188

Query: 574 VFRFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNLC---DKRKEID----K 416
           V  FL GLNE FSVV+  ILLMD LPP+ +V S+ LQHER ++     D R  ++    +
Sbjct: 189 VICFLTGLNEHFSVVKSQILLMDPLPPMTKVVSLVLQHERQSHFSTSDDSRVLLNAAKSR 248

Query: 415 GNVNSSAKAKQCTFCGRVGHRVETCYRKNGFPSR*KKGVSVSTINNTAI 269
           G+ +S +  + CTFCG+  H V+ C++K+G P   +K    S +NN AI
Sbjct: 249 GSSSSRSGNRVCTFCGKDNHIVDNCFKKHGLPPHFRKN---SQVNNAAI 294


>gb|KYP35344.1| hypothetical protein KK1_043625 [Cajanus cajan]
          Length = 287

 Score =  191 bits (484), Expect = 2e-55
 Identities = 96/210 (45%), Positives = 131/210 (62%), Gaps = 19/210 (9%)
 Frame = -2

Query: 913 FDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCELL 734
           FDPS+ +W RCN +  SW++NSV  SI QS+I+ E+A ++W DLKER+S   L R+ EL 
Sbjct: 78  FDPSYKSWNRCNMIIHSWIVNSVVESIGQSIIFLENAVDVWNDLKERFSQGDLTRISELQ 137

Query: 733 QQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCD-----AADRVMKAYKNEDDVF 569
           Q++   KQ S SVTE++++LK++W EL+   PIP C+C      AA R  + +   + V 
Sbjct: 138 QEIYGLKQGSLSVTEFYSELKILWEELETYMPIPSCACPVKCTCAAMRNARQFHTLNHVI 197

Query: 568 RFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNLC--DKRKEI--------- 422
           RFL GLNE FSVV+  ILLMD LP ++R+FSM +QHER  N    D+ K +         
Sbjct: 198 RFLTGLNENFSVVKSQILLMDPLPSMNRIFSMVIQHERQGNFIFNDESKALINAVDYKRS 257

Query: 421 ---DKGNVNSSAKAKQCTFCGRVGHRVETC 341
               KG   +S   K CT+CG+ GH VETC
Sbjct: 258 QGRGKGFAQNSGPKKICTYCGKTGHTVETC 287


>dbj|GAU41486.1| hypothetical protein TSUD_239620 [Trifolium subterraneum]
          Length = 794

 Score =  201 bits (512), Expect = 2e-55
 Identities = 103/226 (45%), Positives = 147/226 (65%), Gaps = 9/226 (3%)
 Frame = -2

Query: 919 ERFDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCE 740
           ++FDPSF AW RCN L  SW++NSVS SIAQSLI+ E+A ++W DL+ER+S   L+R+ E
Sbjct: 27  DQFDPSFRAWNRCNMLVHSWILNSVSESIAQSLIFMENAIDVWNDLRERFSQSDLVRISE 86

Query: 739 LLQQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCD---AADRVMKAYKNEDDVF 569
           L Q++ + KQ S +VTE+++ LK++W EL+   PIP CSC      + + +A  N   ++
Sbjct: 87  LQQEIYAMKQESRTVTEFYSDLKLLWEELEIYLPIPNCSCRIRCTCEAMRQARTNHTLLY 146

Query: 568 --RFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNL--CDKRKEIDKGNVNS 401
             RFL GLNE FS+V+ HILLMD LPP+++VFS+ LQHER  N    D+ K +     + 
Sbjct: 147 IVRFLTGLNEHFSMVKSHILLMDPLPPMNKVFSLVLQHERQGNFYPTDESKALLNAAKSK 206

Query: 400 S--AKAKQCTFCGRVGHRVETCYRKNGFPSR*KKGVSVSTINNTAI 269
               K + CT+CG+  H VE C++K G P   +     ST+NN A+
Sbjct: 207 GFPPKNRICTYCGKDNHIVENCFKKYGLPPHLRP--KPSTVNNAAL 250


>dbj|GAU10619.1| hypothetical protein TSUD_418290 [Trifolium subterraneum]
          Length = 368

 Score =  191 bits (486), Expect = 7e-55
 Identities = 96/210 (45%), Positives = 140/210 (66%), Gaps = 10/210 (4%)
 Frame = -2

Query: 919 ERFDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCE 740
           + FDPSF AW RCN+L  SW++NSVS SIAQS+++ E+A +IW DL+ER+S   LIR+ E
Sbjct: 69  DAFDPSFCAWNRCNQLVSSWILNSVSESIAQSVVFLENAIDIWNDLRERFSQGNLIRISE 128

Query: 739 LLQQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSC--DAADRVMKAYKNEDDV-- 572
           L Q++ S +Q+  SV+E++++LK +W EL+   PIP+C+C        M++ +N   +  
Sbjct: 129 LQQEIYSLRQDHRSVSEFYSELKQLWEELELYLPIPQCTCRNRCTCEAMRSARNNHTLMH 188

Query: 571 -FRFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNLC--DKRK---EIDKGN 410
             RFL GLN++FSVV+  ILL+D LP +++VFSM LQHER +NL   D  K      +  
Sbjct: 189 TIRFLTGLNDQFSVVKSQILLIDPLPQMNKVFSMVLQHERRSNLASLDDSKFLVHAARTG 248

Query: 409 VNSSAKAKQCTFCGRVGHRVETCYRKNGFP 320
             +S   + C+FCG+  H VE C++KNG P
Sbjct: 249 KQASGAKRVCSFCGKDNHVVENCFKKNGTP 278


>gb|PNX54609.1| hypothetical protein L195_g048229, partial [Trifolium pratense]
          Length = 287

 Score =  188 bits (477), Expect = 2e-54
 Identities = 93/216 (43%), Positives = 135/216 (62%), Gaps = 13/216 (6%)
 Frame = -2

Query: 913 FDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCELL 734
           FDP++ AW RCN+L  SW++NSVSPSIAQS+++ E+A +IW DL ER+S   LIR+ EL 
Sbjct: 69  FDPAYCAWHRCNQLISSWILNSVSPSIAQSVVFMENAIDIWNDLHERFSQGDLIRISELQ 128

Query: 733 QQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCD---AADRVMKAYKNED--DVF 569
           Q++   KQ++ SVT++++ LK +W EL+   PIP C+C      + +  A +N       
Sbjct: 129 QEIYVLKQDNRSVTDFYSDLKTLWEELELYLPIPSCTCRQRCTCEAMRSARRNHTLLHTV 188

Query: 568 RFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNLCD--------KRKEIDKG 413
           RFL GLNE FS VR  IL+M+ LPP+++V S+ +QHE   N  +           +  K 
Sbjct: 189 RFLTGLNENFSTVRSQILIMEPLPPINKVLSLVIQHECQGNFSEVDDSKILVNAAKYTKS 248

Query: 412 NVNSSAKAKQCTFCGRVGHRVETCYRKNGFPSR*KK 305
           + NS A +  C++CG+  H VE C++KNG P   KK
Sbjct: 249 SSNSKASSHNCSYCGKDKHVVENCFKKNGVPPHMKK 284


>gb|PNX76620.1| hypothetical protein L195_g032574 [Trifolium pratense]
          Length = 398

 Score =  191 bits (485), Expect = 2e-54
 Identities = 97/225 (43%), Positives = 136/225 (60%), Gaps = 22/225 (9%)
 Frame = -2

Query: 913 FDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCELL 734
           FDPS  AW RCN L  SW++NSVS SIAQS+++ E+A ++W DLKER+S   L+R+ EL 
Sbjct: 74  FDPSLRAWNRCNMLVHSWILNSVSESIAQSIVFMENAIDVWNDLKERFSQGDLVRIAELQ 133

Query: 733 QQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCDA---ADRVMKAYKNED--DVF 569
           Q++ S +Q+S SVTE+F+ LK++W EL+   PIP C+C      D + +A  N     V 
Sbjct: 134 QEIYSLRQDSRSVTEFFSALKILWEELELYLPIPTCTCRVKCNCDAMRRARANHQLMYVI 193

Query: 568 RFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNLCDKRKE---IDKGNV--- 407
           RFL GLN+ F +V+  ILL+D LP L+++FSM +QHER  N          I+  N    
Sbjct: 194 RFLTGLNDHFDMVKSQILLLDPLPSLNKIFSMVIQHERQGNFTPSEHSKALINAANFRPP 253

Query: 406 -----------NSSAKAKQCTFCGRVGHRVETCYRKNGFPSR*KK 305
                      NSS   + CTFCG+  H ++ CY+K+G P   +K
Sbjct: 254 GSTSSSKNSRSNSSTGKRVCTFCGKDNHIIDNCYQKHGLPPHLQK 298


>dbj|GAU45259.1| hypothetical protein TSUD_291430 [Trifolium subterraneum]
          Length = 387

 Score =  191 bits (484), Expect = 2e-54
 Identities = 103/237 (43%), Positives = 143/237 (60%), Gaps = 22/237 (9%)
 Frame = -2

Query: 913 FDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCELL 734
           FDPS  AW RCN L  S ++NSVS SIAQS+++ E+  ++W DLKE++S   L+R+ EL 
Sbjct: 74  FDPSLRAWNRCNMLVHSLILNSVSESIAQSIVFMENVIDVWNDLKEQFSQGDLVRIAELQ 133

Query: 733 QQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSC--DAADRVMKAYKNEDD---VF 569
           Q++ S +Q S SVTE+F+ LK++W EL+   PIP C+C        M++ +N  +   V 
Sbjct: 134 QEIYSLRQESRSVTEFFSALKILWEELELYLPIPMCTCRVKCNCEAMRSARNNHNLMYVI 193

Query: 568 RFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNLC---DKRKEIDKGNVN-- 404
           RFL GLNE F VV+  ILLMD LP L+++FSM +QHER  N     D +  I+  N N  
Sbjct: 194 RFLTGLNEHFDVVKSQILLMDPLPTLNKIFSMVIQHERQGNFTPSEDSQALINAANSNSK 253

Query: 403 ------------SSAKAKQCTFCGRVGHRVETCYRKNGFPSR*KKGVSVSTINNTAI 269
                       SS+  + CTFCG+  H V+ CY+K+G P   +K V  S  +N AI
Sbjct: 254 GYGSKNPKSSYASSSVKRVCTFCGKDNHIVDNCYKKHGLPPHLQKRVQ-SQAHNAAI 309


>dbj|GAU51775.1| hypothetical protein TSUD_415620 [Trifolium subterraneum]
          Length = 1234

 Score =  200 bits (509), Expect = 3e-54
 Identities = 104/227 (45%), Positives = 145/227 (63%), Gaps = 12/227 (5%)
 Frame = -2

Query: 913 FDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCELL 734
           FDPSF AW RCN L  SW++NSVS SIAQS+++ E+A ++W DLKER++   L+R+ EL 
Sbjct: 71  FDPSFRAWNRCNMLVHSWIMNSVSESIAQSIVFMENALDVWHDLKERFAQADLVRISELQ 130

Query: 733 QQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCD-----AADRVMKAYKNEDDVF 569
           Q L + KQ S SVTE+++ LK++W EL+   P+P CSC       A R  +A  N   + 
Sbjct: 131 QDLYALKQESRSVTEFYSDLKLLWEELEIYLPMPNCSCRIRCSCEAMRAARANHNLLYIM 190

Query: 568 RFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNLC---DKRKEID----KGN 410
           RFL GLNE F++V+  ILLMD LPP+++VFS+ LQHER  N     D +  ++    K +
Sbjct: 191 RFLTGLNENFAMVKSQILLMDPLPPMNKVFSLVLQHERQGNFTSAEDSKISLNATQSKFS 250

Query: 409 VNSSAKAKQCTFCGRVGHRVETCYRKNGFPSR*KKGVSVSTINNTAI 269
             S +  K CTFCG+  H V  C++K+G P   +KG   S+ NN A+
Sbjct: 251 GYSRSGTKVCTFCGKDNHVVANCFKKHGLPPHFRKG---SSSNNAAV 294


>gb|KHN08218.1| hypothetical protein glysoja_017696, partial [Glycine soja]
          Length = 243

 Score =  186 bits (471), Expect = 4e-54
 Identities = 93/216 (43%), Positives = 133/216 (61%), Gaps = 19/216 (8%)
 Frame = -2

Query: 910 DPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCELLQ 731
           +P ++AW+RCN L  SWL+NS+SPSIAQS+I+ ESA +IW DL ER+S   L+R+ EL +
Sbjct: 25  NPDYSAWERCNTLIMSWLLNSISPSIAQSVIFLESAVDIWTDLHERFSQGDLLRIAELQE 84

Query: 730 QLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCDAADRVMKAYKNEDDVFRFLDGL 551
           ++    Q + SV+E++T LK +W ELD  RP+  CSCDA     K Y  +D + RFL  L
Sbjct: 85  EIYGLSQANRSVSEFYTALKALWEELDNYRPLAACSCDA-----KIYHQQDFIIRFLKRL 139

Query: 550 NEEFSVVRIHILLMDTLPPLHRVFSMALQHERL-----------NNLCD--------KRK 428
           +E FSVVR  ILLMD LP ++R+FSM +QHER            N+  +        K +
Sbjct: 140 DERFSVVRSQILLMDPLPAVNRIFSMVIQHERQLQIPPTTADDPNSFVNAAGGPSSVKSR 199

Query: 427 EIDKGNVNSSAKAKQCTFCGRVGHRVETCYRKNGFP 320
              +   N ++  K+C +C R GH +  C+ K+G+P
Sbjct: 200 GGSRSQPNKTSSNKRCAYCHRSGHTINECWGKHGYP 235


>gb|PNY05212.1| flavonol sulfotransferase-like protein [Trifolium pratense]
          Length = 417

 Score =  191 bits (484), Expect = 5e-54
 Identities = 95/217 (43%), Positives = 137/217 (63%), Gaps = 20/217 (9%)
 Frame = -2

Query: 913 FDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCELL 734
           FDPS  AW RCN L  SW++NSVS SIAQS+++ E+A ++W DLK R+S   L+R+ EL 
Sbjct: 146 FDPSLRAWSRCNMLVHSWILNSVSESIAQSIMFMENAIDVWNDLKGRFSQGDLVRISELQ 205

Query: 733 QQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCD---AADRVMKAYKNED--DVF 569
           Q++ S +Q S SVTE+F+ LKV+W E +   PIP C+C    + + +  A+ N +   V 
Sbjct: 206 QEIYSLRQESRSVTEFFSALKVLWEEFEIYLPIPMCTCRVKCSCEAMRSAHNNHNLMYVI 265

Query: 568 RFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNLC---DKRKEID------- 419
           RFL GLN+ F VV+  IL+MD LPPL+++FSM +QHER  N     D +  I+       
Sbjct: 266 RFLTGLNDHFDVVKSQILIMDPLPPLYKIFSMLIQHERQGNFAPSEDSKALINAANSKTS 325

Query: 418 -----KGNVNSSAKAKQCTFCGRVGHRVETCYRKNGF 323
                K +  SS+  + CTFCG+  H ++ CY+K+G+
Sbjct: 326 GSKNFKSSYGSSSVKRVCTFCGKDNHIIDNCYKKHGY 362


>gb|PNX80244.1| hypothetical protein L195_g036241 [Trifolium pratense]
          Length = 362

 Score =  189 bits (480), Expect = 5e-54
 Identities = 106/246 (43%), Positives = 145/246 (58%), Gaps = 30/246 (12%)
 Frame = -2

Query: 913 FDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCELL 734
           FDP+F AW RCN+L  +W+++SVSPSIAQS+++ E+A +IW DL+ER+S   LIR+ EL 
Sbjct: 70  FDPAFRAWHRCNQLVSAWILSSVSPSIAQSVVFMENAIDIWNDLRERFSQGDLIRISELQ 129

Query: 733 QQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCD---AADRVMKAYKNED--DVF 569
           Q+  + KQ+S SVT+++T LKVIW EL+   PIP C+C      + +  A +N       
Sbjct: 130 QEAYALKQDSKSVTDFYTDLKVIWEELELYLPIPSCTCPRRCTCEAMRSARRNHSLLHTI 189

Query: 568 RFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHER------------LNNLCDKRKE 425
           RFL GLN  FS V+  IL+MD LPP+++VFS+ LQHER            L N       
Sbjct: 190 RFLTGLNANFSTVKSQILIMDPLPPINKVFSLVLQHERQGISHESDDSTILVNAARSTPS 249

Query: 424 ID--KGNVNSSAKAK---QCTFCGRVGHRVETCYRKNGFPSR*KKGVSVS--------TI 284
               K +  SS+ +K   +CT+CG   H VE C++KNG P   KK  S +        T 
Sbjct: 250 SSGYKQSTQSSSGSKPPRKCTYCGMNNHFVENCFKKNGVPPHMKKFASANNAASEEGITN 309

Query: 283 NNTAIS 266
           NN A S
Sbjct: 310 NNAATS 315


>gb|PNX93130.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 369

 Score =  189 bits (480), Expect = 6e-54
 Identities = 106/246 (43%), Positives = 145/246 (58%), Gaps = 30/246 (12%)
 Frame = -2

Query: 913 FDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCELL 734
           FDP+F AW RCN+L  +W+++SVSPSIAQS+++ E+A +IW DL+ER+S   LIR+ EL 
Sbjct: 70  FDPAFRAWHRCNQLVSAWILSSVSPSIAQSVVFMENAIDIWNDLRERFSQGDLIRISELQ 129

Query: 733 QQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCD---AADRVMKAYKNED--DVF 569
           Q+  + KQ+S SVT+++T LKVIW EL+   PIP C+C      + +  A +N       
Sbjct: 130 QEAYALKQDSKSVTDFYTDLKVIWEELELYLPIPSCTCPRRCTCEAMRSARRNHSLLHTI 189

Query: 568 RFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHER------------LNNLCDKRKE 425
           RFL GLN  FS V+  IL+MD LPP+++VFS+ LQHER            L N       
Sbjct: 190 RFLTGLNANFSTVKSQILIMDPLPPINKVFSLVLQHERQGISHESDDSTILVNAARSTPS 249

Query: 424 ID--KGNVNSSAKAK---QCTFCGRVGHRVETCYRKNGFPSR*KKGVSVS--------TI 284
               K +  SS+ +K   +CT+CG   H VE C++KNG P   KK  S +        T 
Sbjct: 250 SSGYKQSTQSSSGSKPPRKCTYCGMNNHFVENCFKKNGVPPHMKKFASANNAASEEGITN 309

Query: 283 NNTAIS 266
           NN A S
Sbjct: 310 NNAATS 315


>dbj|GAU31202.1| hypothetical protein TSUD_210590 [Trifolium subterraneum]
          Length = 1059

 Score =  199 bits (505), Expect = 9e-54
 Identities = 122/335 (36%), Positives = 176/335 (52%), Gaps = 37/335 (11%)
 Frame = -2

Query: 913  FDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCELL 734
            FDPSF AW RCN L  SW++NSVS SIAQSL++ E+A ++W DL+ER++   L+R+ EL 
Sbjct: 73   FDPSFRAWNRCNSLVHSWILNSVSDSIAQSLVFLENAIDVWTDLRERFAQADLVRIAELQ 132

Query: 733  QQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSC--DAADRVMKAYKNEDDV---F 569
            Q+++S KQ S +VTE+++ LK++W EL+   PIP CSC        M++ +N   +    
Sbjct: 133  QEIHSLKQESRTVTEFYSDLKLLWEELEIYIPIPNCSCRNRCTCEAMRSARNNHTMLYAI 192

Query: 568  RFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNL--CDKRKEI----DKGNV 407
            RFL GLNE F++V+  ILLMD LP ++RVFS+ LQHER  N    D+ K +         
Sbjct: 193  RFLTGLNEHFAMVKSQILLMDPLPTMNRVFSLVLQHERQGNFSPSDESKALLNAAKSRGF 252

Query: 406  NSSAKAKQCTFCGRVGHRVETCYRKNGFPSR*KKGVSVSTINNTAI--SL*GPYRLSAKG 233
             S   A+ CT+CG+  H ++ C++K+G P   +K    S +NN AI   +      S+ G
Sbjct: 253  PSKTPARVCTYCGKDNHMIDNCFKKHGPPPHFRKS---SQLNNAAIEGGIDEAIASSSTG 309

Query: 232  *VLEESKIFSSACC*YSRRYFLCAFLHELFQS----------YGKVRTT*YLSMN----- 98
             V+ + +              L   L   F S           G    T + S+N     
Sbjct: 310  PVMTQDQALQ-----------LITLLQNSFPSQDSDKAASNQVGSTAFTDHTSVNQGLYY 358

Query: 97   ------DVSNVVCN---HSVFPELVIWHFRFEHFA 20
                  DV     N   H+  P+  IWHFR  H +
Sbjct: 359  LKLANKDVHIHNANGTTHTTIPDQAIWHFRLGHIS 393


>dbj|GAU43924.1| hypothetical protein TSUD_28660 [Trifolium subterraneum]
          Length = 915

 Score =  197 bits (502), Expect = 1e-53
 Identities = 102/226 (45%), Positives = 146/226 (64%), Gaps = 9/226 (3%)
 Frame = -2

Query: 919 ERFDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCE 740
           ++FDPSF AW RCN L  SW++NSVS SIAQSLI+ E+A ++W DL+ER+S   L+R+ E
Sbjct: 69  DQFDPSFRAWNRCNMLVHSWILNSVSESIAQSLIFMENAIDVWNDLRERFSQSDLVRISE 128

Query: 739 LLQQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCD---AADRVMKAYKNEDDVF 569
           L Q++ + KQ S +VTE+++ LK++W EL+   PIP CSC      + + +A  N   ++
Sbjct: 129 LQQEIYAMKQESRTVTEFYSDLKLLWEELEIYLPIPNCSCRIRCTCEAMRQARTNHTLLY 188

Query: 568 --RFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNL--CDKRKEIDKGNVNS 401
             RFL GLNE FS+V+  ILLMD LPP+++VFS+ LQHER  N    D+ K +     + 
Sbjct: 189 IVRFLTGLNEHFSMVKSQILLMDPLPPMNKVFSLVLQHERQGNFYPTDESKALLNAAKSK 248

Query: 400 S--AKAKQCTFCGRVGHRVETCYRKNGFPSR*KKGVSVSTINNTAI 269
               K + CT+CG+  H VE C++K G P   +     ST+NN A+
Sbjct: 249 GFPPKNRICTYCGKDNHIVENCFKKYGLPPYLRP--KPSTVNNAAL 292


>dbj|GAU41868.1| hypothetical protein TSUD_366150 [Trifolium subterraneum]
          Length = 792

 Score =  196 bits (498), Expect = 2e-53
 Identities = 101/229 (44%), Positives = 145/229 (63%), Gaps = 12/229 (5%)
 Frame = -2

Query: 919 ERFDPSFNAWKRCNKLARSWLINSVSPSIAQSLIYTESAAEIWKDLKERYSGRKLIRVCE 740
           + FDPSF +W RCN L   W++NSVS SIAQS+++ E+A ++W DLKE+++   L+R+ E
Sbjct: 72  DHFDPSFRSWNRCNMLIHYWIMNSVSESIAQSIVFMENALDVWNDLKEQFAQSDLVRIAE 131

Query: 739 LLQQLNSTKQNSNSVTEYFTKLKVIWGELDYIRPIPECSCDA-----ADRVMKAYKNEDD 575
           L Q++ + KQ+S +VTE+++ LK++  EL+   P+P CSC       A R  +A     +
Sbjct: 132 LQQEIYALKQDSRTVTEFYSDLKLLCEELEIYLPMPNCSCRVRCSCEAMRSARANHTLLN 191

Query: 574 VFRFLDGLNEEFSVVRIHILLMDTLPPLHRVFSMALQHERLNNL--CDKRKEI-----DK 416
           + RFL GLN+ FSVV+  +LLMD LPPL++VFSM LQHER  N    D+ K +      K
Sbjct: 192 IVRFLTGLNDHFSVVKSQVLLMDPLPPLNKVFSMVLQHERQGNFYPSDESKALLNAANSK 251

Query: 415 GNVNSSAKAKQCTFCGRVGHRVETCYRKNGFPSR*KKGVSVSTINNTAI 269
           G+ N  +  + CT CG+  H VE C++K G P   KK    ST +N AI
Sbjct: 252 GHFNPKSTVRICTLCGKDNHIVENCFKKYGIPPHMKKN---STAHNAAI 297


Top