BLASTX nr result

ID: Astragalus22_contig00019377 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00019377
         (1587 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX72331.1| flavonol sulfotransferase-like protein, partial [...   370   e-121
gb|PNX74687.1| flavonol sulfotransferase-like protein, partial [...   364   e-119
gb|PNY17766.1| flavonol sulfotransferase-like protein [Trifolium...   359   e-116
dbj|GAU37803.1| hypothetical protein TSUD_276210, partial [Trifo...   358   e-116
dbj|GAU28547.1| hypothetical protein TSUD_268860 [Trifolium subt...   374   e-115
gb|PNX54381.1| flavonol sulfotransferase-like protein, partial [...   352   e-115
gb|PNX89135.1| flavonol sulfotransferase-like protein, partial [...   352   e-114
gb|PNX93395.1| retrovirus-related Pol polyprotein from transposo...   374   e-114
gb|PNX98106.1| flavonol sulfotransferase-like protein, partial [...   352   e-114
dbj|GAU16526.1| hypothetical protein TSUD_167570 [Trifolium subt...   350   e-113
gb|PNY17451.1| retrovirus-related Pol polyprotein from transposo...   365   e-113
dbj|GAU28022.1| hypothetical protein TSUD_264800 [Trifolium subt...   374   e-112
dbj|GAU16205.1| hypothetical protein TSUD_298370 [Trifolium subt...   363   e-111
dbj|GAU45556.1| hypothetical protein TSUD_27570 [Trifolium subte...   354   e-111
gb|PNY16454.1| flavonol sulfotransferase-like protein [Trifolium...   368   e-110
dbj|GAU38555.1| hypothetical protein TSUD_320270 [Trifolium subt...   353   e-110
gb|PNX78530.1| hypothetical protein L195_g034508, partial [Trifo...   342   e-110
dbj|GAU20516.1| hypothetical protein TSUD_130740 [Trifolium subt...   352   e-110
dbj|GAU30132.1| hypothetical protein TSUD_360250 [Trifolium subt...   354   e-108
dbj|GAU23578.1| hypothetical protein TSUD_385660 [Trifolium subt...   350   e-107

>gb|PNX72331.1| flavonol sulfotransferase-like protein, partial [Trifolium pratense]
          Length = 380

 Score =  370 bits (951), Expect = e-121
 Identities = 185/351 (52%), Positives = 246/351 (70%), Gaps = 5/351 (1%)
 Frame = -3

Query: 1282 NGSSNKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLHFVD 1103
            +GS NK YQND LNPYF+HP++ NP   +VT  L+G NYH+WSR+M MAL+SKNKLHF++
Sbjct: 16   SGSQNKGYQNDTLNPYFLHPNE-NPSLILVTPLLSGPNYHSWSRSMTMALKSKNKLHFIN 74

Query: 1102 GTL*RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFYQGD 923
            G+L RP D+DRDSLAWD CNTM MSW+N+++  EISQSILWMDTA+++W DLK RFYQGD
Sbjct: 75   GSLPRPLDDDRDSLAWDMCNTMIMSWLNNAVESEISQSILWMDTASEIWHDLKERFYQGD 134

Query: 922  MFRISDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDCDCGTKCNTAIKFKE 743
            +FRISD+QEEIYT KQGD+S+S Y+T++KKL QE +NF   PT +C     C+   K K+
Sbjct: 135  VFRISDIQEEIYTLKQGDNSVSTYFTKMKKLWQELDNFRPIPTSNCV--NNCSAITKMKQ 192

Query: 742  YRERSKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEPKVLAA 563
            YR+  +V+RFLKGLN+QY+AVR+QIMLMDPLP+IGKVYS+L+QQ R+    ++E ++LA 
Sbjct: 193  YRDSDQVIRFLKGLNDQYTAVRSQIMLMDPLPNIGKVYSLLVQQERKTVMPLDESELLAV 252

Query: 562  TXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYKKHGYPT---HL 392
            +                               +C+HCG+T H ID C++K+GYP+   HL
Sbjct: 253  SANNYGGRGSSNRGRGTRGGRSNAGRGKGTK-LCTHCGQTNHIIDNCWEKYGYPSHLRHL 311

Query: 391  QQNSVNNYAVETGNE--DDDENSYHEDQQATTKQAPSFGSLGFTPEQHKAI 245
            Q+N+ NN  V T +E  D++  S H D+     +    G L FTP QHKA+
Sbjct: 312  QRNAANN-CVNTEHEIADEESQSVHYDEDNHDSET---GKLFFTPAQHKAL 358


>gb|PNX74687.1| flavonol sulfotransferase-like protein, partial [Trifolium pratense]
          Length = 393

 Score =  364 bits (935), Expect = e-119
 Identities = 197/394 (50%), Positives = 254/394 (64%), Gaps = 9/394 (2%)
 Frame = -3

Query: 1291 SSGNGSSNKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLH 1112
            +S + S NK YQND LNPYF+HP++ NP  A+VT  L+G NYH+WSRAM MALRSK+K+H
Sbjct: 8    ASSSNSQNKGYQNDTLNPYFLHPNE-NPNLALVTPLLSGPNYHSWSRAMTMALRSKHKMH 66

Query: 1111 FVDGTL*RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFY 932
            F++GTL RP D+DRDSLAWDRCNTM +SW+N+S++PEISQSILW+D+A+++W++LK RFY
Sbjct: 67   FINGTLPRPDDDDRDSLAWDRCNTMLISWLNNSVIPEISQSILWLDSASEIWQELKERFY 126

Query: 931  QGDMFRISDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDC--DCGTKCNTA 758
            QGD+FRISDLQ+EI + KQGD++IS YYT LKKL QE +NF   P   C  +C  +C   
Sbjct: 127  QGDVFRISDLQDEISSLKQGDNTISTYYTALKKLWQELDNFRPIPASHCVHNCTHECAAI 186

Query: 757  IKFKEYRERSKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEP 578
             K K Y+E  +V+RFLKGLNE Y AVR+QIMLMDPLPSI KVYS+L+QQ RQ  + ++E 
Sbjct: 187  AKMKSYKESDQVIRFLKGLNEPYHAVRSQIMLMDPLPSISKVYSLLVQQERQIVTPVDES 246

Query: 577  KVLAAT---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYKKHG 407
            K+L  +                                 K+CSHCG+T H +D C+KK+G
Sbjct: 247  KLLVVSGNNHYAGRGYSTRGRGNRGGRSSGGRGKGPIGNKLCSHCGQTNHVVDNCWKKYG 306

Query: 406  YPTHL----QQNSVNNYAVETGNEDDDENSYHEDQQATTKQAPSFGSLGFTPEQHKAIQT 239
            YP H+    Q  +VNN A    ++DDDEN     +        S  SL  T  QHKA+  
Sbjct: 307  YPPHMQHLQQDGAVNNIA--NXDDDDDENPTVNHEGNNNDHETSKFSL--TAAQHKALTA 362

Query: 238  H*LLITFTIPQSNQVLSVPYQIPERSNPGS*IQE 137
              LL  FT   S+ +  V        N G  IQE
Sbjct: 363  --LLQGFTSMPSHSINHV------TRNTGEFIQE 388


>gb|PNY17766.1| flavonol sulfotransferase-like protein [Trifolium pratense]
          Length = 409

 Score =  359 bits (921), Expect = e-116
 Identities = 181/367 (49%), Positives = 245/367 (66%), Gaps = 11/367 (2%)
 Frame = -3

Query: 1312 MAAEIPKSSGNGS---------SNKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHT 1160
            M+ E   SS NG+          NK YQ D+LNPYFMHP++ NPG  + T  L+G NYH+
Sbjct: 1    MSTESSASSVNGARAHANAQNNQNKGYQTDILNPYFMHPNE-NPGNILATPLLSGPNYHS 59

Query: 1159 WSRAMMMALRSKNKLHFVDGTL*RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILW 980
            WSR++ +ALRSK+KLHF++G L  P D+DRDS+AWDRCN M +SWI++S+ PEISQSILW
Sbjct: 60   WSRSVTVALRSKHKLHFINGALPHPADDDRDSIAWDRCNAMIISWISNSVEPEISQSILW 119

Query: 979  MDTAADVWKDLKNRFYQGDMFRISDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLA 800
            MDTAA++WK+LK+RFYQGD+FRISD+QEEIYT +QGD ++S YYT++KKL QE +NF   
Sbjct: 120  MDTAAEIWKELKDRFYQGDVFRISDIQEEIYTLRQGDCTVSAYYTKMKKLWQELDNF--C 177

Query: 799  PTCDCDCGTKCNTAIKFKEYRERSKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSML 620
            P     C   C    K + YR+  +V+RFLKGLN+Q  AVR+QI+LM+PLP+IGKVYS+L
Sbjct: 178  PIPHTSCNDDCTMLDKMRTYRDSDQVIRFLKGLNDQCVAVRSQILLMEPLPNIGKVYSLL 237

Query: 619  MQQARQANSSIEEPKVLAAT--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGR 446
            +QQ RQ+    +E K+LAA+                                K+CS+CG 
Sbjct: 238  VQQERQSLLVFDESKILAASTNQSSGRGSYSQHGRGERDGRAYGGRGKPKGNKVCSYCGM 297

Query: 445  TAHTIDVCYKKHGYPTHLQQNSVNNYAVETGNEDDDENSYHEDQQATTKQAPSFGSLGFT 266
            T H ID C+KKHGYP H+QQ    N     G+E+D ++  +E++ + + +     +L FT
Sbjct: 298  TNHIIDQCFKKHGYPPHMQQGGAVNQCHNDGDEEDTKSMAYEEETSESDK----DNLYFT 353

Query: 265  PEQHKAI 245
            P+QHKA+
Sbjct: 354  PDQHKAL 360


>dbj|GAU37803.1| hypothetical protein TSUD_276210, partial [Trifolium subterraneum]
          Length = 429

 Score =  358 bits (919), Expect = e-116
 Identities = 173/321 (53%), Positives = 227/321 (70%), Gaps = 3/321 (0%)
 Frame = -3

Query: 1270 NKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLHFVDGTL* 1091
            NK YQN +LNPYFMHP++ NPG  + T  L+G NYH+WSRA+ +ALRSK+K+HF++G+L 
Sbjct: 107  NKGYQNGILNPYFMHPNE-NPGNILATPLLSGPNYHSWSRAVTVALRSKHKIHFINGSLP 165

Query: 1090 RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFYQGDMFRI 911
            RP D DRDS+AWDRCNTM MSW+++S+ PEISQSILWMDTA ++WK+LK RFYQGD+FRI
Sbjct: 166  RPPDGDRDSIAWDRCNTMVMSWLSNSVEPEISQSILWMDTATEIWKELKERFYQGDVFRI 225

Query: 910  SDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDCDCGTKCNTAIKFKEYRER 731
            SD+QEEIYT KQGD+SIS+YYT++KKL QE +NF   P  +  C   C   +K +EYR+ 
Sbjct: 226  SDIQEEIYTLKQGDNSISSYYTKMKKLWQELDNFR--PIPENSCHDACQAVVKMREYRDS 283

Query: 730  SKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEPKVLAAT--X 557
             +V+RFLKGLN+ YSAVR+QIMLM+PLP+IGKVYS+L+QQ RQ     +EPK+LAA+   
Sbjct: 284  DQVIRFLKGLNDNYSAVRSQIMLMEPLPNIGKVYSLLVQQERQFVLFSDEPKILAASGYN 343

Query: 556  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYKKHGYPTHLQQNSV 377
                                         K+C+ CG T H +D C+KKHG+P H+Q   +
Sbjct: 344  GNGRGSSSSRGRGDRGGKPSYGRGRCKGNKVCTFCGMTNHVVDECFKKHGFPPHMQNKGI 403

Query: 376  NNYAVETGNEDDDEN-SYHED 317
             N     G E+D ++ +Y ED
Sbjct: 404  VNNCHSNGAEEDSKSIAYEED 424


>dbj|GAU28547.1| hypothetical protein TSUD_268860 [Trifolium subterraneum]
          Length = 1059

 Score =  374 bits (961), Expect = e-115
 Identities = 192/361 (53%), Positives = 251/361 (69%), Gaps = 5/361 (1%)
 Frame = -3

Query: 1312 MAAEIPKSSGNGSS----NKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAM 1145
            MA +    S NG++    N+ YQND LNPYF+H S++NPG  +VT  L+  NYH+WSRAM
Sbjct: 1    MAEDSVAGSINGTTATTQNRGYQNDALNPYFLH-SNENPGNVLVTPLLSSSNYHSWSRAM 59

Query: 1144 MMALRSKNKLHFVDGTL*RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAA 965
             +ALRSKNKLHF++G+L RP DEDRDS+AWDRCNTM MSW+++++ PEISQSILW+DTA+
Sbjct: 60   TVALRSKNKLHFINGSLPRPLDEDRDSIAWDRCNTMVMSWLSNAVEPEISQSILWIDTAS 119

Query: 964  DVWKDLKNRFYQGDMFRISDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDC 785
            ++WK+LK+RFYQGD+FRISD+QEEIYT KQGD +IS YYT++KKL QE +NF   P  + 
Sbjct: 120  EIWKELKDRFYQGDVFRISDIQEEIYTLKQGDSTISTYYTKMKKLWQELDNFR--PIPNS 177

Query: 784  DCGTKCNTAIKFKEYRERSKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQAR 605
            +C   C    K +EYR+  +V+RFLKGLNEQYS VR+QIMLM+PLP+IGKVYS+L QQ R
Sbjct: 178  NCTANCQAITKMREYRDGDQVIRFLKGLNEQYSHVRSQIMLMEPLPNIGKVYSLLAQQER 237

Query: 604  QANSSIEEPKVLAATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDV 425
            Q    ++E K+LAA+                              K C++CG T H ++ 
Sbjct: 238  QQVIPLDESKILAAS--ASQQANRAQFPRGRGGRTSGGRGRGKNSKYCTYCGMTNHIVED 295

Query: 424  CYKKHGYPTHLQQN-SVNNYAVETGNEDDDENSYHEDQQATTKQAPSFGSLGFTPEQHKA 248
            CYKKHGYP HLQQN +VNN A    NEDD ++  ++++          G L FTP+QHKA
Sbjct: 296  CYKKHGYPPHLQQNGTVNNCA--NFNEDDSKSMAYDEEAGD----HGSGKLLFTPDQHKA 349

Query: 247  I 245
            +
Sbjct: 350  L 350


>gb|PNX54381.1| flavonol sulfotransferase-like protein, partial [Trifolium pratense]
          Length = 330

 Score =  352 bits (904), Expect = e-115
 Identities = 173/318 (54%), Positives = 224/318 (70%), Gaps = 9/318 (2%)
 Frame = -3

Query: 1291 SSGNGSSNKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLH 1112
            +S + S NK YQND LNPYF+HP++ NP  A+VT  L+G NYH+WSRAM MALRSK+K+H
Sbjct: 13   ASSSNSQNKGYQNDTLNPYFLHPNE-NPNLALVTPLLSGPNYHSWSRAMTMALRSKHKMH 71

Query: 1111 FVDGTL*RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFY 932
            F++GTL RP D+DRDSLAWDRCNTM +SW+N+S++PEISQSILW+D+A+++W++LK RFY
Sbjct: 72   FINGTLPRPDDDDRDSLAWDRCNTMLISWLNNSVIPEISQSILWLDSASEIWQELKERFY 131

Query: 931  QGDMFRISDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDC--DCGTKCNTA 758
            QGD+FRISDLQ+EI + KQGD++IS YYT LKKL QE +NF   P   C  +C  +C   
Sbjct: 132  QGDVFRISDLQDEISSLKQGDNTISTYYTALKKLWQELDNFRPIPASHCVHNCTHECAAI 191

Query: 757  IKFKEYRERSKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEP 578
             K K Y+E  +V+RFLKGLNE Y AVR+QIMLMDPLPSI KVYS+L+QQ RQ  + ++E 
Sbjct: 192  AKMKSYKESDQVIRFLKGLNEPYHAVRSQIMLMDPLPSISKVYSLLVQQERQIVTPVDES 251

Query: 577  KVLAAT---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYKKHG 407
            K+L  +                                 K+CSHCG+T H +D C+KK+G
Sbjct: 252  KLLVVSGNNHYAGRGYSTRGRGNRGGRSSGGRGKGPIGNKLCSHCGQTNHVVDNCWKKYG 311

Query: 406  YPTHL----QQNSVNNYA 365
            YP H+    Q  +VNN A
Sbjct: 312  YPPHMQHLQQDGAVNNIA 329


>gb|PNX89135.1| flavonol sulfotransferase-like protein, partial [Trifolium pratense]
          Length = 395

 Score =  352 bits (904), Expect = e-114
 Identities = 173/318 (54%), Positives = 224/318 (70%), Gaps = 9/318 (2%)
 Frame = -3

Query: 1291 SSGNGSSNKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLH 1112
            +S + S NK YQND LNPYF+HP++ NP  A+VT  L+G NYH+WSRAM MALRSK+K+H
Sbjct: 78   ASSSNSQNKGYQNDTLNPYFLHPNE-NPNLALVTPLLSGPNYHSWSRAMTMALRSKHKMH 136

Query: 1111 FVDGTL*RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFY 932
            F++GTL RP D+DRDSLAWDRCNTM +SW+N+S++PEISQSILW+D+A+++W++LK RFY
Sbjct: 137  FINGTLPRPDDDDRDSLAWDRCNTMLISWLNNSVIPEISQSILWLDSASEIWQELKERFY 196

Query: 931  QGDMFRISDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDC--DCGTKCNTA 758
            QGD+FRISDLQ+EI + KQGD++IS YYT LKKL QE +NF   P   C  +C  +C   
Sbjct: 197  QGDVFRISDLQDEISSLKQGDNTISTYYTALKKLWQELDNFRPIPASHCVHNCTHECAAI 256

Query: 757  IKFKEYRERSKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEP 578
             K K Y+E  +V+RFLKGLNE Y AVR+QIMLMDPLPSI KVYS+L+QQ RQ  + ++E 
Sbjct: 257  AKMKSYKESDQVIRFLKGLNEPYHAVRSQIMLMDPLPSISKVYSLLVQQERQIVTPVDES 316

Query: 577  KVLAAT---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYKKHG 407
            K+L  +                                 K+CSHCG+T H +D C+KK+G
Sbjct: 317  KLLVVSGNNHYAGRGYSTRGRGNRGGRSSGGRGKGPIGNKLCSHCGQTNHVVDNCWKKYG 376

Query: 406  YPTHL----QQNSVNNYA 365
            YP H+    Q  +VNN A
Sbjct: 377  YPPHMQHLQQDGAVNNIA 394


>gb|PNX93395.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1198

 Score =  374 bits (961), Expect = e-114
 Identities = 186/351 (52%), Positives = 241/351 (68%), Gaps = 9/351 (2%)
 Frame = -3

Query: 1270 NKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLHFVDGTL* 1091
            NK YQND LNPYFMHP++ NPG  +VT  L+G NYH+WSRAM +ALRSK+KLHF++G L 
Sbjct: 23   NKGYQNDTLNPYFMHPNE-NPGNVLVTPLLSGPNYHSWSRAMTVALRSKHKLHFINGALP 81

Query: 1090 RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFYQGDMFRI 911
            RP D+DRDS+AWDRCNTM MSWI++S+ PEISQSILWMDTA+++WK+LK RFYQGD+FRI
Sbjct: 82   RPHDDDRDSIAWDRCNTMIMSWISNSVDPEISQSILWMDTASEIWKELKERFYQGDVFRI 141

Query: 910  SDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDCDCGTKCNTAIKFKEYRER 731
            SD+QEEIYT KQGD SIS YYT++KKL QE +NF   P  +  C   C   +K KEYR+ 
Sbjct: 142  SDIQEEIYTLKQGDSSISAYYTKMKKLWQELDNFR--PIPELFCLENCQAIVKMKEYRDS 199

Query: 730  SKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEPKVLAA---- 563
             +V+RFLKGLN+QYSAVR+QIMLM+PLP+IGKVYS+L+QQ RQ+    +E K+LAA    
Sbjct: 200  DQVIRFLKGLNDQYSAVRSQIMLMEPLPNIGKVYSLLVQQERQSLLVFDESKLLAANGYS 259

Query: 562  -----TXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYKKHGYPT 398
                                                K+C+HCG T H ID C++KHG+P 
Sbjct: 260  TQGHSNQGYGRGSNSSRGKGNGGKKPSYGRGKGKGNKLCTHCGMTNHIIDDCFQKHGFPP 319

Query: 397  HLQQNSVNNYAVETGNEDDDENSYHEDQQATTKQAPSFGSLGFTPEQHKAI 245
            H+Q+    N+   +G+E+D ++  +++       +     L FTPEQHKA+
Sbjct: 320  HMQRGGAVNHCNNSGSEEDSKSVAYDEDNGDMDTS----KLYFTPEQHKAL 366


>gb|PNX98106.1| flavonol sulfotransferase-like protein, partial [Trifolium pratense]
          Length = 398

 Score =  352 bits (904), Expect = e-114
 Identities = 173/318 (54%), Positives = 224/318 (70%), Gaps = 9/318 (2%)
 Frame = -3

Query: 1291 SSGNGSSNKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLH 1112
            +S + S NK YQND LNPYF+HP++ NP  A+VT  L+G NYH+WSRAM MALRSK+K+H
Sbjct: 81   ASSSNSQNKGYQNDTLNPYFLHPNE-NPNLALVTPLLSGPNYHSWSRAMTMALRSKHKMH 139

Query: 1111 FVDGTL*RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFY 932
            F++GTL RP D+DRDSLAWDRCNTM +SW+N+S++PEISQSILW+D+A+++W++LK RFY
Sbjct: 140  FINGTLPRPDDDDRDSLAWDRCNTMLISWLNNSVIPEISQSILWLDSASEIWQELKERFY 199

Query: 931  QGDMFRISDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDC--DCGTKCNTA 758
            QGD+FRISDLQ+EI + KQGD++IS YYT LKKL QE +NF   P   C  +C  +C   
Sbjct: 200  QGDVFRISDLQDEISSLKQGDNTISTYYTALKKLWQELDNFRPIPASHCVHNCTHECAAI 259

Query: 757  IKFKEYRERSKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEP 578
             K K Y+E  +V+RFLKGLNE Y AVR+QIMLMDPLPSI KVYS+L+QQ RQ  + ++E 
Sbjct: 260  AKMKSYKESDQVIRFLKGLNEPYHAVRSQIMLMDPLPSISKVYSLLVQQERQIVTPVDES 319

Query: 577  KVLAAT---XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYKKHG 407
            K+L  +                                 K+CSHCG+T H +D C+KK+G
Sbjct: 320  KLLVVSGNNHYAGRGYSTRGRGNRGGRSSGGRGKGPIGNKLCSHCGQTNHVVDNCWKKYG 379

Query: 406  YPTHL----QQNSVNNYA 365
            YP H+    Q  +VNN A
Sbjct: 380  YPPHMQHLQQDGAVNNIA 397


>dbj|GAU16526.1| hypothetical protein TSUD_167570 [Trifolium subterraneum]
          Length = 416

 Score =  350 bits (899), Expect = e-113
 Identities = 180/363 (49%), Positives = 237/363 (65%), Gaps = 21/363 (5%)
 Frame = -3

Query: 1270 NKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLHFVDGTL* 1091
            NK YQND LNPYF+HP++ NPG  +VT +L+G NYH+WSRAM+MAL+SKNKL FV+GTL 
Sbjct: 27   NKGYQNDTLNPYFLHPNE-NPGLVLVTPSLSGSNYHSWSRAMVMALKSKNKLRFVNGTLP 85

Query: 1090 RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFYQGDMFRI 911
            RP D+D DSLAWDRCNTM MSWI++++  +ISQS+LWMDTA+++W+DLK RFYQGD+FRI
Sbjct: 86   RPDDDDHDSLAWDRCNTMIMSWISNAVDADISQSVLWMDTASEIWQDLKERFYQGDVFRI 145

Query: 910  SDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDCDCGTKCNTAIKFKEYRER 731
            S++QEEIYT KQGD SI  YYT++KKL QE +NF   P    +C   C    K KEY++ 
Sbjct: 146  SNIQEEIYTLKQGDSSIFAYYTKMKKLWQELDNFR--PIPQSNCVYNCTAIAKMKEYKDS 203

Query: 730  SKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEPKVLA----- 566
             +V+RFLKGLNEQY  VR+QIMLMDPLP+I KVYS+L+QQ RQA   ++E K+LA     
Sbjct: 204  DQVIRFLKGLNEQYYVVRSQIMLMDPLPTISKVYSLLVQQERQAIIPLDESKLLAVNGYN 263

Query: 565  ---------ATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYKK 413
                      +                              + C+HCG+T H ID C+KK
Sbjct: 264  SYAGRGDGYGSHAGRGQSYRGRGSKGGGRHNAGRGNPGKGNRYCTHCGQTNHIIDDCWKK 323

Query: 412  HGYPTHLQ-----QNSVNNYAVETGN--EDDDENSYHEDQQATTKQAPSFGSLGFTPEQH 254
            +GYP H+Q       +VN+     GN  +DD+  + + D++    +    G +  TP QH
Sbjct: 324  YGYPPHMQHLQNKHGAVNSCTHANGNNGDDDETQTVNCDEENVDSET---GKMYLTPAQH 380

Query: 253  KAI 245
            KA+
Sbjct: 381  KAL 383


>gb|PNY17451.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1425

 Score =  365 bits (936), Expect(2) = e-113
 Identities = 184/357 (51%), Positives = 237/357 (66%), Gaps = 8/357 (2%)
 Frame = -3

Query: 1291 SSGNGSSNKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLH 1112
            +S + S NK YQND LNPYF+HP++ NP  A+VT  L+G NYH+WSRAM MALRSKNK+H
Sbjct: 13   ASSSNSQNKGYQNDTLNPYFLHPNE-NPNLALVTPLLSGPNYHSWSRAMTMALRSKNKMH 71

Query: 1111 FVDGTL*RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFY 932
            F++GTL RP D+DRDSLAWDRCNTM +SW+N+S+ PEISQSILW+D+A+++W++LK RFY
Sbjct: 72   FINGTLPRPDDDDRDSLAWDRCNTMLLSWLNNSVSPEISQSILWLDSASEIWQELKERFY 131

Query: 931  QGDMFRISDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDC--DCGTKCNTA 758
            QGD+FRISDLQ+EI + KQGD +IS YYT LKKL QE +NF   P   C  +C   C   
Sbjct: 132  QGDVFRISDLQDEISSLKQGDSTISTYYTSLKKLWQELDNFRPIPDSHCVHNCVHGCAAI 191

Query: 757  IKFKEYRERSKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEP 578
             K K Y+E  +V+RFLKGLNEQY  VR+QIMLMDPLP+IGKVYS+L+QQ RQ  + ++E 
Sbjct: 192  AKMKSYKESDQVIRFLKGLNEQYHVVRSQIMLMDPLPTIGKVYSLLVQQERQLATPVDES 251

Query: 577  KVLAAT--XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYKKHGY 404
            K+LA +                                K+CSHCG+T H +D C+ K+GY
Sbjct: 252  KLLAVSGNNHYAGRGHSTRGRGTRGGRSYGNGGRGKGNKLCSHCGQTNHVVDNCWIKYGY 311

Query: 403  PTHLQ----QNSVNNYAVETGNEDDDENSYHEDQQATTKQAPSFGSLGFTPEQHKAI 245
            P H+Q      ++NN A    ++DDDE      +          G   FT  QHKA+
Sbjct: 312  PPHMQHLQHDRAINNCANVDDDDDDDETPTANCEDGNNDH--KTGKFSFTAAQHKAL 366



 Score = 73.6 bits (179), Expect(2) = e-113
 Identities = 31/73 (42%), Positives = 48/73 (65%)
 Frame = -2

Query: 221 IHYTTVKSGIICALPNSRKIQSWIIDTGATIHVCYSKRLFKCIRKIRPVTVTLPNGTQIV 42
           I++ T  +GI+C +P S     +I+DTGAT H+C+  + F+C+++I P+ + LPNGT + 
Sbjct: 381 INHVTTNTGILCTIPLSNNSDQFILDTGATDHICFDLKYFQCLKQIPPINLKLPNGTLVN 440

Query: 41  CDLAGTIYLDDYL 3
             LAGTI  D  L
Sbjct: 441 TCLAGTIMFDHQL 453


>dbj|GAU28022.1| hypothetical protein TSUD_264800 [Trifolium subterraneum]
          Length = 1614

 Score =  374 bits (959), Expect = e-112
 Identities = 189/372 (50%), Positives = 252/372 (67%), Gaps = 16/372 (4%)
 Frame = -3

Query: 1312 MAAEIPKSSGNGS------SNKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSR 1151
            M AE   +S NG+      SNK YQ D+L+PYFMHPS+ NPG  + T  L+G NYH+WSR
Sbjct: 1    MEAEGSATSANGAIPPPIPSNKNYQTDILSPYFMHPSE-NPGNVLATPLLSGPNYHSWSR 59

Query: 1150 AMMMALRSKNKLHFVDGTL*RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDT 971
            A+ ++LRSK+KLHF+ G L RP DEDRDS+AWDRCNTM +SWI++S+ PEISQSILWMDT
Sbjct: 60   AVTVSLRSKHKLHFITGALPRPPDEDRDSIAWDRCNTMIISWISNSVEPEISQSILWMDT 119

Query: 970  AADVWKDLKNRFYQGDMFRISDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTC 791
            AA++WK+LK+RFYQGD+FRISDLQEEIYT +QGD +IS YYTR+KKL QE +NF   P+ 
Sbjct: 120  AAEIWKELKDRFYQGDVFRISDLQEEIYTLRQGDSTISTYYTRMKKLWQELDNFRPIPS- 178

Query: 790  DCDCGTKCNTAIKFKEYRERSKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQ 611
               C   C    K + YR+  +V+RFLKGLN+QY+AVR+QIMLM+PLP+IGKVYS+L+QQ
Sbjct: 179  -SSCSDNCQALEKMRNYRDSDQVIRFLKGLNDQYAAVRSQIMLMEPLPNIGKVYSLLVQQ 237

Query: 610  ARQANSSIEEPKVLAAT----------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKIC 461
             RQ+   +E+ K+LAA+                                        K+C
Sbjct: 238  ERQSLLVLEDSKLLAASNSNSNFAPSFSRGSSSQSSHRGRGARSNGGRGRGKPSTSNKVC 297

Query: 460  SHCGRTAHTIDVCYKKHGYPTHLQQNSVNNYAVETGNEDDDENSYHEDQQATTKQAPSFG 281
            ++CG T H +D C+KK+G+P H+QQ    N     G+E+D+++  +ED  + + +    G
Sbjct: 298  TYCGMTNHIVDQCFKKYGFPPHMQQGGTVNNCHTNGDEEDNKSITYEDDNSESNK----G 353

Query: 280  SLGFTPEQHKAI 245
            +L FTPEQHKA+
Sbjct: 354  NLYFTPEQHKAL 365


>dbj|GAU16205.1| hypothetical protein TSUD_298370 [Trifolium subterraneum]
          Length = 1029

 Score =  363 bits (932), Expect = e-111
 Identities = 181/345 (52%), Positives = 237/345 (68%), Gaps = 3/345 (0%)
 Frame = -3

Query: 1270 NKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLHFVDGTL* 1091
            NK YQND+LNPYFMHP++ NPG  + T  L+G NYH+WSRA+ +ALRSK+KLHF++ +L 
Sbjct: 24   NKGYQNDILNPYFMHPNE-NPGNVLATPLLSGPNYHSWSRAVTVALRSKHKLHFINDSLP 82

Query: 1090 RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFYQGDMFRI 911
            RP DEDRDS+AWDRCNTM MSW+++S+ PEISQSILWMDT +++WK+LK+RFYQGD+FRI
Sbjct: 83   RPPDEDRDSIAWDRCNTMVMSWLSNSVDPEISQSILWMDTTSEIWKELKDRFYQGDVFRI 142

Query: 910  SDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDCDCGTKCNTAIKFKEYRER 731
            SD+QEEIYT KQGD+SIS YYT++KKL QE +NF   P  +  C   C T +K +EY++ 
Sbjct: 143  SDIQEEIYTLKQGDNSISTYYTKMKKLWQELDNFR--PIPENSCHDNCQTIVKMREYKDS 200

Query: 730  SKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEPKVLAAT--X 557
             +V+ FLKGLN+ YSAVR+QIMLM+PLP+IGKVYS+L+QQ RQ+   ++E K+  A+   
Sbjct: 201  DQVICFLKGLNDNYSAVRSQIMLMEPLPNIGKVYSLLVQQERQSLLLVDESKISTASGYP 260

Query: 556  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYKKHGYPTHLQQ-NS 380
                                         K+C++CG T H +D C+KK G+P H+QQ  +
Sbjct: 261  SYGRGSSSSRGRGDRGGNPSNGRGRGKGNKVCTYCGMTNHIVDECFKKLGFPPHMQQRGT 320

Query: 379  VNNYAVETGNEDDDENSYHEDQQATTKQAPSFGSLGFTPEQHKAI 245
            VNN    T  ED     Y ED           G L FTP+QHKA+
Sbjct: 321  VNNCHSNTIEEDSKSIVYEEDNGDM-----DTGKLSFTPDQHKAL 360


>dbj|GAU45556.1| hypothetical protein TSUD_27570 [Trifolium subterraneum]
          Length = 677

 Score =  354 bits (908), Expect = e-111
 Identities = 186/372 (50%), Positives = 243/372 (65%), Gaps = 16/372 (4%)
 Frame = -3

Query: 1312 MAAEIPKSSGNGS---------SNKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHT 1160
            MA E   SS +GS         +NK YQND LNPYFMHP++ NP   +VT  LNG NYH 
Sbjct: 1    MAGESSASSIHGSVHGGFFTNNANKGYQNDTLNPYFMHPNE-NPALVLVTPLLNGTNYHF 59

Query: 1159 WSRAMMMALRSKNKLHFVDGTL*RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILW 980
            WS++M++A+RSKNK H ++G+L RP D+DRD +AWDRCNTM MSW+ +S+ PEI+QS++W
Sbjct: 60   WSQSMIVAIRSKNKRHCINGSLPRPLDDDRDPMAWDRCNTMVMSWLTNSVDPEIAQSVIW 119

Query: 979  MDTAADVWKDLKNRFYQGDMFRISDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLA 800
            MD AAD+WK+LK+RFYQGD+F ISD+QEEI   KQGD +IS+YYT+LK+L QE +NF   
Sbjct: 120  MDVAADIWKELKDRFYQGDVFSISDIQEEICNLKQGDSTISSYYTKLKQLWQELDNFRPI 179

Query: 799  PTCDCDCGTKCNTAIKFKEYRERSKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSML 620
            P+  CDC   C    K + Y + ++V+RFLKGLNEQYS V++QIMLMDPLP I KVYS+L
Sbjct: 180  PS--CDCVVTCQAISKIRSYIDGNQVIRFLKGLNEQYSHVKSQIMLMDPLPPISKVYSLL 237

Query: 619  MQQARQANSSIEEPKVLAATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTA 440
            +QQ RQ    ++E K+LA +                              K+C++CG T 
Sbjct: 238  VQQERQTVIPMDESKLLAMS---NNQNYAMSHNQNYARGSSSNRGRGKNSKVCTYCGMTN 294

Query: 439  HTIDVCYKKHGYPTHLQQNS-------VNNYAVETGNEDDDENSYHEDQQATTKQAPSFG 281
            H ID C+KK+GYP H  QN        VNN  V  GN+ D ++  H  QQ       +FG
Sbjct: 295  HVIDNCFKKYGYPPHWHQNGNDNGNAVVNN--VVNGNDKDIQSEAHGYQQ----NEQNFG 348

Query: 280  SLGFTPEQHKAI 245
            SL FTP+QH+A+
Sbjct: 349  SLMFTPDQHQAL 360


>gb|PNY16454.1| flavonol sulfotransferase-like protein [Trifolium pratense]
          Length = 1475

 Score =  368 bits (945), Expect = e-110
 Identities = 187/363 (51%), Positives = 244/363 (67%), Gaps = 14/363 (3%)
 Frame = -3

Query: 1291 SSGNGSSNKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLH 1112
            +S +   NK YQ D L+PYFMH S++NPG  +VT  L+G NYH+WSRAM +ALRSK+KLH
Sbjct: 16   NSQHTHQNKGYQFDTLSPYFMH-SNENPGNVLVTPLLSGSNYHSWSRAMTVALRSKHKLH 74

Query: 1111 FVDGTL*RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFY 932
            F++G+L RP D+DRDS+AWDRCNTM MSWI++S+ PEISQSILWMDTA+++WK+LK+RFY
Sbjct: 75   FINGSLPRPDDDDRDSIAWDRCNTMIMSWISNSVDPEISQSILWMDTASEIWKELKDRFY 134

Query: 931  QGDMFRISDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDCDCGTKCNTAIK 752
            QGD+FRISD+QEEIYT KQGD SIS YYT++KKL QE +NF   P  +  C   C    K
Sbjct: 135  QGDVFRISDIQEEIYTLKQGDSSISTYYTKMKKLWQELDNFR--PIPETLCVDNCPAIAK 192

Query: 751  FKEYRERSKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEPKV 572
             KEYR+  +V+RFLKGLN+QY AVR+QIMLM+PLP+IGKVYS+L+QQ RQA   I+E K+
Sbjct: 193  MKEYRDSDQVIRFLKGLNDQYGAVRSQIMLMEPLPNIGKVYSLLVQQERQALLVIDESKI 252

Query: 571  LAAT------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTID 428
            LAA                                           K+C+HCG T H +D
Sbjct: 253  LAANGYQSQGNNNSSYGKGSNSSNFAKGKNSGSKKPAYGRGKGKGNKLCTHCGMTNHIVD 312

Query: 427  VCYKKHGYPTHLQQNSVNNYAVETGNEDDDEN--SYHEDQQATTKQAPSFGSLGFTPEQH 254
             C++KHG+P H+Q+    N     G+E+D ++  +Y ED +         G + FTP+QH
Sbjct: 313  DCFQKHGFPPHMQRRGAVNNCHTNGDEEDSKSMAAYEEDNEDLNS-----GKMYFTPDQH 367

Query: 253  KAI 245
            KA+
Sbjct: 368  KAL 370


>dbj|GAU38555.1| hypothetical protein TSUD_320270 [Trifolium subterraneum]
          Length = 760

 Score =  353 bits (907), Expect = e-110
 Identities = 183/360 (50%), Positives = 234/360 (65%), Gaps = 18/360 (5%)
 Frame = -3

Query: 1270 NKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLHFVDGTL* 1091
            NK YQND LNPYF+HP++ NPG  +VT  L+G NYH+WSRAM MAL+SKNKL FV+G+L 
Sbjct: 27   NKGYQNDTLNPYFLHPNE-NPGLILVTPPLSGTNYHSWSRAMTMALKSKNKLRFVNGSLP 85

Query: 1090 RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFYQGDMFRI 911
            RP DED DSLAWDRCNTM MSWI++++  +ISQS+LWMDTA+++W+DLK RFYQGD+FRI
Sbjct: 86   RPDDEDHDSLAWDRCNTMIMSWISNAVDADISQSVLWMDTASEIWQDLKERFYQGDVFRI 145

Query: 910  SDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDCDCGTKCNTAIKFKEYRER 731
            SD+QEEIYT KQGD+SIS YYT++KKL QE +NF   P    +C   C    K KEY++ 
Sbjct: 146  SDIQEEIYTLKQGDNSISAYYTKMKKLWQELDNFR--PIPQSNCVYNCAAIAKMKEYKDS 203

Query: 730  SKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEPKVLAA---- 563
             +V+RFLKGLNEQY AVR+QIMLMDPLP+I KVYS+L+QQ RQA   ++E K+LAA    
Sbjct: 204  DQVIRFLKGLNEQYYAVRSQIMLMDPLPTISKVYSLLVQQERQAIVPVDESKLLAANGYG 263

Query: 562  ----------TXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYKK 413
                      +                              + C+HCG+T H ID C+KK
Sbjct: 264  NSTGRGDSYNSYAGRSQSNRGRGYRGGGRQSTGRGNHGKGNRYCTHCGQTNHVIDDCWKK 323

Query: 412  HGYPTHLQQNSVNNYAVETGNEDDDENSYHEDQQATTKQAPSF----GSLGFTPEQHKAI 245
            +GYP H+Q     + AV +    +  N   ED Q  T +  +     G +  TP Q KA+
Sbjct: 324  YGYPPHMQHLQNKHGAVNSCTHANANNGDDEDTQTVTYEEENVDSEAGKMLLTPAQQKAL 383


>gb|PNX78530.1| hypothetical protein L195_g034508, partial [Trifolium pratense]
          Length = 404

 Score =  342 bits (877), Expect = e-110
 Identities = 179/350 (51%), Positives = 228/350 (65%), Gaps = 6/350 (1%)
 Frame = -3

Query: 1276 SSNKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLHFVDGT 1097
            S NK YQ D LNPYF+HP++ NPG  +VT  L+G NYH+WSR+M MAL+SKNKL FV+GT
Sbjct: 16   SHNKGYQTDTLNPYFLHPNE-NPGLVLVTPLLSGSNYHSWSRSMTMALKSKNKLRFVNGT 74

Query: 1096 L*RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFYQGDMF 917
            L RP DED DS+AWDRCNTM MSW+ +S+  +I+QS++WMDTA+++W DLK+RFYQGD+F
Sbjct: 75   LPRPDDEDHDSIAWDRCNTMIMSWLTNSVDADIAQSVIWMDTASEIWLDLKDRFYQGDIF 134

Query: 916  RISDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDCDCGTKCNTAIKFKEYR 737
            RISD+QEEIYT KQGD SIS Y+T++KKL QE +NF   P    +C   C    K KEY+
Sbjct: 135  RISDIQEEIYTLKQGDSSISAYFTKMKKLWQELDNFRPVPA--SNCVNDCIAMAKLKEYK 192

Query: 736  ERSKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEPKVLAAT- 560
            +  +V+RFLKGLNEQY AVR+QIMLMDPLP I KVYS+L+QQ RQ    ++E K+LA   
Sbjct: 193  DCDQVIRFLKGLNEQYHAVRSQIMLMDPLPKIAKVYSLLVQQERQIVIPLDESKLLAING 252

Query: 559  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYKKHGYPTHLQ--Q 386
                                           +CSHCG+T H I  C++K+G+P H+Q  Q
Sbjct: 253  SNSYAGRGQSSRGRGYRGGGRSNGGRGKGKMLCSHCGQTNHVIYNCWRKYGFPPHMQHLQ 312

Query: 385  NSVNNYAVETG---NEDDDENSYHEDQQATTKQAPSFGSLGFTPEQHKAI 245
               NN AV      N DDDE+     ++ T       G L  T  Q KA+
Sbjct: 313  QKENNGAVNNCTNINGDDDESHTVTCEEETVDS--EAGKLLLTSAQQKAL 360


>dbj|GAU20516.1| hypothetical protein TSUD_130740 [Trifolium subterraneum]
          Length = 736

 Score =  352 bits (904), Expect = e-110
 Identities = 182/360 (50%), Positives = 234/360 (65%), Gaps = 18/360 (5%)
 Frame = -3

Query: 1270 NKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLHFVDGTL* 1091
            NK YQND LNPYF+HP++ NPG  +VT +L+G NYH+WSRAM MAL+SKNKL FV+G+L 
Sbjct: 27   NKGYQNDTLNPYFLHPNE-NPGLILVTPSLSGTNYHSWSRAMTMALKSKNKLRFVNGSLP 85

Query: 1090 RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFYQGDMFRI 911
            RP DED DSLAWDRCNTM MSWI++++  +ISQS+LWMDTA+++W+DLK RFYQGD+FRI
Sbjct: 86   RPVDEDHDSLAWDRCNTMIMSWISNAVDADISQSVLWMDTASEIWQDLKERFYQGDVFRI 145

Query: 910  SDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDCDCGTKCNTAIKFKEYRER 731
            SD+QEEIYT KQGD+SIS YYT++KKL QE +NF   P    +C   C    K KEY++ 
Sbjct: 146  SDIQEEIYTLKQGDNSISAYYTKMKKLWQELDNFR--PIPQSNCVYNCAAIAKMKEYKDS 203

Query: 730  SKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEPKVLAA---- 563
             +V+RFLKGLNEQY AVR+QIMLMDPLP+I KVYS+L+QQ RQA   ++E K+LA     
Sbjct: 204  DQVIRFLKGLNEQYYAVRSQIMLMDPLPTISKVYSLLVQQERQAIVPVDESKLLAVNGYG 263

Query: 562  ----------TXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYKK 413
                      +                              + C+HCG+T H ID C+KK
Sbjct: 264  NSTGRGDSYNSYAGRSQSNRGRGYRGGSRQSTGRGNPGKGNRYCTHCGQTNHVIDDCWKK 323

Query: 412  HGYPTHLQQNSVNNYAVETGNEDDDENSYHEDQQATTKQAPSF----GSLGFTPEQHKAI 245
            +GYP H+Q     + AV +    +  N   ED Q  T +  +     G +  TP Q KA+
Sbjct: 324  YGYPPHMQHLQHKHGAVNSCTHANANNGDDEDTQTVTYEEENVDSEAGKMLLTPAQQKAL 383


>dbj|GAU30132.1| hypothetical protein TSUD_360250 [Trifolium subterraneum]
          Length = 906

 Score =  354 bits (908), Expect = e-108
 Identities = 178/367 (48%), Positives = 236/367 (64%), Gaps = 11/367 (2%)
 Frame = -3

Query: 1312 MAAEIPKSSGNGSSNKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMAL 1133
            M   +  +  +GS NK YQND LNPYF+HP++ NP   +V+  L+G NYH+WSRAM MAL
Sbjct: 3    MDGSVNGAGSSGSQNKGYQNDTLNPYFLHPNE-NPNLVLVSTLLSGANYHSWSRAMTMAL 61

Query: 1132 RSKNKLHFVDGTL*RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWK 953
            +SKNK+HF+DG L RP D+DRDSLAWDRCNTM MSW+N+++ PEISQSILWMD+A ++W 
Sbjct: 62   KSKNKIHFIDGALPRPNDDDRDSLAWDRCNTMLMSWLNNAVEPEISQSILWMDSALEIWL 121

Query: 952  DLKNRFYQGDMFRISDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDCDCGT 773
            DLK RFYQGD+FRISD+QEEI++ KQGD +IS Y+T++KKL QE +NF   PT +C    
Sbjct: 122  DLKERFYQGDVFRISDIQEEIFSLKQGDSTISTYFTKMKKLWQELDNFRPIPTTNCVVNC 181

Query: 772  KCNTAIKFKEYRERSKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANS 593
                  K K YR+  +V+RFLKGLNEQYS VR+QIML+DPLPSI KVYS+L+QQ RQ   
Sbjct: 182  DSVVIAKMKSYRDSDQVIRFLKGLNEQYSVVRSQIMLIDPLPSITKVYSLLVQQERQLIL 241

Query: 592  SIEEPKVLAAT-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYK 416
             I+E K+LA +                               K+C+HCG+T H ++ C++
Sbjct: 242  PIDESKLLAVSRQTYYAGRGSSTRDRGSARGGRFNGGRGKGNKLCTHCGQTNHVVENCWR 301

Query: 415  KHGYPTHLQQ----------NSVNNYAVETGNEDDDENSYHEDQQATTKQAPSFGSLGFT 266
            K+G P H+Q           N   N +V  G +D+D    + D+     +    G   F 
Sbjct: 302  KYGVPPHMQHLLGNGAANQGNGAANNSVNIGGDDEDIPIANYDEGNNDNEQ---GKFFFI 358

Query: 265  PEQHKAI 245
             +QHKA+
Sbjct: 359  ADQHKAL 365


>dbj|GAU23578.1| hypothetical protein TSUD_385660 [Trifolium subterraneum]
          Length = 851

 Score =  350 bits (897), Expect = e-107
 Identities = 183/360 (50%), Positives = 233/360 (64%), Gaps = 18/360 (5%)
 Frame = -3

Query: 1270 NKAYQNDMLNPYFMHPSDDNPGTAIVTLALNGDNYHTWSRAMMMALRSKNKLHFVDGTL* 1091
            NK YQND LNPYF+HP+  NPG  +VT +L+G NYH+WSRAM +AL+SKNKL FV+GTL 
Sbjct: 27   NKGYQNDTLNPYFLHPNK-NPGLILVTPSLSGTNYHSWSRAMTIALKSKNKLRFVNGTLP 85

Query: 1090 RPKDEDRDSLAWDRCNTMTMSWINSSIVPEISQSILWMDTAADVWKDLKNRFYQGDMFRI 911
            RP DED DSLAWDRCNTM MSWI++++  +ISQS+LWMDTA ++W+DLK RFYQGD+FRI
Sbjct: 86   RPDDEDHDSLAWDRCNTMIMSWISNAVDADISQSVLWMDTAPEIWQDLKERFYQGDVFRI 145

Query: 910  SDLQEEIYTTKQGDDSISNYYTRLKKL*QEFENFHLAPTCDCDCGTKCNTAIKFKEYRER 731
            SD+QEEIYT KQGD+SIS YYT++KKL QE +NF   P    +C   C    K KEY++ 
Sbjct: 146  SDIQEEIYTLKQGDNSISAYYTKMKKLWQELDNFR--PIPQSNCVHNCAVIAKMKEYKDF 203

Query: 730  SKVLRFLKGLNEQYSAVRAQIMLMDPLPSIGKVYSMLMQQARQANSSIEEPKVLA----- 566
            ++V+RFLKGLNEQY AVR+QIMLMDPLP+I KVYS+L+QQ RQA   I+E K+LA     
Sbjct: 204  NQVIRFLKGLNEQYYAVRSQIMLMDPLPTISKVYSLLVQQERQAIVPIDESKLLAVNGYG 263

Query: 565  ---------ATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKICSHCGRTAHTIDVCYKK 413
                     ++                              + C+HCG T H ID C+KK
Sbjct: 264  ASAGRGDSYSSYAGRGQSNRGRGSRGGCRHSTGRGNHGKGNRYCTHCGLTNHIIDDCWKK 323

Query: 412  HGYPTHLQQNSVNNYAVETGNEDDDENSYHEDQQATTKQAPSF----GSLGFTPEQHKAI 245
            +GYP H+Q     + AV +    +  N   ED Q  T +  +     G +  TP Q KA+
Sbjct: 324  YGYPPHMQHLQNKHGAVNSCTHANANNGDDEDTQTVTYEEENVDSEAGKMLLTPAQQKAL 383


Top