BLASTX nr result

ID: Astragalus23_contig00016652 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00016652
         (1006 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_012567983.1| PREDICTED: uncharacterized protein LOC101515...   216   2e-59
ref|XP_004491986.1| PREDICTED: uncharacterized protein LOC101494...   191   9e-55
gb|PNY08424.1| putative retrotransposon Ty3-gypsy subclass prote...   194   8e-52
gb|PNX86789.1| cellular nucleic acid-binding protein [Trifolium ...   182   5e-51
dbj|GAU46228.1| hypothetical protein TSUD_374740 [Trifolium subt...   191   1e-50
ref|XP_012571115.1| PREDICTED: uncharacterized protein LOC105852...   177   8e-50
ref|XP_015957327.1| uncharacterized protein LOC107481562 [Arachi...   177   1e-48
dbj|GAU12300.1| hypothetical protein TSUD_142090 [Trifolium subt...   174   8e-48
dbj|GAU10094.1| hypothetical protein TSUD_419930, partial [Trifo...   173   6e-47
dbj|GAU13379.1| hypothetical protein TSUD_126750 [Trifolium subt...   170   2e-46
dbj|GAU21844.1| hypothetical protein TSUD_176900 [Trifolium subt...   174   3e-46
dbj|GAU10552.1| hypothetical protein TSUD_420900, partial [Trifo...   171   6e-46
dbj|GAU29982.1| hypothetical protein TSUD_360970 [Trifolium subt...   177   1e-45
dbj|GAU47915.1| hypothetical protein TSUD_404680 [Trifolium subt...   169   1e-45
dbj|GAU10412.1| hypothetical protein TSUD_419120, partial [Trifo...   173   1e-45
dbj|GAU10180.1| hypothetical protein TSUD_418650, partial [Trifo...   169   1e-45
dbj|GAU48322.1| hypothetical protein TSUD_187580 [Trifolium subt...   170   2e-45
dbj|GAU49923.1| hypothetical protein TSUD_180410 [Trifolium subt...   174   5e-45
dbj|GAU10177.1| hypothetical protein TSUD_418640, partial [Trifo...   172   5e-45
dbj|GAU10637.1| hypothetical protein TSUD_418350, partial [Trifo...   174   8e-45

>ref|XP_012567983.1| PREDICTED: uncharacterized protein LOC101515713 [Cicer arietinum]
          Length = 968

 Score =  216 bits (549), Expect = 2e-59
 Identities = 109/301 (36%), Positives = 170/301 (56%), Gaps = 6/301 (1%)
 Frame = -1

Query: 901 PSPGDPNRSYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLE 722
           P P   NR ++ F K+ PP+F G    + +   LD M KIF  ++C+EE++V FA HML+
Sbjct: 38  PPPEASNRLFYDFHKLKPPAFLGSLVPLEAQSWLDEMTKIFLVVRCTEEDKVAFATHMLQ 97

Query: 721 EEAQTWWTNAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEF 542
            EA+ WW  A   M + GTP+ W NF  +F+  + P  ++  K+ EF +L+QG+M+V ++
Sbjct: 98  GEAENWWKGAKAYMISAGTPMNWENFCTVFLDKYIPMSIRKQKEFEFTHLQQGDMSVADY 157

Query: 541 TAKFEKLSQFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEAD 362
            AKFE+L++F   A++AP+D WK+  ++ GL  ++  +L+   +T+YA LV ++Y+ E  
Sbjct: 158 VAKFEELARFCAQAEYAPNDRWKINQFEWGLNPEIKSNLAQLEITSYATLVHKSYIVEES 217

Query: 361 LSRLXXXXXXXXXXXXXXXXSPTPNDE--PKGNSNKGKQAMASEFLRF---GKCPKCGKP 197
           L  L                +P  N +   K + NKGKQ   S   +     +CPKCG+ 
Sbjct: 218 LRSL---KENRQLKWQQRRDAPKSNQQLKVKTSPNKGKQPQNSVVPQARGPRECPKCGRS 274

Query: 196 HSGECRLGSSLCFYCKGEGHYANECPARK-RSREEHAAPALGRVYMLDTQKGKAEDNTAK 20
           H GEC  G ++CF+CK  GH + +CP RK +       P  GRVY L+ +K K  ++   
Sbjct: 275 HPGECLYGKNICFWCKTPGHLSQDCPQRKMKGLANSNGPLTGRVYTLNAKKTKGNNDLIA 334

Query: 19  G 17
           G
Sbjct: 335 G 335


>ref|XP_004491986.1| PREDICTED: uncharacterized protein LOC101494344 [Cicer arietinum]
          Length = 315

 Score =  191 bits (484), Expect = 9e-55
 Identities = 90/266 (33%), Positives = 149/266 (56%)
 Frame = -1

Query: 895 PGDPNRSYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEE 716
           P + NR ++ F K+ PP+F+G      +   +D ++K F  + C EE++V F AH+L+  
Sbjct: 47  PPNANRVFYDFLKLQPPTFQGSHNPSEAQAWMDEIKKEFEVVPCIEEQKVAFVAHLLKSR 106

Query: 715 AQTWWTNAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTA 536
           A+ WW +A+  ++ QGT + W +F+  F+  ++P   K   + EFV+L+QG+M+V E+  
Sbjct: 107 AEYWWRSANTYLQTQGTHMNWEHFEVSFLDKYYPKSAKRQNELEFVHLQQGDMSVVEYVV 166

Query: 535 KFEKLSQFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLS 356
           KFE+L++F   A++AP +EWK+  ++ GLR ++  ++    +TNY+ LV ++Y+ E +L 
Sbjct: 167 KFEELARFSPHAQYAPIEEWKINQFEWGLRPEIRGNIGHMELTNYSTLVHKSYIVEDNLK 226

Query: 355 RLXXXXXXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGECRL 176
           ++                      + K    KGKQ   S   R  KC KCG+ H  EC +
Sbjct: 227 KVQEERHAKWQQKKEFGKF-GQQLKVKTLQGKGKQVHTSSSPRARKCLKCGRDHGRECLV 285

Query: 175 GSSLCFYCKGEGHYANECPARKRSRE 98
           G   C+YCK  GH A  CP R++  E
Sbjct: 286 GKQFCYYCKQPGHMAPFCPIRQKQAE 311


>gb|PNY08424.1| putative retrotransposon Ty3-gypsy subclass protein [Trifolium
           pratense]
          Length = 1511

 Score =  194 bits (494), Expect = 8e-52
 Identities = 103/300 (34%), Positives = 151/300 (50%), Gaps = 5/300 (1%)
 Frame = -1

Query: 901 PSPGDP----NRSYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAA 734
           P P  P    N ++  + + +PP F G  +   +   +  MEKIFR   CSEE++V++A 
Sbjct: 84  PHPPPPAVVGNPAFREYNRNHPPEFNGERDPQEAKRWIKQMEKIFRMATCSEEDKVVYAT 143

Query: 733 HMLEEEAQTWWTNAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMT 554
           H     A+ WW  A   ME     + W NFK + +  + P   ++ K  EF+ L+QG M+
Sbjct: 144 HQFRGAAEDWWDGARRRMEANEVAVNWTNFKRVMMEKYLPKTFRIQKAQEFLELKQGGMS 203

Query: 553 VGEFTAKFEKLSQFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYV 374
           V EFT KFE+LS +    +H   +EWK+  YK GLR ++  ++     T +  LV ++ V
Sbjct: 204 VTEFTKKFEELSHYSSHNQHEADEEWKINQYKYGLRGEIEHTVGQQDFTCFDDLVHKSLV 263

Query: 373 AEADLSRLXXXXXXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPH 194
           AE  +++                       +PKG   KGKQ  +S+   +  C  CG  H
Sbjct: 264 AETSIAKTNREKSMAYEKKKDKYHQQL---KPKGPPQKGKQTQSSK--NYPACKNCGNSH 318

Query: 193 SGECRLGSSLCFYCKGEGHYANECPARK-RSREEHAAPALGRVYMLDTQKGKAEDNTAKG 17
           SG C  G+ +CF CK  GHY  ECP R+ RS  E    + GRVY LD +K KA ++   G
Sbjct: 319 SGPCMKGTGVCFLCKQPGHYQMECPHRRGRSGAEITTTSKGRVYSLDGRKAKANNDLIAG 378


>gb|PNX86789.1| cellular nucleic acid-binding protein [Trifolium pratense]
          Length = 357

 Score =  182 bits (462), Expect = 5e-51
 Identities = 90/278 (32%), Positives = 147/278 (52%), Gaps = 1/278 (0%)
 Frame = -1

Query: 865 FQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEEAQTWWTNAHE 686
           F +MNPP F G      +   +  M  I  +++C+E E+V F       +A  WW  A  
Sbjct: 82  FCRMNPPEFMGEYVPATAREWIQRMSDILESMECTEAEKVTFVTRFFRGDACNWWDGARA 141

Query: 685 LMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTAKFEKLSQFYQ 506
            M +  T + W NF+ +FI H+ P   +L  + E   L+QG ++V E+T +F +L ++  
Sbjct: 142 FMLSSQTEVNWTNFRRLFISHYIPESYQLQMEWELTELKQGSISVAEYTTRFNELVRYVP 201

Query: 505 LAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLSRLXXXXXXXX 326
            +  AP++ WK++ Y  GLRAD+A  +S   +T+   ++Q++Y AEA L  +        
Sbjct: 202 DSNDAPTEAWKIKKYHFGLRADIAHDVSLQPVTSLGEIIQKSYHAEASLEEMRKERGGIA 261

Query: 325 XXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGECRLGSSLCFYCKG 146
                       + +P+G+ +KGK   +    R  KCP+CG PH+GEC  G  +C+YC+ 
Sbjct: 262 QKKKDSEKY-NVHLKPRGSPSKGKHDYSPRSPR--KCPECGVPHNGECMKGRDVCYYCRQ 318

Query: 145 EGHYANECPARKRSRE-EHAAPALGRVYMLDTQKGKAE 35
            GHY ++CP  ++S +      + GRVY LD +K K +
Sbjct: 319 PGHYKSDCPKLQKSGDYSGTTKSKGRVYSLDGEKVKGK 356


>dbj|GAU46228.1| hypothetical protein TSUD_374740 [Trifolium subterraneum]
          Length = 1505

 Score =  191 bits (485), Expect = 1e-50
 Identities = 99/283 (34%), Positives = 146/283 (51%), Gaps = 1/283 (0%)
 Frame = -1

Query: 877 SYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEEAQTWWT 698
           ++  F +MNPP F G      +   +  M  I  ++ C+E E+V F       +A  WW 
Sbjct: 139 AFREFCRMNPPEFVGEYVPATAREWIQRMGGILESMNCTEAEKVTFVTRFFRGDACNWWE 198

Query: 697 NAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTAKFEKLS 518
            A   M +  T + W NF+ +FI H+ P   +L  + E   L+QG MTV E+T +F +L 
Sbjct: 199 GARTYMISSQTEMNWANFRRLFIAHYIPESYQLQMERELTELKQGGMTVAEYTTRFNELV 258

Query: 517 QFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLSRLXXXX 338
           ++      AP++ WKM+ Y+ GLRAD+A  +S   +TN   L+Q++Y AEA L  +    
Sbjct: 259 RYVTDGNDAPTEAWKMKKYRFGLRADIAHDVSMQPVTNLGDLIQKSYHAEASLGDIRRER 318

Query: 337 XXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGECRLGSSLCF 158
                             +P+G+ NKGKQ  +    R  KCP+CG PH+GEC  G  +CF
Sbjct: 319 GEAAQRRKDSGKY-NLQLKPRGSPNKGKQNYSPRSPR--KCPECGVPHNGECVKGKDVCF 375

Query: 157 YCKGEGHYANECP-ARKRSREEHAAPALGRVYMLDTQKGKAED 32
           YCK  GHY + CP  +K+     A    GRVY LD ++ K  +
Sbjct: 376 YCKKPGHYKSNCPILQKQGDSSGATRTQGRVYSLDGEEAKTNN 418


>ref|XP_012571115.1| PREDICTED: uncharacterized protein LOC105852086 [Cicer arietinum]
          Length = 293

 Score =  177 bits (449), Expect = 8e-50
 Identities = 85/247 (34%), Positives = 142/247 (57%), Gaps = 1/247 (0%)
 Frame = -1

Query: 781 FRTLQCSEEERVLFAAHMLEEEAQTWWTNAHELMENQGTPITWVNFKEMFIGHFFPPVMK 602
           F  +  ++E++V FAAH+L+  A+ WW +A   ++ QGT + W  F+  F+  ++P   +
Sbjct: 16  FEVVPYTDEQKVAFAAHLLKGGAEYWWRSAKTYLQTQGTHMNWEQFEVAFLDKYYPKSAR 75

Query: 601 LSKQAEFVNLRQGEMTVGEFTAKFEKLSQFYQLAKHAPSDEWKMQMYKAGLRADVAQSLS 422
             K+ EFV+L+QG+M++ E+ AKFE+L++F   A++AP++EWK+  ++ GLR ++  ++ 
Sbjct: 76  RQKELEFVHLQQGDMSIAEYVAKFEELARFSPHAQYAPTEEWKINQFEWGLRPEIRGNIG 135

Query: 421 SYAMTNYAALVQQAYVAEADLSRLXXXXXXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMA 242
              +TNY+ LV ++Y+ E +L ++                      + K    KGKQ   
Sbjct: 136 HMELTNYSTLVHKSYIVEDNLKKVQEERQVKWQQKKESEKF-GQQLKVKTPQGKGKQVQT 194

Query: 241 SEFLRFGKCPKCGKPHSGECRLGSSLCFYCKGEGHYANECPARKRSREEHAAPA-LGRVY 65
           S   R  KC KCG+ H GEC +G  +C+YCK  GH A  CP R++  E +   +  GRV+
Sbjct: 195 SSSPRTKKCLKCGRDHGGECLVGKQVCYYCKQPGHMAPFCPIRQKQAEFNPNKSNTGRVF 254

Query: 64  MLDTQKG 44
            L  +KG
Sbjct: 255 ALSAKKG 261


>ref|XP_015957327.1| uncharacterized protein LOC107481562 [Arachis duranensis]
          Length = 392

 Score =  177 bits (449), Expect = 1e-48
 Identities = 97/306 (31%), Positives = 154/306 (50%), Gaps = 14/306 (4%)
 Frame = -1

Query: 892 GDPNRSYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEEA 713
           G    ++  F+K+ PP F+G  +  ++   L  MEK+F    C+EE++V +A  ML+ +A
Sbjct: 57  GQTRTTFADFKKIGPPEFRGALDSDMAEEWLTEMEKVFTIFSCTEEQQVSYATFMLKADA 116

Query: 712 QTWWTNAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTAK 533
           + WW  A  L+E+ GT I+W  FKE F   +FP  ++ SK+ EF+ LRQG M++ E+T K
Sbjct: 117 EFWWDGARRLLEDAGTDISWATFKEAFYKKYFPLSVRESKEMEFLQLRQGRMSIAEYTEK 176

Query: 532 FEKLSQFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLSR 353
           FE+L +F  + K  P ++WK   Y+ GLRA+V  +++   +  +++LV +  V E    +
Sbjct: 177 FERLCKFSAMYKANPDEKWKCMNYQGGLRAEVLTAIAPLEIREFSSLVSKCQVIEECTKK 236

Query: 352 LXXXXXXXXXXXXXXXXSPTPNDEPKGNSNK--GKQAM------------ASEFLRFGKC 215
           L                S  P  + K    K  G+Q              + +     +C
Sbjct: 237 LASERSEAFKKRQLNQESSQPPPQKKAFLEKPTGRQPQQGTDRQETSTDASPKTTELKEC 296

Query: 214 PKCGKPHSGECRLGSSLCFYCKGEGHYANECPARKRSREEHAAPALGRVYMLDTQKGKAE 35
             CGK H G C  G ++CF C   GH A ECPA   S   +     GRV+ L  ++ +  
Sbjct: 297 ASCGKQHRGRCLAGQNICFRCSQPGHIARECPAVLPS-PANLPQRQGRVFALSGEEVRES 355

Query: 34  DNTAKG 17
           ++  KG
Sbjct: 356 EDRNKG 361


>dbj|GAU12300.1| hypothetical protein TSUD_142090 [Trifolium subterraneum]
          Length = 381

 Score =  174 bits (442), Expect = 8e-48
 Identities = 92/285 (32%), Positives = 144/285 (50%), Gaps = 1/285 (0%)
 Frame = -1

Query: 883 NRSYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEEAQTW 704
           N ++  F +M+PP F G      +   +  M  I  ++QCSE ERV FA       A  W
Sbjct: 78  NAAFREFCRMSPPEFVGEFIPSKAREWIQRMGDILDSMQCSEAERVNFATRFFRGNACNW 137

Query: 703 WTNAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTAKFEK 524
           W +    M      + W NF+ +FI H+ P       + E   L+QG M+V E+T KF +
Sbjct: 138 WESTKNFMMANQIEMNWTNFRCLFINHYVPESFSYKMEKELQELKQGSMSVAEYTMKFNE 197

Query: 523 LSQFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLSRLXX 344
           L ++      AP++ WK++ Y+ GLRAD+A  ++   + N+  L+Q++Y AEA L  +  
Sbjct: 198 LIRYAADGPEAPTEAWKIKKYRMGLRADIAHVVTMQPIANFGDLIQRSYHAEAGLEEIRK 257

Query: 343 XXXXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGECRLGSSL 164
                             + +PKG+ +KGK   +   L    C +CG PH GEC     +
Sbjct: 258 GRGEVMQKRKDHGKF-NAHLKPKGSPSKGKHTYSPRSL--CNCSECGFPHEGECPRAKGV 314

Query: 163 CFYCKGEGHYANECP-ARKRSREEHAAPALGRVYMLDTQKGKAED 32
           C+YC+  GH+ +ECP  +K+S+      + GRVY LD +K K+ +
Sbjct: 315 CYYCRQPGHFKSECPKLKKQSKASGTTKSKGRVYSLDGKKNKSNN 359


>dbj|GAU10094.1| hypothetical protein TSUD_419930, partial [Trifolium subterraneum]
          Length = 410

 Score =  173 bits (438), Expect = 6e-47
 Identities = 91/283 (32%), Positives = 144/283 (50%), Gaps = 1/283 (0%)
 Frame = -1

Query: 877 SYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEEAQTWWT 698
           ++  F +MNPP F G     V+   +  M  I  +++C+E E+V FA   L   A  WW 
Sbjct: 21  AFREFCRMNPPEFVGEYIPSVAREWIQRMSGILDSMECTELEKVTFATRFLRGAACNWWE 80

Query: 697 NAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTAKFEKLS 518
                M      +TW NF+ +FI H+ P   ++S + E + L+QG  +V E+TAKF +L 
Sbjct: 81  GVRAYMTASQMEMTWANFRRLFIDHYIPESYRMSMERELIELKQGGKSVAEYTAKFNELV 140

Query: 517 QFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLSRLXXXX 338
           ++      AP++ WK++ Y+ GLRAD+A  +S   + +   L+Q++Y AE+ L  +    
Sbjct: 141 RYVADTDDAPTEAWKIKKYRFGLRADIAHDVSMLQVASLGELIQKSYHAESGLEAMRKER 200

Query: 337 XXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGECRLGSSLCF 158
                             +P+G+ +KGKQ  +    +   C +CG  H+GEC  G  +CF
Sbjct: 201 FEVNQKRRDSGKY-KEQLKPRGSPSKGKQNFSQRAQQ--ACSECGSIHNGECMKGKGVCF 257

Query: 157 YCKGEGHYANECPARKRS-REEHAAPALGRVYMLDTQKGKAED 32
           +CK  GHY NECP    S        + GRVY LD ++ +  +
Sbjct: 258 HCKQPGHYKNECPKLHGSGGSSGTTRSKGRVYSLDGEQARGNN 300


>dbj|GAU13379.1| hypothetical protein TSUD_126750 [Trifolium subterraneum]
          Length = 367

 Score =  170 bits (431), Expect = 2e-46
 Identities = 89/280 (31%), Positives = 142/280 (50%), Gaps = 1/280 (0%)
 Frame = -1

Query: 877 SYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEEAQTWWT 698
           ++  F +MNPP F G     V+   +  M  I  +++C+E E+V FA   L   A  WW 
Sbjct: 79  AFREFCRMNPPEFVGEYVPSVAREWIQRMSGILDSMECTELEKVTFATRFLRAAACNWWE 138

Query: 697 NAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTAKFEKLS 518
                M      +TW NF+ +FI H+ P   ++S + E + L+QG  +V E+TAKF +L 
Sbjct: 139 GVRAYMTASQMEMTWANFRRLFIDHYIPESYRMSMERELIELKQGGKSVAEYTAKFNELV 198

Query: 517 QFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLSRLXXXX 338
           ++   +  AP++ WK++ Y+ GLRAD+A  +S   + +   L++++Y  E+ L  +    
Sbjct: 199 RYVADSDDAPTEAWKIKKYRFGLRADIAHDVSMQQVASLGELIRKSYHVESGLEAMRKER 258

Query: 337 XXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGECRLGSSLCF 158
                             +P+G+ +KGKQ       +   C +CG  H+GEC  G  +CF
Sbjct: 259 FEVNQKRRDSGTY-KEQLKPRGSPSKGKQNFPQRSQQV--CSECGSVHNGECMKGKGVCF 315

Query: 157 YCKGEGHYANECPARKRS-REEHAAPALGRVYMLDTQKGK 41
           +CK  GHY NECP    S        + GRVY LD ++ +
Sbjct: 316 HCKQPGHYKNECPKLHGSGGSSGTTRSKGRVYSLDGEQAR 355


>dbj|GAU21844.1| hypothetical protein TSUD_176900 [Trifolium subterraneum]
          Length = 554

 Score =  174 bits (441), Expect = 3e-46
 Identities = 91/284 (32%), Positives = 148/284 (52%), Gaps = 1/284 (0%)
 Frame = -1

Query: 877 SYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEEAQTWWT 698
           ++  F +MNPP F G     V+   +  M  I  ++ C+E E+V FA   L   A  WW 
Sbjct: 13  AFREFCRMNPPEFVGECVPSVAREWIQRMSGILDSMACTELEKVTFATRFLRGAACNWWE 72

Query: 697 NAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTAKFEKLS 518
                M      +TWVNF+ +FI H+ P   ++S + E + L+QG  +V E+TAKF +L 
Sbjct: 73  GVRAYMTASQMEMTWVNFRRLFIDHYIPESYRMSMERELIELKQGGKSVAEYTAKFNELV 132

Query: 517 QFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLSRLXXXX 338
           ++   ++ AP++ WK++ Y+ GLRA++A  +S   + +   L+Q++Y AE+ L  +    
Sbjct: 133 RYMADSEDAPTEAWKIKKYRFGLRAEIAHDVSMLQVASLGELIQKSYHAESGLEAMRKER 192

Query: 337 XXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGECRLGSSLCF 158
                             +P+G+ +KGKQ  +    +   C +CG  H+GEC  G  +C+
Sbjct: 193 FEVNQKRRDSGKY-KEQLKPRGSPSKGKQNFSQRSQQ--ACSECGSVHNGECMKGKGVCY 249

Query: 157 YCKGEGHYANECPARKRS-REEHAAPALGRVYMLDTQKGKAEDN 29
           +CK  GHY NECP    S        + GRVY LD ++ +A ++
Sbjct: 250 HCKQPGHYKNECPKLHGSGGSSGTTRSKGRVYSLDGEQARATNS 293


>dbj|GAU10552.1| hypothetical protein TSUD_420900, partial [Trifolium subterraneum]
          Length = 422

 Score =  171 bits (432), Expect = 6e-46
 Identities = 89/276 (32%), Positives = 141/276 (51%), Gaps = 1/276 (0%)
 Frame = -1

Query: 856 MNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEEAQTWWTNAHELME 677
           MNPP F G     V+   +  M  I  +++C+E E+V FA   L   A  WW      M 
Sbjct: 1   MNPPEFVGEYVPSVAREWIQRMSGILDSMECTELEKVTFATRFLRGAACNWWEGVRAYMT 60

Query: 676 NQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTAKFEKLSQFYQLAK 497
                +TW NF+ +FI H+ P   ++S + E + L+QG  +V E+TAKF +L ++   ++
Sbjct: 61  ASQMEMTWANFRRLFIDHYIPESYRMSMERELIELKQGGKSVAEYTAKFNELVRYMADSE 120

Query: 496 HAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLSRLXXXXXXXXXXX 317
            AP++ WK++ Y+ GLRAD+A  +S   + +   L+Q++Y AE+ L  +           
Sbjct: 121 EAPTEAWKIKKYRYGLRADIAHDVSMLPVASLGELIQKSYHAESGLEAMRKERFEVNQKR 180

Query: 316 XXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGECRLGSSLCFYCKGEGH 137
                      +P+G+ +KGKQ  +    +   C +CG  H+GEC  G  +CF+CK  GH
Sbjct: 181 RDAGKY-KEQLKPRGSPSKGKQNFSQRSQQ--ACSECGSIHNGECMKGKGVCFHCKQPGH 237

Query: 136 YANECP-ARKRSREEHAAPALGRVYMLDTQKGKAED 32
           Y NECP             + GRVY LD ++ +  +
Sbjct: 238 YKNECPKLHGPGGSGGTTRSKGRVYSLDGEQARGNN 273


>dbj|GAU29982.1| hypothetical protein TSUD_360970 [Trifolium subterraneum]
          Length = 1553

 Score =  177 bits (448), Expect = 1e-45
 Identities = 93/283 (32%), Positives = 144/283 (50%), Gaps = 1/283 (0%)
 Frame = -1

Query: 877 SYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEEAQTWWT 698
           ++  F +MNPP F G     V+   +  M  I  +++C+E E+V FA   L   A  WW 
Sbjct: 79  AFREFCRMNPPEFVGEYVPSVAREWIQRMSGILNSMECTELEKVTFATRFLRGAACNWWE 138

Query: 697 NAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTAKFEKLS 518
                M      +TWVNF+ +FI H+ P   ++S + E + L+QG  +V E+T KF +L 
Sbjct: 139 GVRAYMTASQMEMTWVNFRRLFIDHYIPESYRMSMERELIELKQGGRSVAEYTTKFNELV 198

Query: 517 QFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLSRLXXXX 338
           Q+      AP++ WK++ Y+ GLRAD+A  +S   + +   L+Q++Y AE+ L  +    
Sbjct: 199 QYVADGDDAPTEAWKIKKYRFGLRADIAHDVSMQQLASLGELIQKSYHAESSLEAVRKER 258

Query: 337 XXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGECRLGSSLCF 158
                             +P+G+ +KGKQ  +    +   C KCG  H+GEC  G  +CF
Sbjct: 259 FEVNQKRRDSGKY-KEQLKPRGSPSKGKQKFSQRPQQ--ACSKCGSVHNGECMKGKYVCF 315

Query: 157 YCKGEGHYANECPARKRS-REEHAAPALGRVYMLDTQKGKAED 32
           +CK  GHY NECP    S        + GRVY LD ++ +  +
Sbjct: 316 HCKQPGHYKNECPKLHGSGGSSGTTGSKGRVYSLDGEQARGNN 358


>dbj|GAU47915.1| hypothetical protein TSUD_404680 [Trifolium subterraneum]
          Length = 408

 Score =  169 bits (429), Expect = 1e-45
 Identities = 93/288 (32%), Positives = 141/288 (48%), Gaps = 2/288 (0%)
 Frame = -1

Query: 889 DPNRSYFR-FQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEEA 713
           D   + FR F +MNPP F G     VS   +  M  I  ++ C+  E+  FA   L   A
Sbjct: 72  DTGSAAFREFCRMNPPEFVGEYVPSVSREWIQRMSGILDSMACTGLEKFTFATRFLRGAA 131

Query: 712 QTWWTNAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTAK 533
             WW      M      +TW NF+ +FI H+ P   ++S + E + L+Q   +V E+ AK
Sbjct: 132 CDWWEGVRAYMTASQMEMTWANFRRLFIDHYIPESYRMSMERELIELKQAGKSVAEYIAK 191

Query: 532 FEKLSQFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLSR 353
           F +L ++      AP++ WK++ Y+ GLRAD+A  +S   + ++  L+Q++Y AE+ L  
Sbjct: 192 FNELVRYVADGDDAPTEAWKIKKYRFGLRADIAHDVSMQPVASFGELIQKSYHAESSLEA 251

Query: 352 LXXXXXXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGECRLG 173
           +                      +P+G+  KGKQ +         CP+CG  H GEC  G
Sbjct: 252 MRKERFEVNQKRRDSGKY-KEQLKPRGSPQKGKQNVPQR--SHPACPECGMFHHGECMKG 308

Query: 172 SSLCFYCKGEGHYANECPARKRSR-EEHAAPALGRVYMLDTQKGKAED 32
             +CF+CK  GHY NECP    SR       + GRVY LD ++ +  +
Sbjct: 309 KGVCFHCKQPGHYKNECPKLHGSRGSSGTTKSKGRVYSLDGEQARGNN 356


>dbj|GAU10412.1| hypothetical protein TSUD_419120, partial [Trifolium subterraneum]
          Length = 576

 Score =  173 bits (438), Expect = 1e-45
 Identities = 89/283 (31%), Positives = 146/283 (51%), Gaps = 1/283 (0%)
 Frame = -1

Query: 877 SYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEEAQTWWT 698
           ++  F +MNPP F G     V+   +  M  I  +++C+E E+V FA   L   A  WW 
Sbjct: 84  AFREFCRMNPPEFVGEYVPSVAREWIQRMSGILDSMECTELEKVTFATRFLRGAACNWWE 143

Query: 697 NAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTAKFEKLS 518
                M      +TW NF+ +FI H+ P   ++S + E + L+QG  +V E+TAKF +L 
Sbjct: 144 GVRAYMTASQMEMTWANFRRLFIDHYIPESYRMSMERELIELKQGSKSVAEYTAKFNELV 203

Query: 517 QFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLSRLXXXX 338
           ++   +  AP++ WK++ Y+ GLRAD+A  +S   + +   L+Q++Y+AE+ L  +    
Sbjct: 204 RYVADSDDAPTEAWKIKKYRFGLRADIAHDVSMQQVASLGELIQKSYLAESGLEAMRKER 263

Query: 337 XXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGECRLGSSLCF 158
                             +P+G+ +KGKQ  +    +   C +CG  H+GEC  G  +CF
Sbjct: 264 FEVNQKKRDSGKY-KEQLKPRGSPSKGKQNFSKRSQQV--CSECGSIHNGECMKGKGVCF 320

Query: 157 YCKGEGHYANECPARKRS-REEHAAPALGRVYMLDTQKGKAED 32
           +CK  GH+ +ECP    S        + GRVY LD ++ +  +
Sbjct: 321 HCKQPGHFKSECPKLHSSGGSSGTTRSKGRVYSLDGEQARGNN 363


>dbj|GAU10180.1| hypothetical protein TSUD_418650, partial [Trifolium subterraneum]
          Length = 418

 Score =  169 bits (429), Expect = 1e-45
 Identities = 91/283 (32%), Positives = 141/283 (49%), Gaps = 1/283 (0%)
 Frame = -1

Query: 877 SYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEEAQTWWT 698
           ++  F +MNPP F G     V+   +  M  I  +++C+E E+V FA   L   A  WW 
Sbjct: 79  AFREFCRMNPPEFVGEYVPSVAREWIQRMSGILDSMECTELEKVTFATRFLRGAACNWWE 138

Query: 697 NAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTAKFEKLS 518
                M      +TW NF+ +FI H+ P   ++S + E + L+QG  +V E+TAKF +L 
Sbjct: 139 GVRAYMTASQMEMTWANFRRLFIDHYIPESYRMSMERELIELKQGGKSVAEYTAKFNELV 198

Query: 517 QFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLSRLXXXX 338
           ++   +  AP++ WK++ Y+ GLRAD+A  +S   + +   L+Q++Y AE+ L  +    
Sbjct: 199 RYVADSDDAPTEAWKIKKYRFGLRADIAHDVSMQQVASLGELIQKSYHAESGLEAMRKER 258

Query: 337 XXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGECRLGSSLCF 158
                             +  G+  KGKQ +         CP+CG  H GEC  G  +CF
Sbjct: 259 FEVNQKRRDSGKY-KEQLKSMGSPQKGKQNVPQR--SHPACPECGVFHHGECMKGKGVCF 315

Query: 157 YCKGEGHYANECPARKRS-REEHAAPALGRVYMLDTQKGKAED 32
           +CK  GHY NECP    S        + GRVY LD ++ +  +
Sbjct: 316 HCKQPGHYKNECPKLHGSGGSSGTTRSKGRVYSLDGEQARGNN 358


>dbj|GAU48322.1| hypothetical protein TSUD_187580 [Trifolium subterraneum]
          Length = 449

 Score =  170 bits (430), Expect = 2e-45
 Identities = 91/283 (32%), Positives = 143/283 (50%), Gaps = 1/283 (0%)
 Frame = -1

Query: 877 SYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEEAQTWWT 698
           ++  F +MNPP F G     V+   +  M  I  ++ C E E+V FA   L   A  WW 
Sbjct: 79  AFREFSRMNPPEFVGEYVPSVAREWIQRMSGILDSMACIELEKVTFATRFLRGAACNWWE 138

Query: 697 NAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTAKFEKLS 518
                M      +TW NF+ +FI H+ P   ++S + E + L+QG  +V E+TAKF +L 
Sbjct: 139 GVRAYMTASQMEMTWANFRRLFIDHYIPESYRMSMERELIELKQGGKSVAEYTAKFNELV 198

Query: 517 QFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLSRLXXXX 338
           ++   +  AP++ WK++ Y+ GLRAD+A  +S   +T+   L+Q++Y AE+ L  +    
Sbjct: 199 RYVADSDDAPTEAWKIKKYRFGLRADIAHDVSMQQVTSPGELIQKSYHAESSLEAMRKER 258

Query: 337 XXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGECRLGSSLCF 158
                             +P+G+ +KGKQ  +    +   C +CG  H+GEC  G  +CF
Sbjct: 259 FEVNQKRRDSGKY-KEQLKPRGSPSKGKQNFSQRSEQV--CSQCGSVHNGECMKGKGVCF 315

Query: 157 YCKGEGHYANECPARKRS-REEHAAPALGRVYMLDTQKGKAED 32
           + K  GHY NECP    S        + GRVY LD ++ +  +
Sbjct: 316 HYKQPGHYKNECPKLHGSGGSSGTTRSKGRVYSLDGEQARGNN 358


>dbj|GAU49923.1| hypothetical protein TSUD_180410 [Trifolium subterraneum]
          Length = 947

 Score =  174 bits (442), Expect = 5e-45
 Identities = 94/292 (32%), Positives = 148/292 (50%), Gaps = 1/292 (0%)
 Frame = -1

Query: 901  PSPGDPNRSYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLE 722
            P+    + ++  F +MNPP F G     V+   +  M  I  ++ C E E+V FA   L 
Sbjct: 208  PAQNTGSAAFREFCRMNPPEFVGEYIPSVAREWIQRMSGILDSMGCIELEKVTFATRFLC 267

Query: 721  EEAQTWWTNAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEF 542
              A  WW      M      +TW NF+ +FI H+ P   ++S + E + L+QG  +V E+
Sbjct: 268  GAACNWWEGVRVYMTASQMEMTWANFRRLFIDHYIPESYRMSMERELIELKQGSKSVAEY 327

Query: 541  TAKFEKLSQFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEAD 362
            TAKF +L ++   +  AP++ WK++ Y+ GLRAD+AQ ++   + +   L+Q++Y AE+ 
Sbjct: 328  TAKFNELVRYVADSDDAPTEAWKIKKYRFGLRADIAQDVAMQQVASLGELIQKSYHAESG 387

Query: 361  LSRLXXXXXXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGEC 182
            L  +                      +P+G+  KGKQ ++        CP+CG  H GEC
Sbjct: 388  LEAMRKERFEVNQKRRDSGKY-KEQLKPRGSPQKGKQNVSQR--SHPACPECGMFHHGEC 444

Query: 181  RLGSSLCFYCKGEGHYANECPARKRS-REEHAAPALGRVYMLDTQKGKAEDN 29
              G  +CF+CK  GHY NECP    S        + GRVY LD ++ +A ++
Sbjct: 445  MKGKGVCFHCKQPGHYKNECPKLHVSGGSSGTTKSKGRVYSLDGEQARATNS 496


>dbj|GAU10177.1| hypothetical protein TSUD_418640, partial [Trifolium subterraneum]
          Length = 667

 Score =  172 bits (437), Expect = 5e-45
 Identities = 90/279 (32%), Positives = 143/279 (51%), Gaps = 1/279 (0%)
 Frame = -1

Query: 865 FQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLEEEAQTWWTNAHE 686
           F +MNPP F G     V+   +  M  I  +++C+E E+V FA   L   A  WW     
Sbjct: 83  FCRMNPPEFVGEYVPSVAREWIQRMSGILDSMECTELEKVTFATRFLRGAACNWWEGVRA 142

Query: 685 LMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEFTAKFEKLSQFYQ 506
            M      +TW NF+ +F+ H+ P   ++S + E + L+QG  +V E+TAKF +L ++  
Sbjct: 143 YMTASQMEMTWANFRCLFVDHYIPESYRMSMERELIELKQGGKSVAEYTAKFNELVRYVA 202

Query: 505 LAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEADLSRLXXXXXXXX 326
               AP++ WK++ Y+ GLRAD+A  +S   + +   L+Q++Y AE+ L  +        
Sbjct: 203 DTDDAPTEAWKIKKYRFGLRADIAHDVSMLQVASLGELIQKSYHAESGLEAMRKERFEVN 262

Query: 325 XXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGECRLGSSLCFYCKG 146
                         +P+G+ +KGKQ  +    +   CP+CG  H+GEC  G  +CF+CK 
Sbjct: 263 QKRRDSGKY-KEQLKPRGSPSKGKQNFSQRSQQ--ACPECGSIHNGECMKGKGVCFHCKQ 319

Query: 145 EGHYANECPARKRS-REEHAAPALGRVYMLDTQKGKAED 32
            GHY NECP    S        + GRVY L+ ++ +  +
Sbjct: 320 PGHYKNECPKLHGSGGSSGTTRSKGRVYSLEGEQARGNN 358


>dbj|GAU10637.1| hypothetical protein TSUD_418350, partial [Trifolium subterraneum]
          Length = 883

 Score =  174 bits (440), Expect = 8e-45
 Identities = 91/291 (31%), Positives = 145/291 (49%), Gaps = 1/291 (0%)
 Frame = -1

Query: 901 PSPGDPNRSYFRFQKMNPPSFKGGPEVMVSHC*LDSMEKIFRTLQCSEEERVLFAAHMLE 722
           P+    + ++  F +MNPP F G     V+   +  M  I  ++ C+E E+V F    L 
Sbjct: 48  PTQNTGSAAFREFCRMNPPEFVGEYVPSVAREWIQRMSGILDSMGCTELEKVTFTTRFLR 107

Query: 721 EEAQTWWTNAHELMENQGTPITWVNFKEMFIGHFFPPVMKLSKQAEFVNLRQGEMTVGEF 542
             A  WW      M      +TW NF+ +FI H+ P   ++S + E + L+QG  +V E+
Sbjct: 108 GAACNWWEGVRAYMTASQIEMTWANFRRLFIDHYIPESYRMSMERELIELKQGSKSVAEY 167

Query: 541 TAKFEKLSQFYQLAKHAPSDEWKMQMYKAGLRADVAQSLSSYAMTNYAALVQQAYVAEAD 362
           T+KF +L ++   +  AP++ WK++ Y+ GLRAD+A  +S   + +   L+Q++Y AE+ 
Sbjct: 168 TSKFNELVRYVADSDDAPTEAWKIKKYRFGLRADIAHDVSMQQVASLGELIQKSYHAESG 227

Query: 361 LSRLXXXXXXXXXXXXXXXXSPTPNDEPKGNSNKGKQAMASEFLRFGKCPKCGKPHSGEC 182
           L  +                      +P+G+  KGKQ ++        CP+CG  H GEC
Sbjct: 228 LEAMRKERFEVNQKRRDSGKY-KEQLKPRGSPQKGKQNVSQR--SHPACPECGMFHHGEC 284

Query: 181 RLGSSLCFYCKGEGHYANECPARKRS-REEHAAPALGRVYMLDTQKGKAED 32
             G  +CF+CK  GHY NECP    S        + GRVY LD ++ +  +
Sbjct: 285 MKGKGVCFHCKQSGHYKNECPKLHGSGGSSGTTKSKGRVYSLDGEQARGNN 335