BLASTX nr result

ID: Astragalus24_contig00016414 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00016414
         (1431 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KHN45890.1| Retrovirus-related Pol polyprotein from transposo...   283   5e-88
dbj|GAU44417.1| hypothetical protein TSUD_100640 [Trifolium subt...   296   1e-85
dbj|GAU37126.1| hypothetical protein TSUD_278780 [Trifolium subt...   290   2e-85
gb|PNY05002.1| putative copia-type polyprotein [Trifolium pratense]   286   4e-85
gb|PNX98468.1| putative copia-type polyprotein, partial [Trifoli...   289   5e-83
dbj|GAU25658.1| hypothetical protein TSUD_265850 [Trifolium subt...   283   2e-81
dbj|GAU44225.1| hypothetical protein TSUD_399890 [Trifolium subt...   259   1e-79
ref|XP_006580852.1| PREDICTED: uncharacterized protein LOC102661...   261   7e-79
gb|PNX90720.1| pectinesterase, partial [Trifolium pratense]           251   2e-76
dbj|GAU18816.1| hypothetical protein TSUD_81050 [Trifolium subte...   257   1e-71
ref|XP_006579114.1| PREDICTED: uncharacterized protein LOC102664...   239   2e-71
dbj|GAU22886.1| hypothetical protein TSUD_376970 [Trifolium subt...   255   2e-71
ref|XP_006582570.1| PREDICTED: uncharacterized protein LOC102664...   239   1e-70
dbj|GAU10169.1| hypothetical protein TSUD_421370, partial [Trifo...   234   3e-70
ref|XP_019429635.1| PREDICTED: uncharacterized protein LOC109337...   235   4e-70
ref|XP_019423054.1| PREDICTED: uncharacterized protein LOC109332...   235   6e-70
ref|XP_006588085.1| PREDICTED: uncharacterized protein LOC102667...   241   7e-70
dbj|GAU47338.1| hypothetical protein TSUD_101250, partial [Trifo...   233   7e-69
ref|XP_006582657.1| PREDICTED: uncharacterized protein LOC102669...   228   1e-68
ref|XP_006575979.1| PREDICTED: uncharacterized protein LOC102662...   235   3e-68

>gb|KHN45890.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 374

 Score =  283 bits (724), Expect = 5e-88
 Identities = 150/300 (50%), Positives = 204/300 (68%), Gaps = 10/300 (3%)
 Frame = -3

Query: 871 MANKSTSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXX 692
           MA  S++NG FPA+LPIF+GKNYD+W A+MKVIFR+QDV+EIVN GV             
Sbjct: 1   MATTSSNNG-FPAHLPIFDGKNYDQWIAKMKVIFRFQDVVEIVNTGVAALPRNPTDDQDA 59

Query: 691 XNRELQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLR 512
            ++E +K+DGK LF+IHQC+D+++ E I+ CE+ K AWD L   YGG ++LKKV+LQ+LR
Sbjct: 60  AHKEQKKRDGKALFIIHQCLDADIFEKIVHCENAKEAWDTLARNYGGDEKLKKVRLQALR 119

Query: 511 KQNESLKMKDQETITEFVTRLVALT-QMKSSGETVTELSKIEKVLRSLTLNFDHVVVAIE 335
           +Q E L+M + ET+ ++  RL +LT QM  +GE +++L KIEKVLR+LT  FDH+VVA E
Sbjct: 120 RQYELLQMTEGETVNQYFVRLTSLTNQMVRNGEKISDLMKIEKVLRTLTPKFDHIVVAKE 179

Query: 334 ESKNLDEMRLEELQGSLEAHELR------IKQRSSEKETEQALQAQASXXXXXXXXXXXX 173
           ESKNLDE+++EELQ SLEAHELR      IK +S+E   +QALQAQ              
Sbjct: 180 ESKNLDELKIEELQASLEAHELRLNERSKIKDKSNESAADQALQAQHQKKGKYKKGKRKN 239

Query: 172 XXXXXXXXDFGEG---TSEKGASDVGNKKKTDKSSIQCYNCQQFGHYRNQCKNKKVPRNK 2
                      +G   +S++G  + G KKK +K  IQCYNCQ++GH+  +CK+KKVPR +
Sbjct: 240 QNSKNSNEGTSKGHDHSSQEG--NKGQKKKINKKDIQCYNCQKWGHFAAECKSKKVPREE 297


>dbj|GAU44417.1| hypothetical protein TSUD_100640 [Trifolium subterraneum]
          Length = 1318

 Score =  296 bits (758), Expect = 1e-85
 Identities = 155/294 (52%), Positives = 199/294 (67%), Gaps = 6/294 (2%)
 Frame = -3

Query: 871 MANKSTSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXX 692
           MAN++T++  FPANLP+F G+NYDRWCAQMKVIFR+QDVLE V +GV             
Sbjct: 1   MANQNTTSTQFPANLPVFKGENYDRWCAQMKVIFRFQDVLETVINGVAELAANAEEAART 60

Query: 691 XNRELQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLR 512
            + EL+KKD K LF+IHQCVD N+ E II+ E++K AWD L N YGG ++LK ++LQ+LR
Sbjct: 61  HHHELKKKDAKALFIIHQCVDPNIFEKIIEEETSKGAWDTLKNTYGGDEKLKGIKLQALR 120

Query: 511 KQNESLKMKDQETITEFVTRLVALTQ-MKSSGETVTELSKIEKVLRSLTLNFDHVVVAIE 335
           +Q E ++M +QETI E++ R+++LT  MK+ GE +++ SKIEKVLR+LT  FDH+VVAIE
Sbjct: 121 RQYEMMQMNEQETIAEYLARMLSLTNLMKACGEALSDRSKIEKVLRTLTEKFDHIVVAIE 180

Query: 334 ESKNLDEMRLEELQGSLEAHELRIKQRSSEKETEQALQAQASXXXXXXXXXXXXXXXXXX 155
           ESK+L  M++EELQ SLEAHELR+KQRSS K  EQALQA+                    
Sbjct: 181 ESKDLATMKIEELQASLEAHELRVKQRSSNKAVEQALQAKIQNKNYKGKDKWKKKKEEPE 240

Query: 154 XXDFGEGT----SEKGASDVGN-KKKTDKSSIQCYNCQQFGHYRNQCKNKKVPR 8
                  T    S KG  +  N KKK DK  IQCYNCQ +GHY  +C +KKV R
Sbjct: 241 NSSKNSKTQAVGSIKGNQNKKNPKKKIDKKDIQCYNCQNYGHYARECNSKKVER 294


>dbj|GAU37126.1| hypothetical protein TSUD_278780 [Trifolium subterraneum]
          Length = 870

 Score =  290 bits (741), Expect = 2e-85
 Identities = 153/295 (51%), Positives = 196/295 (66%), Gaps = 7/295 (2%)
 Frame = -3

Query: 871 MANKSTSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXX 692
           MAN++T++  FPANLP+F G+NYDRWCAQMKVIFR+QDVLE V +GV             
Sbjct: 1   MANQNTTSTQFPANLPVFKGENYDRWCAQMKVIFRFQDVLETVINGVAELAANADEAART 60

Query: 691 XNRELQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLR 512
            + EL+KK+ K LF+IHQCVD N+ E II+ E++K A D L N YGG ++LK ++LQ+LR
Sbjct: 61  HHHELKKKEAKALFIIHQCVDPNIFEKIIEEETSKGACDTLRNTYGGDEKLKGIKLQALR 120

Query: 511 KQNESLKMKDQETITEFVTRLVALTQ-MKSSGETVTELSKIEKVLRSLTLNFDHVVVAIE 335
           +Q E ++M DQETI E++ R+++LT  MK+ GE +++ SKIEKVLR+LT  FDH+VVAIE
Sbjct: 121 RQYEMMQMNDQETIAEYLARMLSLTNLMKACGEALSDRSKIEKVLRTLTEKFDHIVVAIE 180

Query: 334 ESKNLDEMRLEELQGSLEAHELRIKQRSSEKETEQALQAQASXXXXXXXXXXXXXXXXXX 155
           ESKNL  M++E+LQ  LEAHELR+KQRSS K  EQALQA+                    
Sbjct: 181 ESKNLATMKIEDLQAYLEAHELRVKQRSSNKAVEQALQAKIQNKNYKGKDKWKKKKEESE 240

Query: 154 XXDFGEGTSEKGASDVGN------KKKTDKSSIQCYNCQQFGHYRNQCKNKKVPR 8
                  T   G S  GN      KKK DK  IQCYNCQ +GHY  +C +KKV R
Sbjct: 241 NSSKNSKTQAAG-SIKGNQNKKNPKKKIDKKDIQCYNCQNYGHYARECNSKKVER 294


>gb|PNY05002.1| putative copia-type polyprotein [Trifolium pratense]
          Length = 762

 Score =  286 bits (733), Expect = 4e-85
 Identities = 152/296 (51%), Positives = 200/296 (67%), Gaps = 10/296 (3%)
 Frame = -3

Query: 859 STSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXXXNRE 680
           ++SN  FPA+LPIF+GKNYD+W A+MKVIFR QDV+EIVNDGV              ++E
Sbjct: 3   TSSNNGFPAHLPIFDGKNYDQWIAKMKVIFRLQDVVEIVNDGVAALPRNPNDEQNAVHKE 62

Query: 679 LQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLRKQNE 500
            +KKDGK LF+IHQC+D+++ E I+ CES K AWD L   YGG ++LKKV+LQSLR+Q E
Sbjct: 63  SKKKDGKALFIIHQCLDADIFEKILHCESAKEAWDTLARNYGGDEKLKKVRLQSLRRQYE 122

Query: 499 SLKMKDQETITEFVTRLVALT-QMKSSGETVTELSKIEKVLRSLTLNFDHVVVAIEESKN 323
            L+M + ET++++  RL +LT QM  +GET+++L KIEKVLR+LT  FDH+VVA EESKN
Sbjct: 123 LLQMNESETVSQYFVRLTSLTNQMVRNGETISDLMKIEKVLRTLTPKFDHIVVAKEESKN 182

Query: 322 LDEMRLEELQGSLEAHELRIKQRS------SEKETEQALQAQASXXXXXXXXXXXXXXXX 161
           L+E++ EELQ SLEAHELR+ +RS      SE   +QALQAQ +                
Sbjct: 183 LEELKFEELQASLEAHELRLTERSKNNGKQSEDSNDQALQAQYNKKGKNQNSNE------ 236

Query: 160 XXXXDFGEG---TSEKGASDVGNKKKTDKSSIQCYNCQQFGHYRNQCKNKKVPRNK 2
                 G G    S +  +  G KKK +K  IQCYNCQ++GH+  +CK+KKVPR K
Sbjct: 237 ------GNGKNQDSNQQENSNGQKKKFNKKEIQCYNCQKWGHFAAECKSKKVPREK 286


>gb|PNX98468.1| putative copia-type polyprotein, partial [Trifolium pratense]
          Length = 1267

 Score =  289 bits (739), Expect = 5e-83
 Identities = 150/295 (50%), Positives = 200/295 (67%), Gaps = 6/295 (2%)
 Frame = -3

Query: 871 MANKSTSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXX 692
           MAN++T++  FPANLP+F G+NYDRWCAQM+VI R+QD LEIV DGV             
Sbjct: 1   MANQTTTSSQFPANLPVFKGENYDRWCAQMRVILRFQDCLEIVTDGVGELAEDADDEART 60

Query: 691 XNRELQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLR 512
            ++E ++KD K LF+IHQCVD NV E II+ E++K AWD L + YGG ++LK ++LQ+LR
Sbjct: 61  LHKETKRKDAKSLFIIHQCVDPNVFEKIIEEETSKGAWDKLKDYYGGDEKLKGIKLQALR 120

Query: 511 KQNESLKMKDQETITEFVTRLVALTQ-MKSSGETVTELSKIEKVLRSLTLNFDHVVVAIE 335
           +Q E+++M+++E+I E+++RL++LT  MKS GE +   SKI+KVLR+LT  FDH+VVAIE
Sbjct: 121 RQYETMQMEEKESIGEYMSRLLSLTNLMKSCGEALEVKSKIQKVLRTLTEKFDHIVVAIE 180

Query: 334 ESKNLDEMRLEELQGSLEAHELRIKQRSSEKETEQALQAQASXXXXXXXXXXXXXXXXXX 155
           ESK+L  M++EELQ SLEAHELR+KQRSS K  EQALQA+                    
Sbjct: 181 ESKDLSTMKIEELQASLEAHELRVKQRSSSKAVEQALQAKVQNKNHKGKDKCKKKKDDSE 240

Query: 154 XXDFGE----GTSEKGASDVGN-KKKTDKSSIQCYNCQQFGHYRNQCKNKKVPRN 5
                     G S KG  +  N KKK DK  +QCYNCQ+ GHY  +C +KKV R+
Sbjct: 241 SSSKNSKNQAGESSKGNQNKKNFKKKVDKKDVQCYNCQKHGHYARECHSKKVDRD 295


>dbj|GAU25658.1| hypothetical protein TSUD_265850 [Trifolium subterraneum]
          Length = 1126

 Score =  283 bits (724), Expect = 2e-81
 Identities = 147/294 (50%), Positives = 194/294 (65%), Gaps = 9/294 (3%)
 Frame = -3

Query: 871 MANKSTSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXX 692
           MAN +T++  FPANLP+F G++YDRWCAQMKVIF +QDVLEIVNDGV             
Sbjct: 1   MANANTTSNQFPANLPVFRGEHYDRWCAQMKVIFIFQDVLEIVNDGVEDLVADANEAQRT 60

Query: 691 XNRELQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLR 512
            +R+ +KKDGKGLF+IHQCVDSN+ E II+ E+ K A D L   YGG ++LKKV+LQ+LR
Sbjct: 61  LHRDQKKKDGKGLFIIHQCVDSNIFEKIIEEEAAKGACDTLKKIYGGDEKLKKVKLQALR 120

Query: 511 KQNESLKMKDQETITEFVTRLVALT-QMKSSGETVTELSKIEKVLRSLTLNFDHVVVAIE 335
           KQ E  +M + ET+++F +RLV LT QMK+ GET+T L K+EKVLR LT NFD++VVAIE
Sbjct: 121 KQYEMTQMNEGETVSDFFSRLVTLTNQMKACGETITGLQKVEKVLRGLTANFDYIVVAIE 180

Query: 334 ESKNLDEMRLEELQGSLEAHELRIKQRSSEKETEQALQA--------QASXXXXXXXXXX 179
           ESK L +++LE+LQ SLEAHE+R+KQR+SEK+ +QALQA        + S          
Sbjct: 181 ESKVLSDLKLEDLQTSLEAHEMRLKQRTSEKKVDQALQAKFTKKGKEEGSKWNRGKEKWR 240

Query: 178 XXXXXXXXXXDFGEGTSEKGASDVGNKKKTDKSSIQCYNCQQFGHYRNQCKNKK 17
                        +   +       +KKK + + +QCY C++FGHY   C   K
Sbjct: 241 KKKNNVDAASKTSKNPYDSNKHSNNSKKKVNLNEVQCYCCEKFGHYVRNCPENK 294


>dbj|GAU44225.1| hypothetical protein TSUD_399890 [Trifolium subterraneum]
          Length = 295

 Score =  259 bits (661), Expect = 1e-79
 Identities = 130/220 (59%), Positives = 168/220 (76%), Gaps = 1/220 (0%)
 Frame = -3

Query: 871 MANKSTSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXX 692
           MAN++T++  FPANLP+F G+NYDRWCAQMKVIFR+QDVLE V +GV             
Sbjct: 1   MANQNTTSTQFPANLPVFKGENYDRWCAQMKVIFRFQDVLETVINGVVELDANADEATRT 60

Query: 691 XNRELQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLR 512
            + EL+KKD K LF+IHQCV+ N+ E II+ E +K AWD L N YGG ++LK ++LQ+L 
Sbjct: 61  EHHELKKKDAKALFIIHQCVNPNIFEKIIEEEISKGAWDTLKNTYGGDEKLKGIKLQALI 120

Query: 511 KQNESLKMKDQETITEFVTRLVALTQ-MKSSGETVTELSKIEKVLRSLTLNFDHVVVAIE 335
           +Q E ++M DQETI E++ R+++LT  MKS GE +++ SKIEKVLR+LT  FDH+VVAIE
Sbjct: 121 RQYEMMQMNDQETIAEYLARMLSLTNLMKSCGEALSDRSKIEKVLRTLTEKFDHIVVAIE 180

Query: 334 ESKNLDEMRLEELQGSLEAHELRIKQRSSEKETEQALQAQ 215
           ESK+L  M++EELQ SLEAHELR+KQRSS K  EQALQA+
Sbjct: 181 ESKDLATMKIEELQASLEAHELRVKQRSSNKAVEQALQAK 220


>ref|XP_006580852.1| PREDICTED: uncharacterized protein LOC102661360 [Glycine max]
          Length = 415

 Score =  261 bits (666), Expect = 7e-79
 Identities = 137/305 (44%), Positives = 193/305 (63%), Gaps = 20/305 (6%)
 Frame = -3

Query: 859 STSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXXXNRE 680
           ++SNG+FPA++P+  GKNYD WCAQMKVIFR+QDV E+V +GV              +R+
Sbjct: 2   TSSNGNFPASMPVLKGKNYDDWCAQMKVIFRFQDVTEVVQEGVQELDRNPTDAEKVAHRD 61

Query: 679 LQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLRKQNE 500
           L K+D K LF+IHQCVD +  + I   ++TK AWD L  +Y G  +LKKV+LQ+LR+Q E
Sbjct: 62  LMKRDAKALFIIHQCVDVDNFQKIRSADTTKKAWDTLEKSYAGDSKLKKVKLQTLRRQYE 121

Query: 499 SLKMKDQETITEFVTRLVALT-QMKSSGETVTELSKIEKVLRSLTLNFDHVVVAIEESKN 323
            L+M DQE I EF +R++A+T QM   G+  + L  I+KVLR+LT  FDH+VVAIE+ +N
Sbjct: 122 LLQMSDQENIGEFFSRVLAITNQMNVYGDKQSNLGIIDKVLRTLTPRFDHIVVAIEQGQN 181

Query: 322 LDEMRL-EELQGSLEAHELRIKQRSSEKETEQALQAQASXXXXXXXXXXXXXXXXXXXXD 146
           L+EM++ EELQG LEA E+R+ +R+S++  EQA+QAQ +                    +
Sbjct: 182 LEEMKIEEELQGILEAQEMRLNERNSQRSVEQAMQAQTTKGNNYDGGKNKKGKGKGKWKN 241

Query: 145 FGEGTSEKGASDVGN------------------KKKTDKSSIQCYNCQQFGHYRNQCKNK 20
                S +G S  GN                  KKK +K  IQCYNCQ++G++ ++C+NK
Sbjct: 242 NKWKGSSEGYSSSGNHNQNEETDKKDGGNHKVGKKKFNKKGIQCYNCQKWGYFADECRNK 301

Query: 19  KVPRN 5
           +VPRN
Sbjct: 302 RVPRN 306


>gb|PNX90720.1| pectinesterase, partial [Trifolium pratense]
          Length = 334

 Score =  251 bits (642), Expect = 2e-76
 Identities = 134/283 (47%), Positives = 180/283 (63%), Gaps = 2/283 (0%)
 Frame = -3

Query: 850 NGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXXXNRELQK 671
           N +F ANLP+F+GKN+D W  QMKVIF +Q+V E VN  +               RE  K
Sbjct: 4   NNNFHANLPVFDGKNWDLWVKQMKVIFTFQEVFEQVNAEIAPLPANATEEQRTTFREATK 63

Query: 670 KDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLRKQNESLK 491
           KD K LFLIHQCVDS V E I D E++K AWDIL  +YGG  ++KKV+LQ+L++Q E L+
Sbjct: 64  KDNKALFLIHQCVDSKVFEKIADAETSKGAWDILQKSYGGDAKVKKVKLQALKRQYELLQ 123

Query: 490 MKDQETITEFVTRLVALT-QMKSSGETVTELSKIEKVLRSLTLNFDHVVVAIEESKNLDE 314
           MK+ E I ++ TRLV LT QMKS G+T+ E  K+EKVLR+LT  FDH+VV IEE+K+L E
Sbjct: 124 MKNDEKIADYFTRLVTLTNQMKSCGDTLQEQEKVEKVLRTLTSRFDHIVVTIEETKDLSE 183

Query: 313 MRLEELQGSLEAHELRIKQRSSEKETEQALQAQASXXXXXXXXXXXXXXXXXXXXDFGE- 137
           +++E+LQ +LEAHE++  +R   KE EQAL A+                      D  + 
Sbjct: 184 VKIEDLQSTLEAHEMKHGERDHGKEDEQALYAKFKKFQSKKKWQKKNESKKGKESDEDKP 243

Query: 136 GTSEKGASDVGNKKKTDKSSIQCYNCQQFGHYRNQCKNKKVPR 8
            +S+K      N KK  K  IQC+NCQ+FG + ++C+ +KVPR
Sbjct: 244 ESSKKERGGSVNSKKGSKKHIQCFNCQEFGXFASECRGQKVPR 286


>dbj|GAU18816.1| hypothetical protein TSUD_81050 [Trifolium subterraneum]
          Length = 1380

 Score =  257 bits (657), Expect = 1e-71
 Identities = 138/295 (46%), Positives = 181/295 (61%), Gaps = 5/295 (1%)
 Frame = -3

Query: 871 MANKSTSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXX 692
           MA  S ++    A LPIF+G NYDRW AQMKV+FRY  VL++++ GV             
Sbjct: 1   MAETSKTSDQIHAKLPIFDGNNYDRWTAQMKVVFRYHGVLDVIHSGVTPLGEAPTETARA 60

Query: 691 XNRELQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLR 512
            +RE  KKD K ++  HQCVDSNV+E II+ E+ K AWD+L   Y G  + KKV+L +LR
Sbjct: 61  THREQMKKDDKAIYFFHQCVDSNVLEKIIEYETAKEAWDVLATTYAGDKQTKKVKLMALR 120

Query: 511 KQNESLKMKDQETITEFVTRLVALT-QMKSSGETVTELSKIEKVLRSLTLNFDHVVVAIE 335
           +Q   L+M+  ET+T+FV RL  LT QMKS GE VT+  K+EKV+  LT  FD++V AIE
Sbjct: 121 RQLGQLQMEPNETVTQFVNRLTVLTNQMKSCGEAVTDSLKVEKVITGLTPKFDNLVAAIE 180

Query: 334 ESKNLDEMRLEELQGSLEAHELRIKQRSS----EKETEQALQAQASXXXXXXXXXXXXXX 167
           +SK+LD ++LE+L GSLEAHEL++K R S    EKETE+AL  Q+               
Sbjct: 181 QSKDLDTLKLEQLIGSLEAHELKLKNRDSVKKDEKETEKALFTQSQKKGSGSYESWKKKG 240

Query: 166 XXXXXXDFGEGTSEKGASDVGNKKKTDKSSIQCYNCQQFGHYRNQCKNKKVPRNK 2
                   G+  S K     G  KK  K  IQCYNCQ++GH+ ++C N KVPR K
Sbjct: 241 K-------GKWKSNKNEGGNGKGKKKSKEHIQCYNCQKWGHFADECVNPKVPRKK 288


>ref|XP_006579114.1| PREDICTED: uncharacterized protein LOC102664804 [Glycine max]
          Length = 336

 Score =  239 bits (609), Expect = 2e-71
 Identities = 123/298 (41%), Positives = 180/298 (60%), Gaps = 15/298 (5%)
 Frame = -3

Query: 853 SNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXXXNRELQ 674
           SNG FP+NLPI  GKN++RW  QMK +  +QD+++++ +G+               ++L+
Sbjct: 3   SNGSFPSNLPILTGKNFNRWSVQMKALLGFQDLIDVIENGIEIPKEGASDSQKIEFKDLK 62

Query: 673 KKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLRKQNESL 494
           KKD K L ++HQCVD +  E I + +S K AWDILN AY GAD++KKV+LQ+LR+Q E L
Sbjct: 63  KKDCKALVILHQCVDDSHFEKIANAKSAKEAWDILNKAYAGADKIKKVRLQTLRRQFELL 122

Query: 493 KMKDQETITEFVTRL-VALTQMKSSGETVTELSKIEKVLRSLTLNFDHVVVAIEESKNLD 317
           +M++ E+I ++  RL V    + S G+T+T L+ +EKVL++L   FDH+VVAIEESK+L+
Sbjct: 123 QMEETESIGDYFGRLQVLANSITSCGDTITNLTLVEKVLKTLNPRFDHIVVAIEESKDLE 182

Query: 316 EMRLEELQGSLEAHELRIKQRSSEKETEQALQAQASXXXXXXXXXXXXXXXXXXXXDFGE 137
            + ++ELQGSLEAHE R+++R+++K TEQALQA                         G 
Sbjct: 183 SLSVDELQGSLEAHEQRLQERANDKATEQALQAHHQSRNGGSNYHRGKKGRGRFQNTRGR 242

Query: 136 GTSEK--------------GASDVGNKKKTDKSSIQCYNCQQFGHYRNQCKNKKVPRN 5
           G   K              G    G KKK DK +++C+NC + GHY  +C  K+   N
Sbjct: 243 GGYSKDKGKPQPDQRSGGSGTRGRGGKKKWDKRNVECFNCGKRGHYAEECWYKEKNAN 300


>dbj|GAU22886.1| hypothetical protein TSUD_376970 [Trifolium subterraneum]
          Length = 1121

 Score =  255 bits (651), Expect = 2e-71
 Identities = 138/294 (46%), Positives = 181/294 (61%), Gaps = 9/294 (3%)
 Frame = -3

Query: 871 MANKSTSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXX 692
           MAN +T++  FP NLP+F G++YDRWCAQMKVIFR+QDVLEIVNDGV             
Sbjct: 1   MANANTTSNQFPTNLPVFRGEHYDRWCAQMKVIFRFQDVLEIVNDGVEDLV--------- 51

Query: 691 XNRELQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLR 512
                             CVDSN+ E II+ E+ K A D L   YGG ++LKKV+LQ+LR
Sbjct: 52  ------------------CVDSNIFEKIIEEETAKGARDTLKKIYGGDEKLKKVKLQALR 93

Query: 511 KQNESLKMKDQETITEFVTRLVALT-QMKSSGETVTELSKIEKVLRSLTLNFDHVVVAIE 335
           KQ E  +M + ET+++F +RLV LT QMK+ GET+T+L K+EKVLR+LT NFD++VVAIE
Sbjct: 94  KQYEMTQMNEGETVSDFFSRLVTLTNQMKACGETITDLQKVEKVLRALTANFDYIVVAIE 153

Query: 334 ESKNLDEMRLEELQGSLEAHELRIKQRSSEKETEQALQA--------QASXXXXXXXXXX 179
           ESK L +M+LEELQ SLEAHE+R+KQR+SEK+ +QALQA        + S          
Sbjct: 154 ESKVLSDMKLEELQASLEAHEMRLKQRTSEKKVDQALQAKFTKKGKEEGSKWNKGKEKWR 213

Query: 178 XXXXXXXXXXDFGEGTSEKGASDVGNKKKTDKSSIQCYNCQQFGHYRNQCKNKK 17
                        +   +    +  +KKK + S +QCY C++FGHY   C   K
Sbjct: 214 KKKNNADAESKNSKNPDDSNKHNKNSKKKVNMSEVQCYCCEKFGHYARNCPVNK 267


>ref|XP_006582570.1| PREDICTED: uncharacterized protein LOC102664992 [Glycine max]
          Length = 394

 Score =  239 bits (609), Expect = 1e-70
 Identities = 126/288 (43%), Positives = 185/288 (64%), Gaps = 3/288 (1%)
 Frame = -3

Query: 859 STSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXXXNRE 680
           ++SNG+F A++P+  GKNYD WCAQM VIFR+QDV E+V +GV              +R+
Sbjct: 2   ASSNGNFSASMPVLKGKNYDDWCAQMNVIFRFQDVTEVVQEGVQELDRNPTDAQKVTHRD 61

Query: 679 LQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLRKQNE 500
           L KKD K LF+IHQCVD++  + I   ++ K AWD L  +Y G  +LKKV+LQ+LR+Q E
Sbjct: 62  LMKKDAKALFIIHQCVDADNFQKIRSADTAKKAWDTLEKSYAGDSKLKKVKLQTLRRQYE 121

Query: 499 SLKMKDQETITEFVTRLVALT-QMKSSGETVTELSKIEKVLRSLTLNFDHVVVAIEESKN 323
            L+M DQE+I EF +R++A+T Q+ + G+  ++L  I KVLR+LTL FDH+VVAIE+ +N
Sbjct: 122 LLQMSDQESIGEFFSRILAITNQINAYGDKQSDLEIINKVLRTLTLRFDHIVVAIEQGQN 181

Query: 322 LDEMRLEELQGSLEAHELRIKQRSSEKETEQALQAQASXXXXXXXXXXXXXXXXXXXXDF 143
           L+EM++EELQG LEA E+R+ +R+S++  EQA+QAQ +                     +
Sbjct: 182 LEEMKIEELQGILEAQEMRLNERNSQRSVEQAMQAQTTKGNNYDGGKNKKGKGKWKNNKW 241

Query: 142 GEGTSEKGASDVGN--KKKTDKSSIQCYNCQQFGHYRNQCKNKKVPRN 5
            +G+SE  +S   +   ++TDK            H  ++C+NK+VPRN
Sbjct: 242 -KGSSEGSSSSENHNQNEETDKKG-------GGNHKVDECRNKRVPRN 281


>dbj|GAU10169.1| hypothetical protein TSUD_421370, partial [Trifolium subterraneum]
          Length = 292

 Score =  234 bits (597), Expect = 3e-70
 Identities = 129/291 (44%), Positives = 178/291 (61%), Gaps = 1/291 (0%)
 Frame = -3

Query: 871 MANKSTSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXX 692
           MA KS    +F ANLPI +GKN+D W  QMKVIF  Q+  E VN  +             
Sbjct: 1   MAGKS----NFHANLPILDGKNWDTWVKQMKVIFIVQEADEQVNTILDPLPANATEQQRT 56

Query: 691 XNRELQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLR 512
             RE QKKD K LFLIHQCVDS V E I D  ++K+AWDIL  +YGG  ++KKV+LQ+L+
Sbjct: 57  TFREAQKKDSKALFLIHQCVDSKVFEKIADATTSKDAWDILQKSYGGDAKVKKVKLQALK 116

Query: 511 KQNESLKMKDQETITEFVTRLVALT-QMKSSGETVTELSKIEKVLRSLTLNFDHVVVAIE 335
           +Q E L+MK+ E + E+ TR+  LT QMK+   T++E   +EKVLR+LT  FDH+VV IE
Sbjct: 117 RQFELLEMKNDEAVAEYFTRVETLTNQMKNCRSTLSEEEMVEKVLRTLTHKFDHIVVTIE 176

Query: 334 ESKNLDEMRLEELQGSLEAHELRIKQRSSEKETEQALQAQASXXXXXXXXXXXXXXXXXX 155
           ++++L E+++E+LQ +LEAHEL+  +R+  KE EQAL  +                    
Sbjct: 177 QTRDLSEIKMEDLQSTLEAHELKHGERNHGKEDEQALFVKFKRYQDEKKKWQSKKGSKKG 236

Query: 154 XXDFGEGTSEKGASDVGNKKKTDKSSIQCYNCQQFGHYRNQCKNKKVPRNK 2
                E  +E    + G K K DKS+IQCYNC ++GHY ++CK  K  +++
Sbjct: 237 KESV-EDKTESSKKEGGQKTKKDKSTIQCYNCNKYGHYASECKAPKKKKSQ 286


>ref|XP_019429635.1| PREDICTED: uncharacterized protein LOC109337176 [Lupinus
           angustifolius]
          Length = 333

 Score =  235 bits (600), Expect = 4e-70
 Identities = 128/295 (43%), Positives = 175/295 (59%), Gaps = 15/295 (5%)
 Frame = -3

Query: 853 SNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXXXNRELQ 674
           SN  F   LP  NGKNY+RW  QMKV+F YQ+VL++V DG                RE  
Sbjct: 3   SNSGFTMVLPTLNGKNYERWQVQMKVLFGYQEVLDVVQDGFQAVGDEATEAQRSLFRECT 62

Query: 673 KKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLRKQNESL 494
           K+D K L +IHQCVD +  E I   +++K AWDIL   Y GA+++KKV+LQ+LR++ E L
Sbjct: 63  KRDCKALAMIHQCVDDSNFEKIAHSKTSKEAWDILGRCYAGAEKVKKVKLQTLRREYELL 122

Query: 493 KMKDQETITEFVTRLVALTQ-MKSSGETVTELSKIEKVLRSLTLNFDHVVVAIEESKNLD 317
           +MKD++TI ++ T+L +LT  MK  GET+ +   +E VLR+L   FDH+VVAIEESKNL+
Sbjct: 123 QMKDEDTIADYFTKLRSLTNLMKGCGETMKDQLIVENVLRTLNSKFDHIVVAIEESKNLE 182

Query: 316 EMRLEELQGSLEAHELRIKQRSSEKETEQALQAQASXXXXXXXXXXXXXXXXXXXXD--- 146
           ++++EELQ SLEAHE RIK+RS +++ EQALQA+ +                        
Sbjct: 183 DIKIEELQSSLEAHEQRIKERSLDRDPEQALQARFNKKFTSQGNFQKKIKGKWKGEKDKG 242

Query: 145 -----------FGEGTSEKGASDVGNKKKTDKSSIQCYNCQQFGHYRNQCKNKKV 14
                        E  S  G +   NKKK DK  IQC+ C+ FGHY  +C+ + V
Sbjct: 243 HEGSRQEPSSSSSERISVNGQALKRNKKKLDKKKIQCFRCKIFGHYAFECRTRLV 297


>ref|XP_019423054.1| PREDICTED: uncharacterized protein LOC109332526 [Lupinus
           angustifolius]
          Length = 331

 Score =  235 bits (599), Expect = 6e-70
 Identities = 126/285 (44%), Positives = 170/285 (59%), Gaps = 6/285 (2%)
 Frame = -3

Query: 853 SNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXXXNRELQ 674
           +N      LPI +GKNY+RW  QMKV+F YQ+VLEIV DG                RE +
Sbjct: 3   TNSGLTMTLPILDGKNYERWSVQMKVLFGYQEVLEIVQDGYESIGEDATKTQRSILRECK 62

Query: 673 KKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLRKQNESL 494
           KKD K LF+IHQCVD    E I + E+TK AWD L  +Y GA+++KKV+LQ+LR++ E L
Sbjct: 63  KKDCKALFIIHQCVDGANFEKIANAETTKEAWDNLEKSYAGAEKVKKVKLQTLRREYELL 122

Query: 493 KMKDQETITEFVTRLVALTQ-MKSSGETVTELSKIEKVLRSLTLNFDHVVVAIEESKNLD 317
           +MK+ ++I  + T++ +L+  MK  GE + +   +EKVLR+LT  FDHVVV IEESK+L+
Sbjct: 123 QMKEGDSIANYFTKIRSLSNLMKRCGEVMKDQLVVEKVLRTLTFKFDHVVVVIEESKDLE 182

Query: 316 EMRLEELQGSLEAHELRIKQRSSEKETEQALQAQASXXXXXXXXXXXXXXXXXXXXDFGE 137
             ++EELQ +LEAHE RIK+R  E+   QALQAQ +                       +
Sbjct: 183 AFKIEELQSTLEAHEQRIKERDQERNPNQALQAQFNRRNPGQGNFHKKSRGNWKNDRGRD 242

Query: 136 GTS-----EKGASDVGNKKKTDKSSIQCYNCQQFGHYRNQCKNKK 17
             S      +  S + NKKK DK  IQCYNC+  GHY   C+ K+
Sbjct: 243 DVSAQKHLHQDQSFINNKKKIDKKKIQCYNCKNLGHYACGCRFKQ 287


>ref|XP_006588085.1| PREDICTED: uncharacterized protein LOC102667204 [Glycine max]
          Length = 529

 Score =  241 bits (614), Expect = 7e-70
 Identities = 129/302 (42%), Positives = 187/302 (61%), Gaps = 17/302 (5%)
 Frame = -3

Query: 859 STSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXXXNRE 680
           ++SNG+FPA++P+  GKNYD WCAQMKVIFR+QDV E+V +GV              +R+
Sbjct: 2   ASSNGNFPASMPVLIGKNYDDWCAQMKVIFRFQDVTEVVQEGVQEPDKNPTDAQKVAHRD 61

Query: 679 LQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLRKQNE 500
           L K+D K LF+IHQCVD++  + I   ++ K AWD L  +Y G  +LKK++LQ+LR+Q E
Sbjct: 62  LMKRDAKALFIIHQCVDADNFQKIRSTDTAKKAWDTLEKSYAGDSKLKKMKLQTLRRQYE 121

Query: 499 SLKMKDQETITEFVTRLVALT-QMKSSGETVTELSKIEKVLRSLTLNFDHVVVAIEESKN 323
            L+M DQE+I EF +R++A+T QMK+ G+  ++L  I+ V+             IE+S+N
Sbjct: 122 LLQMSDQESIVEFFSRILAITNQMKAYGDKQSDLRIIDNVV-------------IEQSQN 168

Query: 322 LDEMRLEELQGSLEAHELRIKQRSSEKETEQALQAQASXXXXXXXXXXXXXXXXXXXXDF 143
           L+EM++EELQG LEA E+R+ +R+S++  EQA+Q Q +                     +
Sbjct: 169 LEEMKIEELQGILEAQEMRLNERNSQRLAEQAIQTQTTKGNNYDGGKNKKGKGKWKNNKW 228

Query: 142 ---GEGTSEKG---------ASDVGN----KKKTDKSSIQCYNCQQFGHYRNQCKNKKVP 11
              GEG+S  G             GN    KKK +K  IQCYNCQ++GHY ++C+NK+VP
Sbjct: 229 KGLGEGSSNSGNHNQNEETDKKSGGNHKVGKKKFNKKGIQCYNCQKWGHYVDECRNKRVP 288

Query: 10  RN 5
           RN
Sbjct: 289 RN 290


>dbj|GAU47338.1| hypothetical protein TSUD_101250, partial [Trifolium subterraneum]
          Length = 359

 Score =  233 bits (594), Expect = 7e-69
 Identities = 129/286 (45%), Positives = 172/286 (60%), Gaps = 1/286 (0%)
 Frame = -3

Query: 871 MANKSTSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXX 692
           MA KS    +F ANLPI +GKN+D W  QMKVIF  Q+V E VN  +             
Sbjct: 1   MAGKS----NFHANLPILDGKNWDTWVKQMKVIFIVQEVDEQVNTVLDPLPANATEQQRT 56

Query: 691 XNRELQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLR 512
             RE QKKD K LFLIHQCVD+ V E I D  ++K+AW+IL  +YGG   +KKV+LQ+L+
Sbjct: 57  TFREAQKKDIKALFLIHQCVDAKVFEKIADATTSKDAWNILQKSYGGDANVKKVKLQALK 116

Query: 511 KQNESLKMKDQETITEFVTRLVALT-QMKSSGETVTELSKIEKVLRSLTLNFDHVVVAIE 335
           +Q E L+MK  E I ++ TR+V LT QMK+ G T+ E   +EKVLR+LT  FDH+VV IE
Sbjct: 117 RQFELLEMKSDEAIADYFTRVVTLTNQMKNCGSTLNEEEMVEKVLRTLTHKFDHIVVTIE 176

Query: 334 ESKNLDEMRLEELQGSLEAHELRIKQRSSEKETEQALQAQASXXXXXXXXXXXXXXXXXX 155
           ++K+L E+++E+LQ +LEAHEL+  +R+  KE EQAL  +                    
Sbjct: 177 QTKDLSEIKMEDLQNTLEAHELKHGERNHGKEDEQALFVKFKKYQDEKKKWQNKKCSKKG 236

Query: 154 XXDFGEGTSEKGASDVGNKKKTDKSSIQCYNCQQFGHYRNQCKNKK 17
                + +        G K K DKS IQCYNC ++GHY ++CK  K
Sbjct: 237 KESVEDMSESSKKEGGGQKTKKDKSIIQCYNCNKYGHYDSECKTPK 282


>ref|XP_006582657.1| PREDICTED: uncharacterized protein LOC102669313 [Glycine max]
          Length = 216

 Score =  228 bits (580), Expect = 1e-68
 Identities = 109/215 (50%), Positives = 158/215 (73%), Gaps = 1/215 (0%)
 Frame = -3

Query: 859 STSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXXXNRE 680
           ++SNG+FPA++P+  GKNYD WCAQMKVIFR+QDV E+V +GV              +R+
Sbjct: 2   ASSNGNFPASMPVLKGKNYDDWCAQMKVIFRFQDVTEVVQEGVQEPDRNPTDAQKVAHRD 61

Query: 679 LQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLRKQNE 500
           L K+D K LF+IHQCVD++  + I   ++ K AWD L  +Y G  +LKKV+LQ+LR+Q E
Sbjct: 62  LMKRDAKTLFIIHQCVDADNFQKIRSADTAKKAWDTLEKSYAGDSKLKKVKLQTLRRQYE 121

Query: 499 SLKMKDQETITEFVTRLVALT-QMKSSGETVTELSKIEKVLRSLTLNFDHVVVAIEESKN 323
            L+M DQE+I EF +R++A+T QM + G+  ++L  I+KVLR+LT  FDH+VVAIE+ +N
Sbjct: 122 LLQMSDQESIGEFFSRILAITNQMNAYGDKQSDLGIIDKVLRTLTPRFDHIVVAIEQGQN 181

Query: 322 LDEMRLEELQGSLEAHELRIKQRSSEKETEQALQA 218
           L+EM++EELQG LEA E+R+ +++ ++  EQA+QA
Sbjct: 182 LEEMKIEELQGILEAQEMRLNEKNLQRSAEQAMQA 216


>ref|XP_006575979.1| PREDICTED: uncharacterized protein LOC102662365 [Glycine max]
          Length = 481

 Score =  235 bits (600), Expect = 3e-68
 Identities = 126/286 (44%), Positives = 183/286 (63%), Gaps = 1/286 (0%)
 Frame = -3

Query: 859 STSNGHFPANLPIFNGKNYDRWCAQMKVIFRYQDVLEIVNDGVPXXXXXXXXXXXXXNRE 680
           ++SNG+FPA++P+  GKNYD WCAQMKVIFR+QDV E+V +GV              +R+
Sbjct: 2   ASSNGNFPASMPVLKGKNYDDWCAQMKVIFRFQDVTEVVQEGVQEPDRNPTDAQKVAHRD 61

Query: 679 LQKKDGKGLFLIHQCVDSNVIEMIIDCESTKNAWDILNNAYGGADRLKKVQLQSLRKQNE 500
           L K+D K LF+IHQC D++  + I   ++TK AWD L  +Y G  +LKKV+LQ+LR+Q E
Sbjct: 62  LMKRDAKALFIIHQCEDADNFQKIRSADTTKKAWDTLEKSYAGDSKLKKVKLQTLRRQYE 121

Query: 499 SLKMKDQETITEFVTRLVALT-QMKSSGETVTELSKIEKVLRSLTLNFDHVVVAIEESKN 323
            L+M DQE+I EF +R++A+T QM + G+  ++L  I+KVLR+LT  FDH+VVAIE+ +N
Sbjct: 122 LLQMSDQESIGEFFSRILAITNQMNAYGDKQSDLGIIDKVLRTLTPRFDHIVVAIEQGQN 181

Query: 322 LDEMRLEELQGSLEAHELRIKQRSSEKETEQALQAQASXXXXXXXXXXXXXXXXXXXXDF 143
           L+EM++EELQG LEA ++R+ +R+S++  EQA+QAQ +                      
Sbjct: 182 LEEMKIEELQGILEAQKMRLNERNSQRSVEQAMQAQTT---------------------- 219

Query: 142 GEGTSEKGASDVGNKKKTDKSSIQCYNCQQFGHYRNQCKNKKVPRN 5
                 KG +  G K K  K   +  N +  G   ++C+NK+VPRN
Sbjct: 220 ------KGNNYDGGKNKKGKGKGK--NNKWKG--SDECRNKRVPRN 255


Top