BLASTX nr result

ID: Astragalus24_contig00012207

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00012207
         (844 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX91063.1| hypothetical protein L195_g047192, partial [Trifo...   224   7e-68
ref|XP_020208977.1| uncharacterized protein LOC109793916 [Cajanu...   222   4e-67
gb|KYP65733.1| hypothetical protein KK1_011995 [Cajanus cajan] >...   222   4e-66
ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798...   214   9e-64
gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Gly...   214   7e-63
gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Gly...   214   7e-63
ref|XP_021671488.1| uncharacterized protein LOC110658261 [Hevea ...   211   7e-63
ref|XP_019455138.1| PREDICTED: uncharacterized protein LOC109356...   221   1e-62
dbj|GAU50616.1| hypothetical protein TSUD_290710 [Trifolium subt...   211   2e-62
ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662...   211   2e-62
ref|XP_017423564.1| PREDICTED: uncharacterized protein LOC108332...   204   1e-60
gb|KHN30273.1| hypothetical protein glysoja_042433, partial [Gly...   207   2e-60
gb|PNX93614.1| retrovirus-related Pol polyprotein from transposo...   216   6e-60
gb|PNX71626.1| flavonol sulfotransferase-like protein, partial [...   209   7e-60
ref|XP_004497583.1| PREDICTED: uncharacterized protein LOC101505...   203   8e-60
gb|PNX82129.1| hypothetical protein L195_g038157 [Trifolium prat...   203   1e-59
gb|KHN34741.1| Retrovirus-related Pol polyprotein from transposo...   206   2e-59
gb|PNX85368.1| hypothetical protein L195_g041436, partial [Trifo...   200   5e-59
gb|PNY08535.1| retrovirus-related Pol polyprotein from transposo...   211   2e-58
dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subt...   207   7e-57

>gb|PNX91063.1| hypothetical protein L195_g047192, partial [Trifolium pratense]
          Length = 359

 Score =  224 bits (570), Expect = 7e-68
 Identities = 112/274 (40%), Positives = 158/274 (57%), Gaps = 9/274 (3%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVL+W+QR +S ++ KSI++ DK    W +L+ RF QGDIF+IAD+ DDL +  QGT DI
Sbjct: 82  MVLSWLQRAISESISKSILWIDKASSVWTNLELRFSQGDIFRIADIQDDLTRFQQGTLDI 141

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           S+YYT+L A+W+E+DN+ P  +CTC  PC CGA    +K ++QD  I FLKGLNE+YS+V
Sbjct: 142 SNYYTQLTAMWEEIDNFRPTKNCTCAIPCTCGAASDFQKYKEQDKVIKFLKGLNEQYSHV 201

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRIANN---------IS 513
           RSQIM+++  P +SK F LV+ QER L + TP    TE Q    ++ ++           
Sbjct: 202 RSQIMLIEPLPILSKTFSLVLVQERQLNLPTPYDPSTEKQSLAMQVQSSSFNGGGRGKSQ 261

Query: 514 XXXXXXXXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPT 693
                                     R+CTHCGK N  +  C+  +G+P G+Q KNNK  
Sbjct: 262 FPNKGRGRAGFNGGRGRGGLGDGDDTRVCTHCGKNNHIVQNCFVKYGYPPGFQHKNNK-- 319

Query: 694 ASANSAGTESESQTKAEAPNTSNSNISLTHDQYQ 795
           AS N A   +  QT  +    S  +++   +QYQ
Sbjct: 320 ASVNHAANFASEQTSTQETAPSTPSLNTIQEQYQ 353


>ref|XP_020208977.1| uncharacterized protein LOC109793916 [Cajanus cajan]
          Length = 357

 Score =  222 bits (565), Expect = 4e-67
 Identities = 114/276 (41%), Positives = 164/276 (59%), Gaps = 11/276 (3%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MV++W+Q  +S  ++KSI++FD   D W+DLK RF QGD+F++A L +DL K  QG+ D+
Sbjct: 37  MVISWLQHSISEKIVKSILWFDTASDIWQDLKARFSQGDVFRVAQLQEDLYKFHQGSLDV 96

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           ++Y+T+LK +WDE+DN  PLS C C+  C+CGA+ S  K R+QD  I FL+GLN++Y++V
Sbjct: 97  TEYFTQLKEMWDEIDNLRPLSRCKCSIACSCGAVDSSYKYREQDAVIRFLRGLNDQYTHV 156

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVA-----TPTVSETE--NQVKTPRIANNISXX 519
           RSQIM+MD  P++SK F LV QQERHL  +     T  ++ T   +  +TP    + S  
Sbjct: 157 RSQIMLMDPLPSLSKTFSLVGQQERHLNQSAIHDDTKVLAATSFGSLPQTPTTQQHQSPQ 216

Query: 520 XXXXXXXXXXXXXXXXXXXXXXTN---RLCTHCGKTNQTIDFCYFIHGFPQGYQVK-NNK 687
                                 T+   ++CTHCG+ N T+D CYF HGFP GYQ K    
Sbjct: 217 QQQFGFRRGGYSHGRGRGRGGRTHGSIKICTHCGRNNHTVDTCYFKHGFPPGYQSKGGTS 276

Query: 688 PTASANSAGTESESQTKAEAPNTSNSNISLTHDQYQ 795
              + N+  T S S   +  P ++N N   T +Q Q
Sbjct: 277 ANFTVNAVETTSPS---SMVPESNNPNFGFTQEQCQ 309


>gb|KYP65733.1| hypothetical protein KK1_011995 [Cajanus cajan]
 gb|KYP72745.1| hypothetical protein KK1_005345 [Cajanus cajan]
          Length = 445

 Score =  222 bits (565), Expect = 4e-66
 Identities = 114/276 (41%), Positives = 164/276 (59%), Gaps = 11/276 (3%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MV++W+Q  +S  ++KSI++FD   D W+DLK RF QGD+F++A L +DL K  QG+ D+
Sbjct: 82  MVISWLQHSISEKIVKSILWFDTASDIWQDLKARFSQGDVFRVAQLQEDLYKFHQGSLDV 141

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           ++Y+T+LK +WDE+DN  PLS C C+  C+CGA+ S  K R+QD  I FL+GLN++Y++V
Sbjct: 142 TEYFTQLKEMWDEIDNLRPLSRCKCSIACSCGAVDSSYKYREQDAVIRFLRGLNDQYTHV 201

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVA-----TPTVSETE--NQVKTPRIANNISXX 519
           RSQIM+MD  P++SK F LV QQERHL  +     T  ++ T   +  +TP    + S  
Sbjct: 202 RSQIMLMDPLPSLSKTFSLVGQQERHLNQSAIHDDTKVLAATSFGSLPQTPTTQQHQSPQ 261

Query: 520 XXXXXXXXXXXXXXXXXXXXXXTN---RLCTHCGKTNQTIDFCYFIHGFPQGYQVK-NNK 687
                                 T+   ++CTHCG+ N T+D CYF HGFP GYQ K    
Sbjct: 262 QQQFGFRRGGYSHGRGRGRGGRTHGSIKICTHCGRNNHTVDTCYFKHGFPPGYQSKGGTS 321

Query: 688 PTASANSAGTESESQTKAEAPNTSNSNISLTHDQYQ 795
              + N+  T S S   +  P ++N N   T +Q Q
Sbjct: 322 ANFTVNAVETTSPS---SMVPESNNPNFGFTQEQCQ 354


>ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798459 [Glycine max]
          Length = 389

 Score =  214 bits (545), Expect = 9e-64
 Identities = 110/270 (40%), Positives = 160/270 (59%), Gaps = 4/270 (1%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVLAWI R +S ++ KS+++ D     WK+L+ RF Q DIF+I+DL +DL +  QGT D+
Sbjct: 82  MVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDLQEDLYRFRQGTLDV 141

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           SDY+T+LK  WDEL+NY P+  C C+ PC+CG I S+R  R+QDY + FLKGLN+ +S+ 
Sbjct: 142 SDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYVVRFLKGLNDRFSHS 201

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERH-LQVATPTVSE-TENQVKTPRIANNISXXXXXXX 534
           +SQIMMM+  P I   F LVIQQER  L   + +VSE T +     ++ +N S       
Sbjct: 202 KSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSNFNGKGG 261

Query: 535 XXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASANSAG 714
                             NR+CTHCGKTN  +D C+   G+P GY+   +K ++S++ A 
Sbjct: 262 YYNKGKGSSKGG------NRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQAN 315

Query: 715 TESESQT--KAEAPNTSNSNISLTHDQYQG 798
             S +      +  +++ S+   T + YQG
Sbjct: 316 NTSNASALESTQQGSSAQSSFQFTQEMYQG 345


>gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Glycine soja]
          Length = 484

 Score =  214 bits (546), Expect = 7e-63
 Identities = 111/270 (41%), Positives = 160/270 (59%), Gaps = 4/270 (1%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVLAWI R +S ++ KS+++ D     WK+L+ RF Q DIF+I+DL +DL +  QGT D+
Sbjct: 74  MVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDLQEDLYRFRQGTLDV 133

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           SDY+T+LK  WDEL+NY P+  C C+ PC+CG I S+R  R+QDY I FLKGLN+ +S+ 
Sbjct: 134 SDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYVIRFLKGLNDRFSHS 193

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERH-LQVATPTVSE-TENQVKTPRIANNISXXXXXXX 534
           +SQIMMM+  P I   F LVIQQER  L   + +VSE T +     ++ +N S       
Sbjct: 194 KSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSNFNGKGG 253

Query: 535 XXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASANSAG 714
                             NR+CTHCGKTN  +D C+   G+P GY+   +K ++S++ A 
Sbjct: 254 YYNKGKGSSKGG------NRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQAN 307

Query: 715 TESESQT--KAEAPNTSNSNISLTHDQYQG 798
             S +      +  +++ S+   T + YQG
Sbjct: 308 NTSNASALESTQQGSSAQSSFQFTQEMYQG 337


>gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Glycine soja]
          Length = 484

 Score =  214 bits (546), Expect = 7e-63
 Identities = 111/270 (41%), Positives = 160/270 (59%), Gaps = 4/270 (1%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVLAWI R +S ++ KS+++ D     WK+L+ RF Q DIF+I+DL +DL +  QGT D+
Sbjct: 74  MVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFRISDLQEDLYRFRQGTLDV 133

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           SDY+T+LK  WDEL+NY P+  C C+ PC+CG I S+R  R+QDY I FLKGLN+ +S+ 
Sbjct: 134 SDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYREQDYVIRFLKGLNDRFSHS 193

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERH-LQVATPTVSE-TENQVKTPRIANNISXXXXXXX 534
           +SQIMMM+  P I   F LVIQQER  L   + +VSE T +     ++ +N S       
Sbjct: 194 KSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSNFNGKGG 253

Query: 535 XXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASANSAG 714
                             NR+CTHCGKTN  +D C+   G+P GY+   +K ++S++ A 
Sbjct: 254 YYNKGKGSSKGG------NRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQAN 307

Query: 715 TESESQT--KAEAPNTSNSNISLTHDQYQG 798
             S +      +  +++ S+   T + YQG
Sbjct: 308 NTSNASALESTQQGSSAQSSFQFTQEMYQG 337


>ref|XP_021671488.1| uncharacterized protein LOC110658261 [Hevea brasiliensis]
          Length = 363

 Score =  211 bits (537), Expect = 7e-63
 Identities = 110/272 (40%), Positives = 160/272 (58%), Gaps = 7/272 (2%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVL+W+   +S ++  SI++ DK  D WKDLK+ F QGDI +I DL +D+  + QG R +
Sbjct: 1   MVLSWLIHSLSPSITHSILWIDKTVDVWKDLKEAFSQGDILRILDLQEDIFSIKQGDRSV 60

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           +DY+TELK LWDEL N+ P+  C+C   CA GA   ++K  D DY I FLKGLN++Y+  
Sbjct: 61  TDYFTELKILWDELLNFRPIPVCSCENSCAYGAFLKIKKYHDHDYVIRFLKGLNDQYAIA 120

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPR-IANNISXXXXXXXX 537
           +SQIM++DLFP+I+KAF L++QQER L      +  TE +V   R + +++S        
Sbjct: 121 KSQIMLLDLFPSINKAFSLLVQQERQL----APILATEPKVFVNRSVRSDVSNSGAKPFF 176

Query: 538 XXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASANSAGT 717
                            +R CT+CGK   TI+ CY  HG+P GY+ +    +A AN+   
Sbjct: 177 APKTFTGIPKQFSSSRDDRFCTYCGKPRHTIETCYKKHGYPLGYKPRGY--SAFANNIFG 234

Query: 718 ESESQTKAEAP------NTSNSNISLTHDQYQ 795
            +E ++  + P      +  NSNI LT +QYQ
Sbjct: 235 SAEIESPTDTPISLAQGSNGNSNIGLTQEQYQ 266


>ref|XP_019455138.1| PREDICTED: uncharacterized protein LOC109356267 [Lupinus
           angustifolius]
          Length = 834

 Score =  221 bits (562), Expect = 1e-62
 Identities = 112/274 (40%), Positives = 159/274 (58%), Gaps = 8/274 (2%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVL+WIQ  V  +++KSI++ D   +AWKDL DRF  GDIF+IA L  +   + QG  DI
Sbjct: 80  MVLSWIQHCVDESIVKSILWIDTTAEAWKDLHDRFSHGDIFRIAALQKEFYHLDQGNLDI 139

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           SDY+T+LK LWDE++++ P  SC C TPC CGA+ S++  ++QDY I FL+GLNE++++V
Sbjct: 140 SDYFTKLKTLWDEIEDFRPFPSCKCNTPCICGAMDSLKTYKEQDYVIRFLEGLNEQFAHV 199

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQV------KTPRIANNI--SX 516
           +SQIM+MD  P I+KAF L+IQQER  Q+  P   E +N+V      +  +  NN   + 
Sbjct: 200 KSQIMLMDPLPNITKAFALLIQQERQTQLPVPPSLEPDNRVMNVSSRQDSQYRNNSTNNS 259

Query: 517 XXXXXXXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTA 696
                                   NR CT+C +TN TI+ CY  HG+P GYQ   +    
Sbjct: 260 FRGRGIIPFRGRGNRAAGFGRGQNNRFCTYCERTNHTIETCYLKHGYPPGYQSTRSSKMV 319

Query: 697 SANSAGTESESQTKAEAPNTSNSNISLTHDQYQG 798
           +  +  +   S     A  T N++ S T +Q QG
Sbjct: 320 NHTTGYSFDTSTNNEAAHQTQNNSTSFTKEQVQG 353


>dbj|GAU50616.1| hypothetical protein TSUD_290710 [Trifolium subterraneum]
          Length = 404

 Score =  211 bits (537), Expect = 2e-62
 Identities = 106/270 (39%), Positives = 153/270 (56%), Gaps = 6/270 (2%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVL+WIQR +S  ++KSI++ D     WK L+ RF  GDIF+IAD+ +++ +  QGT DI
Sbjct: 82  MVLSWIQRSISETIVKSIMWCDCAAVVWKCLERRFAHGDIFRIADILEEIARYQQGTLDI 141

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           S Y+T L  LW+EL+N+ PL  C+C  PC CGA   ++K ++QD  I FLKGLNE+Y++V
Sbjct: 142 SSYFTHLTTLWEELENFRPLKDCSCAIPCTCGAASDLKKYKEQDKVIKFLKGLNEQYASV 201

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRIANNI------SXXX 522
           RSQIM++D  P I + F LV+QQER + +   T +  + Q    ++              
Sbjct: 202 RSQIMLLDPLPDIDRCFSLVLQQERQMLIPIITDNSVDQQASIMQVRQTSYNHGKHYTSF 261

Query: 523 XXXXXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASA 702
                                 NR CTHCG+ N  +D C+ +HG+P GYQ KN+K   S 
Sbjct: 262 SSTHHGGRGRGRGNHHGGRGPNNRTCTHCGRHNHIVDTCFELHGYPPGYQHKNSK---SV 318

Query: 703 NSAGTESESQTKAEAPNTSNSNISLTHDQY 792
           N A T S +  K    N +++ I+   +QY
Sbjct: 319 NVAATASNATLKEGHINLTSATINTIQEQY 348


>ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662412 [Glycine max]
          Length = 424

 Score =  211 bits (538), Expect = 2e-62
 Identities = 102/272 (37%), Positives = 160/272 (58%), Gaps = 8/272 (2%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           +VL+W+QR  S  + KS+++ D+    WK L++RF QGDIF++AD+ +++  + QGT DI
Sbjct: 81  LVLSWLQRSTSEEIAKSLLWCDRASFVWKSLENRFSQGDIFRVADIQEEVACLQQGTLDI 140

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           S Y+T+L  LW+E++N+ P+  CTC  PC+CGA   +RK ++QD  I FLKGL ++YS+V
Sbjct: 141 SSYFTKLMTLWEEIENFRPIRDCTCAIPCSCGAATDLRKFKEQDKVIKFLKGLGDQYSHV 200

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRIANNISXXXXXXXXX 540
           RSQIM+M   P +  AF L++QQER   + + T S  ENQ      +   S         
Sbjct: 201 RSQIMLMSPLPTLDNAFNLILQQERQFNLPSTTDSSIENQSSVNHFSQTPSRPSNNSGCG 260

Query: 541 XXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASAN----- 705
                           NRLCTHC +TN T++ C+  HG+P G+Q + +  + +A+     
Sbjct: 261 RGRGYSSGGRG-----NRLCTHCNRTNHTVETCFIKHGYPPGFQHRKSNSSGNASVVNSV 315

Query: 706 -SAGTE--SESQTKAEAPNTSNSNISLTHDQY 792
             AG+   S S + + + N S++++S   +QY
Sbjct: 316 QDAGSAHISSSSSASTSTNGSSASLSTIQEQY 347


>ref|XP_017423564.1| PREDICTED: uncharacterized protein LOC108332768 [Vigna angularis]
          Length = 317

 Score =  204 bits (518), Expect = 1e-60
 Identities = 99/236 (41%), Positives = 142/236 (60%), Gaps = 13/236 (5%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVLAWI R +  ++L+SI++ D+  + W+DL+D F Q D+F+++DL +++ ++ QGT  I
Sbjct: 84  MVLAWIHRSIDDSILQSILWIDQASEVWQDLQDHFSQADMFRVSDLQEEIFRLQQGTLTI 143

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           S Y+T+LK LWDE +NY P+  C C+ PC C AIQ+ ++ RDQDY I FLKGLNE++S+V
Sbjct: 144 SQYFTQLKGLWDEFENYRPILHCKCSIPCTCEAIQAYKRYRDQDYVIRFLKGLNEQFSHV 203

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRI-------------A 501
           RSQIM++D  P I+K F L++QQER  Q+ T   +E  +  KT  +              
Sbjct: 204 RSQIMLLDPLPPINKVFSLIVQQER--QMTTIERTELSSDAKTFAVNTYQYTSLGRGLAV 261

Query: 502 NNISXXXXXXXXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGY 669
           N                            N+LCT+CGKTN T++ CYF HGF  GY
Sbjct: 262 NPNQYFGYGRGRGNRGRGRATIGRGQNSFNKLCTYCGKTNHTVETCYFKHGFHLGY 317


>gb|KHN30273.1| hypothetical protein glysoja_042433, partial [Glycine soja]
          Length = 456

 Score =  207 bits (527), Expect = 2e-60
 Identities = 100/272 (36%), Positives = 160/272 (58%), Gaps = 8/272 (2%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           +VL+W+QR  S  + KS+++ D+    WK L++RF QGDIF++AD+ +++ ++ QGT +I
Sbjct: 66  LVLSWLQRSTSEEIAKSLLWCDRASFVWKSLENRFSQGDIFRVADIQEEVARLQQGTLEI 125

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           S Y+T+L  LW+E++N+ P+  CTC  PC+CGA   +RK ++QD  I FLKGL ++YS+V
Sbjct: 126 SSYFTKLMTLWEEIENFRPIRDCTCAIPCSCGAATDLRKFKEQDKVIKFLKGLGDQYSHV 185

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRIANNISXXXXXXXXX 540
           RSQIM+M   P +  AF L++QQER   + + T S  ENQ      +   S         
Sbjct: 186 RSQIMLMSPLPTLDNAFNLILQQERQFNLPSTTDSSIENQSSVNHFSQTPSRPSNNFGCG 245

Query: 541 XXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASAN----- 705
                           NRL THC +TN T++ C+  HG+P G+Q + +  + +A+     
Sbjct: 246 RGRGYSSGGRG-----NRLRTHCNRTNHTVETCFIKHGYPPGFQHRKSNSSGNASMVNSV 300

Query: 706 -SAGTE--SESQTKAEAPNTSNSNISLTHDQY 792
             AG+   S S + + + N S++++S   +QY
Sbjct: 301 QDAGSAHISSSSSASTSTNGSSASLSTIQEQY 332


>gb|PNX93614.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Trifolium pratense]
          Length = 1430

 Score =  216 bits (549), Expect = 6e-60
 Identities = 116/287 (40%), Positives = 167/287 (58%), Gaps = 9/287 (3%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVL+WIQR +S ++ KSII+FD     WKDL+ RF  GD+FKI+DL +++ ++ QG+ DI
Sbjct: 82  MVLSWIQRSISPDIAKSIIWFDHASAVWKDLEFRFSHGDMFKISDLQEEILRLHQGSLDI 141

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           S YYT+LK+L +E++ Y P+  CTC  PC+CGA+  M+K R+QD  + FLKGLNE+YS+V
Sbjct: 142 SSYYTQLKSLSEEIEIYRPVRDCTCAIPCSCGAVADMKKYREQDCVLKFLKGLNEQYSHV 201

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETE-----NQVKTPRIANNISXXXX 525
           RSQIMMM+  P + K F LV+QQER+L V     S+ E      QV++    +  S    
Sbjct: 202 RSQIMMMEPLPPLHKVFSLVLQQERNLPVFNTVDSQNELSAMAMQVQSTGSNSQPSKNFN 261

Query: 526 XXXXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASAN 705
                               + R CTHCG  N  ID C+  +GFP GYQ   +K   S+N
Sbjct: 262 FGSGNRGRGKGRRNFGRGQHSTRYCTHCGGDNHIIDNCFVKYGFPPGYQ---SKGVQSSN 318

Query: 706 SAGTESESQTKAEAPNTSN----SNISLTHDQYQGFWTYFNKQDNNN 834
           +      S T +++   S+    S+++    Q+Q F   F +Q  +N
Sbjct: 319 AKSVNLASTTNSDSSLVSSSAMASSLNELQGQFQQFLKLFQQQTESN 365


>gb|PNX71626.1| flavonol sulfotransferase-like protein, partial [Trifolium
           pratense]
          Length = 591

 Score =  209 bits (532), Expect = 7e-60
 Identities = 106/270 (39%), Positives = 154/270 (57%), Gaps = 4/270 (1%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVLAWI R +S ++ +S+++ D     WK+L+ RF QGDIF+I+DL ++L ++ QG  D+
Sbjct: 80  MVLAWIHRSLSESIARSVLWIDSAAGLWKNLRTRFSQGDIFRISDLQEELYRLRQGNLDV 139

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           SDY+T+LK LWDEL+NY P+  C C+  C CGAI+S +  R+QDY I FLKGLN+ +SN 
Sbjct: 140 SDYFTKLKVLWDELENYRPIPFCKCSIACTCGAIESFKVYREQDYVIRFLKGLNDRFSNT 199

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHL--QVATP-TVSETENQVKTPRIANNISXXXXXX 531
           +SQIM+M+  P +   F ++IQQER +   +  P T    E    T  +AN+        
Sbjct: 200 KSQIMLMNPLPDVDTVFSMLIQQEREIAYSILDPITHDAPEVDSSTALLANSHYRNQNGK 259

Query: 532 XXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVK-NNKPTASANS 708
                              NRLCT+C  TN  +  C+  +G+P GY+ K  N    S   
Sbjct: 260 TNYYGKGKGQAPNSAPKGYNRLCTYCKGTNHIVQNCWIKYGYPPGYKNKGKNSSQPSHTV 319

Query: 709 AGTESESQTKAEAPNTSNSNISLTHDQYQG 798
           A  +S +Q  +++  T+     LT DQY G
Sbjct: 320 AAVDSSTQPDSQSSTTATPPFGLTQDQYDG 349


>ref|XP_004497583.1| PREDICTED: uncharacterized protein LOC101505117 [Cicer arietinum]
          Length = 355

 Score =  203 bits (516), Expect = 8e-60
 Identities = 98/270 (36%), Positives = 158/270 (58%), Gaps = 6/270 (2%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           M L+W+Q  +  ++  SI++ D  +  WK L+++F QGDIFKI+D+ DDL ++ QG  DI
Sbjct: 41  MTLSWLQCSILESIAHSILWIDNAHTVWKILENQFSQGDIFKISDIQDDLTRLQQGNLDI 100

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
            +Y+T+L +LW+++D++ P   C C   C CG    +RK ++QD  I FL GLNE++SNV
Sbjct: 101 INYFTKLTSLWEQIDSFRPTRDCVCAIQCTCGDTTDLRKYKNQDRVIKFLNGLNEQFSNV 160

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRIANNI-----SXXXX 525
           RSQIM+++  P++ K F LV+ QER L V   + S  ENQ    ++ NN           
Sbjct: 161 RSQIMLLEPLPSLDKTFSLVLGQERQLNVQASSNSAPENQAMAMQVQNNHYNGGGRGTNN 220

Query: 526 XXXXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASAN 705
                               +NR+CTHCG+TN T++ C+  HG+P G+Q ++++   +A 
Sbjct: 221 SNNRGKGHNNSAFGRGPYQNSNRICTHCGRTNHTVETCFLKHGYPPGFQQRHSR---AAF 277

Query: 706 SAGTESESQTKAEAPNTS-NSNISLTHDQY 792
           +  T S+SQ  +     S ++++++  DQY
Sbjct: 278 NTATASDSQDSSPVDQESTDASLTIIQDQY 307


>gb|PNX82129.1| hypothetical protein L195_g038157 [Trifolium pratense]
          Length = 392

 Score =  203 bits (517), Expect = 1e-59
 Identities = 103/276 (37%), Positives = 152/276 (55%), Gaps = 3/276 (1%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVLAWI R +S ++ +S+++ D     WK+L+ RF QGDIF+I+D+ ++L K  QGT DI
Sbjct: 82  MVLAWIHRSISDSIARSVLWIDTAAGVWKNLRIRFSQGDIFRISDIQEELYKFRQGTLDI 141

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           SDY+T+LK LWDEL+NY P+  C C+  C CGAI S+   R QDY I FLKGLN+++S+ 
Sbjct: 142 SDYFTQLKVLWDELENYRPIPHCKCSIACTCGAIDSINIYRQQDYVIRFLKGLNDKFSHT 201

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRIANN-ISXXXXXXXX 537
           +SQIM+M+  P I   F ++IQQER  ++    +    N       +N  ++        
Sbjct: 202 KSQIMLMNPLPDIDTVFSMLIQQER--EIGNSVIDSIVNDAPDKNSSNVFLANSSYGNFH 259

Query: 538 XXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGY--QVKNNKPTASANSA 711
                           +NR CTHC  TN  ++ C+  HG+P GY  + KN+  +  ANSA
Sbjct: 260 GKYNSKGKGQHSGSKGSNRFCTHCQGTNHIVENCWIKHGYPIGYKGKGKNSFQSTQANSA 319

Query: 712 GTESESQTKAEAPNTSNSNISLTHDQYQGFWTYFNK 819
              +         +++      T +QY G    F +
Sbjct: 320 AVPNSPMQLDSTTSSTKPPFGFTQEQYHGILGLFQQ 355


>gb|KHN34741.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 495

 Score =  206 bits (523), Expect = 2e-59
 Identities = 109/262 (41%), Positives = 156/262 (59%), Gaps = 2/262 (0%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVLAWI R +S ++ KS+++ D     WK+L+ RF   DIF+I+DL +DL +  QGT D+
Sbjct: 71  MVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSHSDIFRISDLQEDLYRFRQGTLDV 130

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           SDY+T+LK  WDEL+NY P+  C C+ PC+CG I S+R  R+QDY I FLKGLN+ +S+ 
Sbjct: 131 SDYFTQLKIYWDELENYRPIPYCKCSIPCSCGGIDSVRVYREQDYVIRFLKGLNDRFSHS 190

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERH-LQVATPTVSE-TENQVKTPRIANNISXXXXXXX 534
           +SQIMMM+  P I   F LVIQQER  L   + +VSE T +     ++ +N S       
Sbjct: 191 KSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAMAMQVNSNQSNFNGKGG 250

Query: 535 XXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASANSAG 714
                             NR+CTHCGKTN  +D C+   G+P GY+   +K ++S++ A 
Sbjct: 251 YYNKGKGSSKGG------NRVCTHCGKTNHIVDNCFEKIGYPPGYKTNKSKNSSSSSQAN 304

Query: 715 TESESQTKAEAPNTSNSNISLT 780
             S + +  E+    +S  S+T
Sbjct: 305 NTSNA-SALESTQQGSSAQSIT 325


>gb|PNX85368.1| hypothetical protein L195_g041436, partial [Trifolium pratense]
          Length = 337

 Score =  200 bits (509), Expect = 5e-59
 Identities = 98/236 (41%), Positives = 138/236 (58%), Gaps = 7/236 (2%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVL+W+QR +S ++ KSI++ DK    W +L+ RF QGDIF+IAD+ DDL +  QGT DI
Sbjct: 82  MVLSWLQRSISESISKSILWIDKASSVWTNLELRFSQGDIFRIADIQDDLTRFQQGTLDI 141

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           S+YYT+L A+W+E+DN+ P  +CTC  PC CGA    +K ++QD  I FLKGLNE+YS+V
Sbjct: 142 SNYYTQLTAMWEEIDNFRPTKNCTCAIPCTCGAASDFQKYKEQDKVIKFLKGLNEQYSHV 201

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHLQVATPTVSETENQVKTPRIANNIS-------XX 519
           RS IM+++  P +SK F +V+ QER L +       TE Q    ++ ++ S         
Sbjct: 202 RSHIMLIEPLPNLSKTFSMVLGQERQLNLPILPDPSTEKQPLAMQVQSSSSNGGGRGKSQ 261

Query: 520 XXXXXXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNK 687
                                     CTHCGK N  +  C+  +G+P G+Q KNNK
Sbjct: 262 YPNKGRGRANFSGGRGGLGGGRDTGGCTHCGKNNHIVQNCFVKYGYPPGFQQKNNK 317


>gb|PNY08535.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Trifolium pratense]
          Length = 1205

 Score =  211 bits (538), Expect = 2e-58
 Identities = 108/270 (40%), Positives = 154/270 (57%), Gaps = 4/270 (1%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVLAW+ R VS ++ +SI++ D     WK+L+ RF QGDIF+I+D+ ++L +  QG  DI
Sbjct: 80  MVLAWLHRSVSESIARSILWIDSAAGVWKNLRIRFSQGDIFRISDIQEELYRFRQGNLDI 139

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           SDY+T+LK LWDEL+NY P+  C C+ PC CGAI S +  R+QDY I FLKGLN+ +SN 
Sbjct: 140 SDYFTKLKVLWDELENYRPIPLCKCSIPCTCGAIDSFKVYREQDYVIRFLKGLNDRFSNT 199

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHL--QVATP-TVSETENQVKTPRIANNISXXXXXX 531
           +SQIM+M+  P +   F ++IQQER +   +  P T    E    T  +AN+ S      
Sbjct: 200 KSQIMLMNPLPDVDTVFSMLIQQEREIAYSILDPITHDAPEVDSSTALLANSHSRNQNGK 259

Query: 532 XXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVK-NNKPTASANS 708
                              +RLCT+C  TN  +  C+  +G+P GY+ K  N    S   
Sbjct: 260 SNYYGKGKGQAPNSAPKGHDRLCTYCKGTNHVVQNCWIKYGYPPGYKNKGKNSSQPSHTV 319

Query: 709 AGTESESQTKAEAPNTSNSNISLTHDQYQG 798
           A  +S +Q  +++  T+     LT DQY G
Sbjct: 320 AAVDSSTQLDSQSSTTATPPFGLTQDQYDG 349


>dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subterraneum]
          Length = 1178

 Score =  207 bits (526), Expect = 7e-57
 Identities = 106/272 (38%), Positives = 156/272 (57%), Gaps = 6/272 (2%)
 Frame = +1

Query: 1   MVLAWIQRPVSANVLKSIIFFDKGYDAWKDLKDRFDQGDIFKIADL*DDLCKVTQGTRDI 180
           MVLAWI R +S ++ +S+++ D     WK+L+ RF QGDIF+I+DL ++L ++ QG  D+
Sbjct: 80  MVLAWIHRSLSDSIARSVLWIDSAASLWKNLRTRFSQGDIFRISDLQEELYRLRQGNLDV 139

Query: 181 SDYYTELKALWDELDNYMPLSSCTCTTPCACGAIQSMRKIRDQDYTINFLKGLNEEYSNV 360
           SDY+T+L+ LWDEL+NY P+  C C+  C CGA++S +  R+QDY I FLKGLN+ +SN 
Sbjct: 140 SDYFTKLQVLWDELENYRPIPLCKCSIACTCGAVESFKLYREQDYVIRFLKGLNDRFSNT 199

Query: 361 RSQIMMMDLFPAISKAFLLVIQQERHL--QVATP-TVSETENQVKTPRIANNISXXXXXX 531
           +SQIM+++  P +   F ++IQQER +   +  P T    E    T  +AN+        
Sbjct: 200 KSQIMLINPLPDVDTVFSMLIQQEREIAYSILDPITHDAPEVDFSTALLANSHYKNQNGK 259

Query: 532 XXXXXXXXXXXXXXXXXXTNRLCTHCGKTNQTIDFCYFIHGFPQGYQVKNNKPTASANS- 708
                              NRLCTHC  TN  +  C+  +G+P GY  KNN+  +S  S 
Sbjct: 260 SNYYGKGRGQAPNSAPKGHNRLCTHCRGTNHIVQDCWIKYGYPPGY--KNNRKNSSQPSH 317

Query: 709 --AGTESESQTKAEAPNTSNSNISLTHDQYQG 798
             A  +S +Q  ++  NT+     LT  QY G
Sbjct: 318 IVAAVDSSTQHDSQFSNTATPPFGLTQVQYDG 349

