BLASTX nr result

ID: Astragalus23_contig00028154 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00028154
         (1538 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX71626.1| flavonol sulfotransferase-like protein, partial [...   428   e-141
gb|PNX82129.1| hypothetical protein L195_g038157 [Trifolium prat...   420   e-141
dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subt...   413   e-135
gb|PNY08535.1| retrovirus-related Pol polyprotein from transposo...   408   e-134
gb|PNX55412.1| hypothetical protein L195_g049041, partial [Trifo...   391   e-130
gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Gly...   335   e-118
gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Gly...   334   e-118
ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798...   347   e-113
gb|PNX91084.1| hypothetical protein L195_g047213, partial [Trifo...   313   e-105
gb|KHN34741.1| Retrovirus-related Pol polyprotein from transposo...   316   e-101
gb|PNX71325.1| hypothetical protein L195_g027200, partial [Trifo...   291   2e-94
ref|XP_014621696.1| PREDICTED: uncharacterized protein LOC106795...   262   2e-88
ref|XP_014626210.1| PREDICTED: uncharacterized protein LOC106797...   261   5e-88
ref|XP_014627175.1| PREDICTED: uncharacterized protein LOC106797...   259   2e-87
dbj|GAU50616.1| hypothetical protein TSUD_290710 [Trifolium subt...   248   5e-82
ref|XP_014632403.1| PREDICTED: uncharacterized protein LOC106798...   241   5e-82
gb|KYP65733.1| hypothetical protein KK1_011995 [Cajanus cajan] >...   255   7e-80
ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662...   243   8e-79
gb|PNX93614.1| retrovirus-related Pol polyprotein from transposo...   234   1e-77
ref|XP_019455138.1| PREDICTED: uncharacterized protein LOC109356...   261   1e-74

>gb|PNX71626.1| flavonol sulfotransferase-like protein, partial [Trifolium pratense]
          Length = 591

 Score =  428 bits (1101), Expect = e-141
 Identities = 213/332 (64%), Positives = 250/332 (75%), Gaps = 5/332 (1%)
 Frame = +1

Query: 358  DPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFRIFDIQE 522
            DP+Y+PWIRCNTMVLAWIHRSLSESIA+S+      AG+WKNLRTRFSQGDIFRI D+QE
Sbjct: 68   DPLYSPWIRCNTMVLAWIHRSLSESIARSVLWIDSAAGLWKNLRTRFSQGDIFRISDLQE 127

Query: 523  ELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYREQDYVIR 702
            ELY+ RQGNL++SDYFT+LKVLWDELE+YRP+P CKC+IACTCGA++S K+YREQDYVIR
Sbjct: 128  ELYRLRQGNLDVSDYFTKLKVLWDELENYRPIPFCKCSIACTCGAIESFKVYREQDYVIR 187

Query: 703  FLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPETEVSTTL 882
            FLKGLNDRFS TKSQIMLM PLP++DTVFSMLIQQEREI +S+LDPI HDAPE + ST L
Sbjct: 188  FLKGLNDRFSNTKSQIMLMNPLPDVDTVFSMLIQQEREIAYSILDPITHDAPEVDSSTAL 247

Query: 883  LANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGYPPGFKG 1062
            LANS Y                      KG NR+CT+C  TNH + + W+K+GYPPG+K 
Sbjct: 248  LANSHYRNQNGKTNYYGKGKGQAPNSAPKGYNRLCTYCKGTNHIVQNCWIKYGYPPGYKN 307

Query: 1063 KGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSMVXXXXXXXXXX 1242
            KGKN  Q   S+ V A +SSTQ DS Q+S +   PFGLTQ+QY GILSM+          
Sbjct: 308  KGKNSSQ--PSHTVAAVDSSTQPDS-QSSTTATPPFGLTQDQYDGILSMI-----QQSKS 359

Query: 1243 XXXXXXNFVSTTPLALNSQSSSDHDWLQGSDW 1338
                  N VSTTPLAL+SQSS+ +DW QGS W
Sbjct: 360  QPTPTVNSVSTTPLALHSQSSTSNDWYQGSXW 391



 Score = 67.4 bits (163), Expect(2) = 1e-17
 Identities = 29/34 (85%), Positives = 32/34 (94%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTL 340
           VSPPLDHKNYHTW+ SM IALISKNK+KF+DGTL
Sbjct: 28  VSPPLDHKNYHTWSRSMQIALISKNKDKFIDGTL 61



 Score = 53.1 bits (126), Expect(2) = 1e-17
 Identities = 22/25 (88%), Positives = 25/25 (100%)
 Frame = +3

Query: 165 SYSDFSTNSANPYYLHPNENPALIL 239
           +YSDFS+NSANPYYLHPNENPA+IL
Sbjct: 3   TYSDFSSNSANPYYLHPNENPAVIL 27


>gb|PNX82129.1| hypothetical protein L195_g038157 [Trifolium pratense]
          Length = 392

 Score =  420 bits (1080), Expect = e-141
 Identities = 212/340 (62%), Positives = 247/340 (72%), Gaps = 5/340 (1%)
 Frame = +1

Query: 340  PKLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFR 504
            PK   TDP+Y PWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR RFSQGDIFR
Sbjct: 64   PKPSITDPLYGPWIRCNTMVLAWIHRSISDSIARSVLWIDTAAGVWKNLRIRFSQGDIFR 123

Query: 505  IFDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYRE 684
            I DIQEELYKFRQG L+ISDYFTQLKVLWDELE+YRP+P CKC+IACTCGA+DS+ IYR+
Sbjct: 124  ISDIQEELYKFRQGTLDISDYFTQLKVLWDELENYRPIPHCKCSIACTCGAIDSINIYRQ 183

Query: 685  QDYVIRFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPET 864
            QDYVIRFLKGLND+FS TKSQIMLM PLP+IDTVFSMLIQQEREI +SV+D IV+DAP+ 
Sbjct: 184  QDYVIRFLKGLNDKFSHTKSQIMLMNPLPDIDTVFSMLIQQEREIGNSVIDSIVNDAPDK 243

Query: 865  EVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGY 1044
              S   LANS Y                     +KG+NR CTHC  TNH +++ W+KHGY
Sbjct: 244  NSSNVFLANSSYGNFHGKYNSKGKGQHSG----SKGSNRFCTHCQGTNHIVENCWIKHGY 299

Query: 1045 PPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSMVXXXX 1224
            P G+KGKGKN FQ +Q+N+     S  Q DS   ++S K PFG TQEQY GIL +     
Sbjct: 300  PIGYKGKGKNSFQSTQANSAAVPNSPMQLDS--TTSSTKPPFGFTQEQYHGILGL----- 352

Query: 1225 XXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSDWYS 1344
                        N VST+PLA NSQSS+ ++  QGSDWYS
Sbjct: 353  FQQLKHQPTPASNSVSTSPLAFNSQSSNGNELYQGSDWYS 392



 Score = 68.6 bits (166), Expect(2) = 2e-20
 Identities = 29/35 (82%), Positives = 33/35 (94%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTLP 343
           VSPPLDHKNYHTWA SM+IALISKNK+KF+DG+ P
Sbjct: 30  VSPPLDHKNYHTWARSMNIALISKNKDKFIDGSFP 64



 Score = 60.8 bits (146), Expect(2) = 2e-20
 Identities = 27/29 (93%), Positives = 28/29 (96%)
 Frame = +3

Query: 153 MATTSYSDFSTNSANPYYLHPNENPALIL 239
           MAT +YSDFSTNSANPYYLHPNENPALIL
Sbjct: 1   MATINYSDFSTNSANPYYLHPNENPALIL 29


>dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subterraneum]
          Length = 1178

 Score =  413 bits (1062), Expect(3) = e-135
 Identities = 208/331 (62%), Positives = 245/331 (74%), Gaps = 5/331 (1%)
 Frame = +1

Query: 358  DPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFRIFDIQE 522
            DP+Y+PWIRCNTMVLAWIHRSLS+SIA+S+      A +WKNLRTRFSQGDIFRI D+QE
Sbjct: 68   DPLYSPWIRCNTMVLAWIHRSLSDSIARSVLWIDSAASLWKNLRTRFSQGDIFRISDLQE 127

Query: 523  ELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYREQDYVIR 702
            ELY+ RQGNL++SDYFT+L+VLWDELE+YRP+PLCKC+IACTCGAV+S K+YREQDYVIR
Sbjct: 128  ELYRLRQGNLDVSDYFTKLQVLWDELENYRPIPLCKCSIACTCGAVESFKLYREQDYVIR 187

Query: 703  FLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPETEVSTTL 882
            FLKGLNDRFS TKSQIML+ PLP++DTVFSMLIQQEREI +S+LDPI HDAPE + ST L
Sbjct: 188  FLKGLNDRFSNTKSQIMLINPLPDVDTVFSMLIQQEREIAYSILDPITHDAPEVDFSTAL 247

Query: 883  LANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGYPPGFKG 1062
            LANS Y                      KG NR+CTHC  TNH +   W+K+GYPPG+K 
Sbjct: 248  LANSHYKNQNGKSNYYGKGRGQAPNSAPKGHNRLCTHCRGTNHIVQDCWIKYGYPPGYKN 307

Query: 1063 KGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSMVXXXXXXXXXX 1242
              KN  Q   S+ V A +SSTQ+DS Q S +   PFGLTQ QY GI+SM+          
Sbjct: 308  NRKNSSQ--PSHIVAAVDSSTQHDS-QFSNTATPPFGLTQVQYDGIISMI-----QQSKS 359

Query: 1243 XXXXXXNFVSTTPLALNSQSSSDHDWLQGSD 1335
                  N VSTTPLA +SQSS+ +DW QGSD
Sbjct: 360  QPTPTVNSVSTTPLAFHSQSSNSNDWYQGSD 390



 Score = 67.4 bits (163), Expect(3) = e-135
 Identities = 29/34 (85%), Positives = 32/34 (94%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTL 340
           VSPPLDHKNYHTW+ SM IALISKNK+KF+DGTL
Sbjct: 28  VSPPLDHKNYHTWSRSMQIALISKNKDKFIDGTL 61



 Score = 54.7 bits (130), Expect(3) = e-135
 Identities = 23/25 (92%), Positives = 25/25 (100%)
 Frame = +3

Query: 165 SYSDFSTNSANPYYLHPNENPALIL 239
           +YSDFSTNSANPYYLHPNENPA+IL
Sbjct: 3   TYSDFSTNSANPYYLHPNENPAVIL 27


>gb|PNY08535.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1205

 Score =  408 bits (1048), Expect(3) = e-134
 Identities = 207/324 (63%), Positives = 244/324 (75%), Gaps = 5/324 (1%)
 Frame = +1

Query: 358  DPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFRIFDIQE 522
            DP+++PWIRCNTMVLAW+HRS+SESIA+SI      AGVWKNLR RFSQGDIFRI DIQE
Sbjct: 68   DPLFSPWIRCNTMVLAWLHRSVSESIARSILWIDSAAGVWKNLRIRFSQGDIFRISDIQE 127

Query: 523  ELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYREQDYVIR 702
            ELY+FRQGNL+ISDYFT+LKVLWDELE+YRP+PLCKC+I CTCGA+DS K+YREQDYVIR
Sbjct: 128  ELYRFRQGNLDISDYFTKLKVLWDELENYRPIPLCKCSIPCTCGAIDSFKVYREQDYVIR 187

Query: 703  FLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPETEVSTTL 882
            FLKGLNDRFS TKSQIMLM PLP++DTVFSMLIQQEREI +S+LDPI HDAPE + ST L
Sbjct: 188  FLKGLNDRFSNTKSQIMLMNPLPDVDTVFSMLIQQEREIAYSILDPITHDAPEVDSSTAL 247

Query: 883  LANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGYPPGFKG 1062
            LANS                        KG +R+CT+C  TNH + + W+K+GYPPG+K 
Sbjct: 248  LANSHSRNQNGKSNYYGKGKGQAPNSAPKGHDRLCTYCKGTNHVVQNCWIKYGYPPGYKN 307

Query: 1063 KGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSMVXXXXXXXXXX 1242
            KGKN  Q   S+ V A +SSTQ DS Q+S +   PFGLTQ+QY GILSM+          
Sbjct: 308  KGKNSSQ--PSHTVAAVDSSTQLDS-QSSTTATPPFGLTQDQYDGILSMI-----RQSKS 359

Query: 1243 XXXXXXNFVSTTPLALNSQSSSDH 1314
                  N VSTTPLAL+SQSS+++
Sbjct: 360  QPTPTVNSVSTTPLALHSQSSTNN 383



 Score = 67.4 bits (163), Expect(3) = e-134
 Identities = 29/34 (85%), Positives = 32/34 (94%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTL 340
           VSPPLDHKNYHTW+ SM IALISKNK+KF+DGTL
Sbjct: 28  VSPPLDHKNYHTWSRSMQIALISKNKDKFIDGTL 61



 Score = 55.1 bits (131), Expect(3) = e-134
 Identities = 23/25 (92%), Positives = 25/25 (100%)
 Frame = +3

Query: 165 SYSDFSTNSANPYYLHPNENPALIL 239
           +YSDFSTNSANPYYLHPNENPA+IL
Sbjct: 3   TYSDFSTNSANPYYLHPNENPAMIL 27


>gb|PNX55412.1| hypothetical protein L195_g049041, partial [Trifolium pratense]
          Length = 338

 Score =  391 bits (1004), Expect = e-130
 Identities = 202/331 (61%), Positives = 235/331 (70%), Gaps = 8/331 (2%)
 Frame = +1

Query: 340  PKLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFR 504
            PK   TDP+Y PWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNL+ RFSQGDIFR
Sbjct: 19   PKPSITDPLYGPWIRCNTMVLAWIHRSISDSIARSVLWIDTAAGVWKNLKIRFSQGDIFR 78

Query: 505  IFDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYRE 684
            I DIQEELYKFRQG L+ISDYFTQLKVLWDELE+YRP+P CKC+IACTCGA+DS+ IYR+
Sbjct: 79   ISDIQEELYKFRQGTLDISDYFTQLKVLWDELENYRPIPHCKCSIACTCGAIDSINIYRQ 138

Query: 685  QDYVIRFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPET 864
            QDYVIRFLKGLNDRFS TKSQIMLM PLP+IDTVFSMLIQQEREI +SV+D IV+DAP+ 
Sbjct: 139  QDYVIRFLKGLNDRFSHTKSQIMLMNPLPDIDTVFSMLIQQEREIGNSVIDSIVNDAPDR 198

Query: 865  EVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGY 1044
              S  LLANS Y                     +KG NR CT+C  TNH +++ W+KHGY
Sbjct: 199  NSSNVLLANSYYGKYNSKGKGQNSG--------SKGGNRFCTYCKGTNHIVENCWIKHGY 250

Query: 1045 PPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQ---ASASNKAPFGLTQEQYQGILSMVX 1215
            P G+KGKGKN  Q +Q N+V A  +     S Q    ++S K  FG TQEQY GIL +  
Sbjct: 251  PIGYKGKGKNLSQSTQVNSVAAPNAVVPKSSLQLDSTTSSTKPLFGFTQEQYHGILGL-- 308

Query: 1216 XXXXXXXXXXXXXXXNFVSTTPLALNSQSSS 1308
                           N VST+PL  NSQSS+
Sbjct: 309  ---FQQLQSQPSPSSNSVSTSPLVFNSQSSN 336


>gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Glycine soja]
          Length = 484

 Score =  335 bits (858), Expect(4) = e-118
 Identities = 176/332 (53%), Positives = 223/332 (67%), Gaps = 9/332 (2%)
 Frame = +1

Query: 340  PKLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFR 504
            PK   +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR RFSQ DIFR
Sbjct: 56   PKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFR 115

Query: 505  IFDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYRE 684
            I D+QE+LY+FRQG L++SDYFTQLK+ WDELE+YRP+P CKC+I C+CG +DSV++YRE
Sbjct: 116  ISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYRE 175

Query: 685  QDYVIRFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPET 864
            QDYVIRFLKGLNDRFS +KSQIM+M PLP+ID VFS++IQQERE+  S  D +     ++
Sbjct: 176  QDYVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDS 235

Query: 865  EVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGY 1044
             ++  + +N                        +KG NRVCTHC +TNH +D+ + K GY
Sbjct: 236  AMAMQVNSNQSNFNGKGGYYNKGKG-------SSKGGNRVCTHCGKTNHIVDNCFEKIGY 288

Query: 1045 PPGFK-GKGKNPFQQSQSN---NVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSMV 1212
            PPG+K  K KN    SQ+N   N +A ES+ Q  SAQ+S      F  TQE YQGIL  +
Sbjct: 289  PPGYKTNKSKNSSSSSQANNTSNASALESTQQGSSAQSS------FQFTQEMYQGILEAL 342

Query: 1213 XXXXXXXXXXXXXXXXNFVSTTPLALNSQSSS 1308
                            N V+T+P AL+S SS+
Sbjct: 343  -----QQSKVGSQPKANLVTTSPFALHSPSSN 369



 Score = 62.0 bits (149), Expect(4) = e-118
 Identities = 27/35 (77%), Positives = 31/35 (88%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTLP 343
           VSP L  KNYHTW+ SMHIALISKNK+KF+DG+LP
Sbjct: 22  VSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLP 56



 Score = 53.9 bits (128), Expect(4) = e-118
 Identities = 27/69 (39%), Positives = 39/69 (56%)
 Frame = +2

Query: 1322 SKAVIGIARMRRGLYILDIEDPXXXXXXXXXXXXXXXNVLHGDSQLWHLRLGHISDIGLK 1501
            S   IG A+++RGLY++D  D                ++     +LWH RLGH+S+ G++
Sbjct: 404  SLETIGTAKLQRGLYVIDTAD----------MIRSCNSISSHSFELWHSRLGHVSNSGMQ 453

Query: 1502 TISKQFPFI 1528
             ISKQFPFI
Sbjct: 454  AISKQFPFI 462



 Score = 48.5 bits (114), Expect(4) = e-118
 Identities = 20/21 (95%), Positives = 21/21 (100%)
 Frame = +3

Query: 177 FSTNSANPYYLHPNENPALIL 239
           FSTNSANPYYLHPNENPAL+L
Sbjct: 1   FSTNSANPYYLHPNENPALVL 21


>gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Glycine soja]
          Length = 484

 Score =  334 bits (856), Expect(4) = e-118
 Identities = 176/332 (53%), Positives = 223/332 (67%), Gaps = 9/332 (2%)
 Frame = +1

Query: 340  PKLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFR 504
            PK   +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR RFSQ DIFR
Sbjct: 56   PKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFR 115

Query: 505  IFDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYRE 684
            I D+QE+LY+FRQG L++SDYFTQLK+ WDELE+YRP+P CKC+I C+CG +DSV++YRE
Sbjct: 116  ISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYRE 175

Query: 685  QDYVIRFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPET 864
            QDYVIRFLKGLNDRFS +KSQIM+M PLP+ID VFS++IQQERE+  S  D +     ++
Sbjct: 176  QDYVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDS 235

Query: 865  EVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGY 1044
             ++  + +N                        +KG NRVCTHC +TNH +D+ + K GY
Sbjct: 236  AMAMQVNSNQSNFNGKGGYYNKGKG-------SSKGGNRVCTHCGKTNHIVDNCFEKIGY 288

Query: 1045 PPGFK-GKGKNPFQQSQSN---NVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSMV 1212
            PPG+K  K KN    SQ+N   N +A ES+ Q  SAQ+S      F  TQE YQGIL  +
Sbjct: 289  PPGYKTNKSKNSSSSSQANNTSNASALESTQQGSSAQSS------FQFTQEMYQGILEAL 342

Query: 1213 XXXXXXXXXXXXXXXXNFVSTTPLALNSQSSS 1308
                            N V+T+P AL+S SS+
Sbjct: 343  -----QQSKVGSQPKANSVTTSPFALHSPSSN 369



 Score = 62.0 bits (149), Expect(4) = e-118
 Identities = 27/35 (77%), Positives = 31/35 (88%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTLP 343
           VSP L  KNYHTW+ SMHIALISKNK+KF+DG+LP
Sbjct: 22  VSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLP 56



 Score = 53.9 bits (128), Expect(4) = e-118
 Identities = 27/69 (39%), Positives = 39/69 (56%)
 Frame = +2

Query: 1322 SKAVIGIARMRRGLYILDIEDPXXXXXXXXXXXXXXXNVLHGDSQLWHLRLGHISDIGLK 1501
            S   IG A+++RGLY++D  D                ++     +LWH RLGH+S+ G++
Sbjct: 404  SLETIGTAKLQRGLYVIDTAD----------MIRSCNSISSHSFELWHSRLGHVSNSGMQ 453

Query: 1502 TISKQFPFI 1528
             ISKQFPFI
Sbjct: 454  AISKQFPFI 462



 Score = 48.5 bits (114), Expect(4) = e-118
 Identities = 20/21 (95%), Positives = 21/21 (100%)
 Frame = +3

Query: 177 FSTNSANPYYLHPNENPALIL 239
           FSTNSANPYYLHPNENPAL+L
Sbjct: 1   FSTNSANPYYLHPNENPALVL 21


>ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798459 [Glycine max]
          Length = 389

 Score =  347 bits (889), Expect(3) = e-113
 Identities = 180/344 (52%), Positives = 231/344 (67%), Gaps = 9/344 (2%)
 Frame = +1

Query: 340  PKLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFR 504
            PK   +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR RFSQ DIFR
Sbjct: 64   PKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFR 123

Query: 505  IFDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYRE 684
            I D+QE+LY+FRQG L++SDYFTQLK+ WDELE+YRP+P CKC+I C+CG +DSV++YRE
Sbjct: 124  ISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYRE 183

Query: 685  QDYVIRFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPET 864
            QDYV+RFLKGLNDRFS +KSQIM+M PLP+ID VFS++IQQERE+  S  D +     ++
Sbjct: 184  QDYVVRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDS 243

Query: 865  EVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGY 1044
             ++  + +N                        +KG NRVCTHC +TNH +D+ + K GY
Sbjct: 244  AMAMQVNSNQSNFNGKGGYYNKGKG-------SSKGGNRVCTHCGKTNHIVDNCFEKIGY 296

Query: 1045 PPGFK-GKGKNPFQQSQSN---NVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSMV 1212
            PPG+K  K KN    SQ+N   N +A ES+ Q  SAQ+S      F  TQE YQGIL  +
Sbjct: 297  PPGYKTNKSKNSSSSSQANNTSNASALESTQQGSSAQSS------FQFTQEMYQGILEAL 350

Query: 1213 XXXXXXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSDWYS 1344
                            N V+T+P AL+S SS+ ++   G+DWYS
Sbjct: 351  -----QQSKVGSQPKANSVTTSPFALHSPSSNPNESFSGNDWYS 389



 Score = 62.0 bits (149), Expect(3) = e-113
 Identities = 27/35 (77%), Positives = 31/35 (88%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTLP 343
           VSP L  KNYHTW+ SMHIALISKNK+KF+DG+LP
Sbjct: 30  VSPSLTAKNYHTWSHSMHIALISKNKDKFIDGSLP 64



 Score = 54.3 bits (129), Expect(3) = e-113
 Identities = 23/29 (79%), Positives = 26/29 (89%)
 Frame = +3

Query: 153 MATTSYSDFSTNSANPYYLHPNENPALIL 239
           MA  ++ DFSTNSANPYYLHPNENPAL+L
Sbjct: 1   MALQNFVDFSTNSANPYYLHPNENPALVL 29


>gb|PNX91084.1| hypothetical protein L195_g047213, partial [Trifolium pratense]
          Length = 417

 Score =  313 bits (802), Expect(3) = e-105
 Identities = 164/298 (55%), Positives = 210/298 (70%), Gaps = 10/298 (3%)
 Frame = +1

Query: 340  PKLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFR 504
            PK   +DP+YAPWIRCNTMVLAWIHRS+SESIA+S+      AGVWKNLR RFSQ DIFR
Sbjct: 121  PKPPVSDPLYAPWIRCNTMVLAWIHRSISESIARSVLWIETAAGVWKNLRVRFSQSDIFR 180

Query: 505  IFDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYRE 684
            I D+QE++Y+FRQG L++SDYFTQLKV WDELE+YRPLP CKC+I C+CG +DSV+ YRE
Sbjct: 181  ISDLQEDMYRFRQGTLDVSDYFTQLKVYWDELENYRPLPYCKCSIPCSCGVIDSVRAYRE 240

Query: 685  QDYVIRFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREI-----THSVLDPIVH 849
            QD+VIRFLKGLN+RFS +KSQIM+M PLP+ID  FS++IQQERE+     + SV +    
Sbjct: 241  QDFVIRFLKGLNERFSHSKSQIMMMNPLPDIDRAFSLVIQQEREMLSFNNSDSVSEATSD 300

Query: 850  DAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*W 1029
             A   +V++T  +NS                       ++  NRVCTHC +TNH +D+ +
Sbjct: 301  SAMVMQVNST-KSNSHGKKSFXYKEKGQG--------SSQSGNRVCTHCGKTNHIVDNCF 351

Query: 1030 VKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQGIL 1203
             K GYPPG+K    N F +S S+ VN + S++  +S Q  +S ++ F  TQE YQGIL
Sbjct: 352  EKIGYPPGYK---TNKF-KSSSSQVNNTSSASALESVQQGSSAQSNFQFTQEMYQGIL 405



 Score = 63.2 bits (152), Expect(3) = e-105
 Identities = 28/35 (80%), Positives = 31/35 (88%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTLP 343
           VSP L  KNYHTW+ SMHIALISKNKEKF+DG+LP
Sbjct: 87  VSPSLTAKNYHTWSRSMHIALISKNKEKFIDGSLP 121



 Score = 57.0 bits (136), Expect(3) = e-105
 Identities = 24/39 (61%), Positives = 32/39 (82%)
 Frame = +3

Query: 123 LRLKKPLLPIMATTSYSDFSTNSANPYYLHPNENPALIL 239
           +++++ L+  MA  +Y DF TNSANPYYLHPNENPAL+L
Sbjct: 48  VKIRRLLVGTMALQNYIDFPTNSANPYYLHPNENPALVL 86


>gb|KHN34741.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Glycine soja]
          Length = 495

 Score =  316 bits (809), Expect(3) = e-101
 Identities = 159/288 (55%), Positives = 203/288 (70%), Gaps = 9/288 (3%)
 Frame = +1

Query: 340  PKLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFR 504
            PK   +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR RFS  DIFR
Sbjct: 53   PKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSHSDIFR 112

Query: 505  IFDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYRE 684
            I D+QE+LY+FRQG L++SDYFTQLK+ WDELE+YRP+P CKC+I C+CG +DSV++YRE
Sbjct: 113  ISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPYCKCSIPCSCGGIDSVRVYRE 172

Query: 685  QDYVIRFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPET 864
            QDYVIRFLKGLNDRFS +KSQIM+M PLP+ID VFS++IQQERE+  S  D +     ++
Sbjct: 173  QDYVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDS 232

Query: 865  EVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGY 1044
             ++  + +N                        +KG NRVCTHC +TNH +D+ + K GY
Sbjct: 233  AMAMQVNSNQSNFNGKGGYYNKGKG-------SSKGGNRVCTHCGKTNHIVDNCFEKIGY 285

Query: 1045 PPGFK-GKGKNPFQQSQSN---NVNASESSTQNDSAQASASNKAPFGL 1176
            PPG+K  K KN    SQ+N   N +A ES+ Q  SAQ+  +  +PF L
Sbjct: 286  PPGYKTNKSKNSSSSSQANNTSNASALESTQQGSSAQSITT--SPFAL 331



 Score = 62.0 bits (149), Expect(3) = e-101
 Identities = 27/35 (77%), Positives = 31/35 (88%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTLP 343
           VSP L  KNYHTW+ SMHIALISKNK+KF+DG+LP
Sbjct: 19  VSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLP 53



 Score = 42.7 bits (99), Expect(3) = e-101
 Identities = 17/18 (94%), Positives = 18/18 (100%)
 Frame = +3

Query: 186 NSANPYYLHPNENPALIL 239
           NSANPYYLHPNENPAL+L
Sbjct: 1   NSANPYYLHPNENPALVL 18


>gb|PNX71325.1| hypothetical protein L195_g027200, partial [Trifolium pratense]
          Length = 655

 Score =  291 bits (745), Expect(3) = 2e-94
 Identities = 166/379 (43%), Positives = 207/379 (54%), Gaps = 45/379 (11%)
 Frame = +1

Query: 343  KLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFRI 507
            K  + DPM+A WIRCN MVLAW HRS+SESIA+SI      AGVW +L+ RFSQGDIFRI
Sbjct: 289  KPPTNDPMFAQWIRCNNMVLAWFHRSVSESIAKSILSISTAAGVWSDLKNRFSQGDIFRI 348

Query: 508  FDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYREQ 687
             DIQEELY+FRQGNL++SDYFT L+V WDELE YRP+P CKC+IACTCG   S+K +REQ
Sbjct: 349  SDIQEELYRFRQGNLDVSDYFTGLRVYWDELEDYRPIPYCKCSIACTCGGYTSMKQFREQ 408

Query: 688  DYVIRFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPETE 867
            DYVIRFLKGLN+RF+ TKS IM M PLP +   FS+++QQERE+  + +     D     
Sbjct: 409  DYVIRFLKGLNERFTHTKSHIMAMDPLPTVSKAFSLVLQQERELLGNGITTSQTDENAIA 468

Query: 868  VSTT----------------------------LLANSQYXXXXXXXXXXXXXXXXXXXLP 963
            ++                              +LAN                        
Sbjct: 469  LAANASRNASNYGSKNASNYGSGTSRNRGNPPVLANPSNFSGNNAANGHGRGKNFYANKG 528

Query: 964  AKGTNRVCTHCNRTNHTIDS*WVKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQ 1143
              G NR+CT+C RTNH ID  +  HG+PPG+K KGK     SQ+N+     S  Q+ + Q
Sbjct: 529  PSGQNRMCTYCGRTNHIIDGCFELHGFPPGYKPKGK-----SQANSAQTDASVAQHQAPQ 583

Query: 1144 ASASNKAPFGLTQEQYQGILSMV-XXXXXXXXXXXXXXXXNFVSTTPLALNSQSSS---- 1308
             S       G TQEQ+QGIL+++                 N V T P A N  S+     
Sbjct: 584  FS-------GFTQEQFQGILTLIQQSQQPHSGSTSAVHQSNSVMTHPFAFNCDSNKTSGK 636

Query: 1309 -------DHDWLQGSDWYS 1344
                   D +  Q  DWYS
Sbjct: 637  SPFVWILDTEQFQEDDWYS 655



 Score = 59.3 bits (142), Expect(3) = 2e-94
 Identities = 26/33 (78%), Positives = 29/33 (87%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGT 337
           V+P LD+KNYH WA  MHIALISKNKEKF+DGT
Sbjct: 254 VTPLLDNKNYHNWARLMHIALISKNKEKFIDGT 286



 Score = 47.8 bits (112), Expect(3) = 2e-94
 Identities = 18/30 (60%), Positives = 26/30 (86%)
 Frame = +3

Query: 150 IMATTSYSDFSTNSANPYYLHPNENPALIL 239
           IMA  +Y+D+ TN +NP+YLHPNENP+++L
Sbjct: 224 IMAFPNYTDYLTNPSNPFYLHPNENPSVVL 253


>ref|XP_014621696.1| PREDICTED: uncharacterized protein LOC106795617 [Glycine max]
          Length = 275

 Score =  262 bits (670), Expect(3) = 2e-88
 Identities = 122/189 (64%), Positives = 154/189 (81%), Gaps = 5/189 (2%)
 Frame = +1

Query: 340 PKLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFR 504
           PK   +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR RFSQ DIFR
Sbjct: 64  PKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFR 123

Query: 505 IFDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYRE 684
           I D+QE+LY+FRQG L++SDYFTQLK+ WDELE+YRP+P CKC+I C+CG +DSV++Y E
Sbjct: 124 ISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYCE 183

Query: 685 QDYVIRFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPET 864
           QDYVIRFLKGLNDRFS +KSQIM+M PLP+ID VFS++IQQERE+  S  D +     ++
Sbjct: 184 QDYVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDS 243

Query: 865 EVSTTLLAN 891
            ++  + +N
Sbjct: 244 AMAMQVNSN 252



 Score = 62.0 bits (149), Expect(3) = 2e-88
 Identities = 27/35 (77%), Positives = 31/35 (88%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTLP 343
           VSP L  KNYHTW+ SMHIALISKNK+KF+DG+LP
Sbjct: 30  VSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLP 64



 Score = 54.3 bits (129), Expect(3) = 2e-88
 Identities = 23/29 (79%), Positives = 26/29 (89%)
 Frame = +3

Query: 153 MATTSYSDFSTNSANPYYLHPNENPALIL 239
           MA  ++ DFSTNSANPYYLHPNENPAL+L
Sbjct: 1   MALQNFVDFSTNSANPYYLHPNENPALVL 29


>ref|XP_014626210.1| PREDICTED: uncharacterized protein LOC106797041 [Glycine max]
          Length = 275

 Score =  261 bits (667), Expect(3) = 5e-88
 Identities = 122/189 (64%), Positives = 154/189 (81%), Gaps = 5/189 (2%)
 Frame = +1

Query: 340 PKLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFR 504
           PK   +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR RFSQ DIFR
Sbjct: 64  PKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFR 123

Query: 505 IFDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYRE 684
           I D+QE+LY+FRQG L++SDYFTQLK+ WDELE+YRP+P  KC+I C+CG +DSV++YRE
Sbjct: 124 ISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHYKCSIPCSCGGIDSVRVYRE 183

Query: 685 QDYVIRFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPET 864
           QDYVIRFLKGLNDRFS +KSQIM+M PLP+ID VFS++IQQERE+  S  D +     ++
Sbjct: 184 QDYVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDS 243

Query: 865 EVSTTLLAN 891
            ++  + +N
Sbjct: 244 AMAMQVNSN 252



 Score = 62.0 bits (149), Expect(3) = 5e-88
 Identities = 27/35 (77%), Positives = 31/35 (88%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTLP 343
           VSP L  KNYHTW+ SMHIALISKNK+KF+DG+LP
Sbjct: 30  VSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLP 64



 Score = 53.9 bits (128), Expect(3) = 5e-88
 Identities = 22/29 (75%), Positives = 26/29 (89%)
 Frame = +3

Query: 153 MATTSYSDFSTNSANPYYLHPNENPALIL 239
           MA  +++DFSTNSANPYYLHPNENP L+L
Sbjct: 1   MALQNFADFSTNSANPYYLHPNENPTLVL 29


>ref|XP_014627175.1| PREDICTED: uncharacterized protein LOC106797397 [Glycine max]
          Length = 275

 Score =  259 bits (661), Expect(3) = 2e-87
 Identities = 121/189 (64%), Positives = 153/189 (80%), Gaps = 5/189 (2%)
 Frame = +1

Query: 340 PKLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFR 504
           PK   +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR RFSQ DIFR
Sbjct: 64  PKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFR 123

Query: 505 IFDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYRE 684
           I D+QE+LY+FRQG L++SDYFTQLK+ WDELE+YRP+P CKC+I  +CG +DSV++YRE
Sbjct: 124 ISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPYSCGGIDSVRVYRE 183

Query: 685 QDYVIRFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPET 864
           QDYVIR LKGLNDRFS +KSQIM+M PLP+ID VFS++IQQERE+  S  D +     ++
Sbjct: 184 QDYVIRLLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDS 243

Query: 865 EVSTTLLAN 891
            ++  + +N
Sbjct: 244 AMAMQVNSN 252



 Score = 62.0 bits (149), Expect(3) = 2e-87
 Identities = 27/35 (77%), Positives = 31/35 (88%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTLP 343
           VSP L  KNYHTW+ SMHIALISKNK+KF+DG+LP
Sbjct: 30  VSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLP 64



 Score = 54.3 bits (129), Expect(3) = 2e-87
 Identities = 23/29 (79%), Positives = 26/29 (89%)
 Frame = +3

Query: 153 MATTSYSDFSTNSANPYYLHPNENPALIL 239
           MA  ++ DFSTNSANPYYLHPNENPAL+L
Sbjct: 1   MALQNFVDFSTNSANPYYLHPNENPALVL 29


>dbj|GAU50616.1| hypothetical protein TSUD_290710 [Trifolium subterraneum]
          Length = 404

 Score =  248 bits (632), Expect(3) = 5e-82
 Identities = 140/343 (40%), Positives = 195/343 (56%), Gaps = 13/343 (3%)
 Frame = +1

Query: 355  TDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFRIFDIQ 519
            +DP++ PWIRCN MVL+WI RS+SE+I +SI      A VWK L  RF+ GDIFRI DI 
Sbjct: 69   SDPLHEPWIRCNNMVLSWIQRSISETIVKSIMWCDCAAVVWKCLERRFAHGDIFRIADIL 128

Query: 520  EELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYREQDYVI 699
            EE+ +++QG L+IS YFT L  LW+ELE++RPL  C CAI CTCGA   +K Y+EQD VI
Sbjct: 129  EEIARYQQGTLDISSYFTHLTTLWEELENFRPLKDCSCAIPCTCGAASDLKKYKEQDKVI 188

Query: 700  RFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVL--DPIVHDAPETEVS 873
            +FLKGLN++++  +SQIML+ PLP+ID  FS+++QQER++   ++  + +   A   +V 
Sbjct: 189  KFLKGLNEQYASVRSQIMLLDPLPDIDRCFSLVLQQERQMLIPIITDNSVDQQASIMQVR 248

Query: 874  TTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKG-TNRVCTHCNRTNHTIDS*WVKHGYPP 1050
             T   + ++                      +G  NR CTHC R NH +D+ +  HGYPP
Sbjct: 249  QTSYNHGKHYTSFSSTHHGGRGRGRGNHHGGRGPNNRTCTHCGRHNHIVDTCFELHGYPP 308

Query: 1051 GFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQGIL-----SMVX 1215
            G++ K       S+S NV A+ S+        + ++ A     QEQY  IL     S + 
Sbjct: 309  GYQHK------NSKSVNVAATASNATLKEGHINLTS-ATINTIQEQYNQILQLLQHSALQ 361

Query: 1216 XXXXXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSDWYS 1344
                           N + + P ALNS SS   D+   SDW S
Sbjct: 362  ASSTPSNPSPTQASANSIISLPTALNSSSSPTFDFNPNSDWCS 404



 Score = 59.7 bits (143), Expect(3) = 5e-82
 Identities = 27/35 (77%), Positives = 30/35 (85%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTLP 343
           V+P LDHKNY TW+ SM +ALISKNK KFVDGTLP
Sbjct: 30  VTPLLDHKNYQTWSRSMKVALISKNKLKFVDGTLP 64



 Score = 49.7 bits (117), Expect(3) = 5e-82
 Identities = 19/29 (65%), Positives = 24/29 (82%)
 Frame = +3

Query: 153 MATTSYSDFSTNSANPYYLHPNENPALIL 239
           MA   Y+DF+TN  NPYY+HPNENP++IL
Sbjct: 1   MANQPYADFATNPTNPYYIHPNENPSIIL 29


>ref|XP_014632403.1| PREDICTED: uncharacterized protein LOC106798995 [Glycine max]
          Length = 277

 Score =  241 bits (614), Expect(3) = 5e-82
 Identities = 109/150 (72%), Positives = 130/150 (86%), Gaps = 5/150 (3%)
 Frame = +1

Query: 340 PKLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFR 504
           PK    DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR RFSQ DIFR
Sbjct: 64  PKPPVFDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIFR 123

Query: 505 IFDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYRE 684
           I D+QE+LY+FRQG L++SDYFTQLK+ WDELE+YRP+P CKC+I C+CG +DSV++YRE
Sbjct: 124 ISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYRE 183

Query: 685 QDYVIRFLKGLNDRFSQTKSQIMLMKPLPE 774
           QDYVIRFLKGLNDRFS +KSQIM+M PLP+
Sbjct: 184 QDYVIRFLKGLNDRFSHSKSQIMMMNPLPD 213



 Score = 62.0 bits (149), Expect(3) = 5e-82
 Identities = 27/35 (77%), Positives = 31/35 (88%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTLP 343
           VSP L  KNYHTW+ SMHIALISKNK+KF+DG+LP
Sbjct: 30  VSPSLTAKNYHTWSRSMHIALISKNKDKFIDGSLP 64



 Score = 54.3 bits (129), Expect(3) = 5e-82
 Identities = 23/29 (79%), Positives = 26/29 (89%)
 Frame = +3

Query: 153 MATTSYSDFSTNSANPYYLHPNENPALIL 239
           MA  ++ DFSTNSANPYYLHPNENPAL+L
Sbjct: 1   MALQNFVDFSTNSANPYYLHPNENPALVL 29


>gb|KYP65733.1| hypothetical protein KK1_011995 [Cajanus cajan]
 gb|KYP72745.1| hypothetical protein KK1_005345 [Cajanus cajan]
          Length = 445

 Score =  255 bits (651), Expect(3) = 7e-80
 Identities = 138/340 (40%), Positives = 201/340 (59%), Gaps = 16/340 (4%)
 Frame = +1

Query: 337  PPKLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIF 501
            PP   S+  ++ PW RCNTMV++W+  S+SE I +SI      + +W++L+ RFSQGD+F
Sbjct: 65   PP--HSSSILFEPWGRCNTMVISWLQHSISEKIVKSILWFDTASDIWQDLKARFSQGDVF 122

Query: 502  RIFDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYR 681
            R+  +QE+LYKF QG+L++++YFTQLK +WDE+++ RPL  CKC+IAC+CGAVDS   YR
Sbjct: 123  RVAQLQEDLYKFHQGSLDVTEYFTQLKEMWDEIDNLRPLSRCKCSIACSCGAVDSSYKYR 182

Query: 682  EQDYVIRFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPE 861
            EQD VIRFL+GLND+++  +SQIMLM PLP +   FS++ QQER +  S     +HD  +
Sbjct: 183  EQDAVIRFLRGLNDQYTHVRSQIMLMDPLPSLSKTFSLVGQQERHLNQSA----IHDDTK 238

Query: 862  TEVSTTLLA-----------NSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTN 1008
               +T+  +           + Q                        G+ ++CTHC R N
Sbjct: 239  VLAATSFGSLPQTPTTQQHQSPQQQQFGFRRGGYSHGRGRGRGGRTHGSIKICTHCGRNN 298

Query: 1009 HTIDS*WVKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQ 1188
            HT+D+ + KHG+PPG++ KG      S +  VNA E  T + S+    SN   FG TQEQ
Sbjct: 299  HTVDTCYFKHGFPPGYQSKGGT----SANFTVNAVE--TTSPSSMVPESNNPNFGFTQEQ 352

Query: 1189 YQGILSMVXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSS 1308
             Q +LS++                N V ++PLA+N  S++
Sbjct: 353  CQELLSLL--QQSKTIPTPSSHSANSVVSSPLAMNFNSNA 390



 Score = 47.4 bits (111), Expect(3) = 7e-80
 Identities = 19/29 (65%), Positives = 23/29 (79%)
 Frame = +3

Query: 153 MATTSYSDFSTNSANPYYLHPNENPALIL 239
           M   SY+DF+TN  NPYYLHPNE P+L+L
Sbjct: 1   MEDQSYADFTTNPYNPYYLHPNETPSLVL 29



 Score = 47.4 bits (111), Expect(3) = 7e-80
 Identities = 22/34 (64%), Positives = 28/34 (82%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTL 340
           V+P LD KNYHT A +M +AL+SK+K KF+DGTL
Sbjct: 30  VTPLLDGKNYHTRARAMRMALMSKHKVKFIDGTL 63


>ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662412 [Glycine max]
          Length = 424

 Score =  243 bits (621), Expect(3) = 8e-79
 Identities = 132/330 (40%), Positives = 193/330 (58%), Gaps = 9/330 (2%)
 Frame = +1

Query: 334  NPPKLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDI 498
            +PP +  +DP+Y PW+RCN +VL+W+ RS SE IA+S+      + VWK+L  RFSQGDI
Sbjct: 63   SPPPI--SDPLYEPWLRCNNLVLSWLQRSTSEEIAKSLLWCDRASFVWKSLENRFSQGDI 120

Query: 499  FRIFDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIY 678
            FR+ DIQEE+   +QG L+IS YFT+L  LW+E+E++RP+  C CAI C+CGA   ++ +
Sbjct: 121  FRVADIQEEVACLQQGTLDISSYFTKLMTLWEEIENFRPIRDCTCAIPCSCGAATDLRKF 180

Query: 679  REQDYVIRFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAP 858
            +EQD VI+FLKGL D++S  +SQIMLM PLP +D  F++++QQER+         +    
Sbjct: 181  KEQDKVIKFLKGLGDQYSHVRSQIMLMSPLPTLDNAFNLILQQERQFN-------LPSTT 233

Query: 859  ETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKH 1038
            ++ +      N                         +G NR+CTHCNRTNHT+++ ++KH
Sbjct: 234  DSSIENQSSVNHFSQTPSRPSNNSGCGRGRGYSSGGRG-NRLCTHCNRTNHTVETCFIKH 292

Query: 1039 GYPPGFKGKGKNPF-QQSQSNNVNASESSTQNDSAQASAS---NKAPFGLTQEQYQGILS 1206
            GYPPGF+ +  N     S  N+V  + S+  + S+ AS S   + A     QEQY  IL 
Sbjct: 293  GYPPGFQHRKSNSSGNASVVNSVQDAGSAHISSSSSASTSTNGSSASLSTIQEQYTQILQ 352

Query: 1207 MVXXXXXXXXXXXXXXXXNFVSTTPLALNS 1296
            ++                N  ST+P ++NS
Sbjct: 353  LL-------------QQSNLQSTSPSSVNS 369



 Score = 52.0 bits (123), Expect(3) = 8e-79
 Identities = 24/34 (70%), Positives = 26/34 (76%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTL 340
           V P LD+KNY  W  SM +ALISKNK KFVDGTL
Sbjct: 29  VQPVLDNKNYQIWCRSMKVALISKNKVKFVDGTL 62



 Score = 50.8 bits (120), Expect(3) = 8e-79
 Identities = 20/25 (80%), Positives = 24/25 (96%)
 Frame = +3

Query: 165 SYSDFSTNSANPYYLHPNENPALIL 239
           SYSDF+TN +NPYY+HPNENP+LIL
Sbjct: 4   SYSDFATNPSNPYYMHPNENPSLIL 28


>gb|PNX93614.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1430

 Score =  234 bits (597), Expect(3) = 1e-77
 Identities = 128/324 (39%), Positives = 191/324 (58%), Gaps = 7/324 (2%)
 Frame = +1

Query: 355  TDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFRIFDIQ 519
            +DP+Y PWIRCN+MVL+WI RS+S  IA+SI      + VWK+L  RFS GD+F+I D+Q
Sbjct: 69   SDPLYEPWIRCNSMVLSWIQRSISPDIAKSIIWFDHASAVWKDLEFRFSHGDMFKISDLQ 128

Query: 520  EELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLCKCAIACTCGAVDSVKIYREQDYVI 699
            EE+ +  QG+L+IS Y+TQLK L +E+E YRP+  C CAI C+CGAV  +K YREQD V+
Sbjct: 129  EEILRLHQGSLDISSYYTQLKSLSEEIEIYRPVRDCTCAIPCSCGAVADMKKYREQDCVL 188

Query: 700  RFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQERE--ITHSVLDPIVHDAPETEVS 873
            +FLKGLN+++S  +SQIM+M+PLP +  VFS+++QQER   + ++V       A   +V 
Sbjct: 189  KFLKGLNEQYSHVRSQIMMMEPLPPLHKVFSLVLQQERNLPVFNTVDSQNELSAMAMQVQ 248

Query: 874  TTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGYPPG 1053
            +T   +                         + + R CTHC   NH ID+ +VK+G+PPG
Sbjct: 249  STGSNSQPSKNFNFGSGNRGRGKGRRNFGRGQHSTRYCTHCGGDNHIIDNCFVKYGFPPG 308

Query: 1054 FKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSMVXXXXXXX 1233
            ++ KG    Q S + +VN + ++  + S  +S++  +     Q Q+Q  L +        
Sbjct: 309  YQSKG---VQSSNAKSVNLASTTNSDSSLVSSSAMASSLNELQGQFQQFLKL---FQQQT 362

Query: 1234 XXXXXXXXXNFVSTTPLALNSQSS 1305
                     N + + P+ALN+ SS
Sbjct: 363  ESNPTPASVNSIISDPVALNANSS 386



 Score = 54.3 bits (129), Expect(3) = 1e-77
 Identities = 26/38 (68%), Positives = 29/38 (76%)
 Frame = +2

Query: 239 VSPPLDHKNYHTWAGSMHIALISKNKEKFVDGTLPNCR 352
           V+P LD KNYH+W  SM IAL+SKNK KFVDGTL   R
Sbjct: 30  VTPLLDGKNYHSWLRSMKIALLSKNKMKFVDGTLEQPR 67



 Score = 53.5 bits (127), Expect(3) = 1e-77
 Identities = 21/29 (72%), Positives = 26/29 (89%)
 Frame = +3

Query: 153 MATTSYSDFSTNSANPYYLHPNENPALIL 239
           M T++YSDF+TN  NPYYLHPNENPA++L
Sbjct: 1   METSTYSDFATNPTNPYYLHPNENPAVVL 29


>ref|XP_019455138.1| PREDICTED: uncharacterized protein LOC109356267 [Lupinus
            angustifolius]
          Length = 834

 Score =  261 bits (668), Expect = 1e-74
 Identities = 145/327 (44%), Positives = 194/327 (59%), Gaps = 19/327 (5%)
 Frame = +1

Query: 289  AHRLDLEEQRKIC*RNP--PKLQSTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI----- 447
            A RL LE + K+   N   P+    DP+Y PW+RCNTMVL+WI   + ESI +SI     
Sbjct: 43   AMRLTLESKNKLNFINGSLPRPSPKDPLYGPWVRCNTMVLSWIQHCVDESIVKSILWIDT 102

Query: 448  PAGVWKNLRTRFSQGDIFRIFDIQEELYKFRQGNLEISDYFTQLKVLWDELESYRPLPLC 627
             A  WK+L  RFS GDIFRI  +Q+E Y   QGNL+ISDYFT+LK LWDE+E +RP P C
Sbjct: 103  TAEAWKDLHDRFSHGDIFRIAALQKEFYHLDQGNLDISDYFTKLKTLWDEIEDFRPFPSC 162

Query: 628  KCAIACTCGAVDSVKIYREQDYVIRFLKGLNDRFSQTKSQIMLMKPLPEIDTVFSMLIQQ 807
            KC   C CGA+DS+K Y+EQDYVIRFL+GLN++F+  KSQIMLM PLP I   F++LIQQ
Sbjct: 163  KCNTPCICGAMDSLKTYKEQDYVIRFLEGLNEQFAHVKSQIMLMDPLPNITKAFALLIQQ 222

Query: 808  EREITHSVLDPIVHDAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKG----- 972
            ER+    V   +  D     VS+    +SQY                   +P +G     
Sbjct: 223  ERQTQLPVPPSLEPDNRVMNVSSR--QDSQY-----RNNSTNNSFRGRGIIPFRGRGNRA 275

Query: 973  -------TNRVCTHCNRTNHTIDS*WVKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQN 1131
                    NR CT+C RTNHTI++ ++KHGYPPG++    +      +    + ++ST N
Sbjct: 276  AGFGRGQNNRFCTYCERTNHTIETCYLKHGYPPGYQSTRSSKMVNHTTG--YSFDTSTNN 333

Query: 1132 DSAQASASNKAPFGLTQEQYQGILSMV 1212
            ++A  + +N   F  T+EQ QGIL ++
Sbjct: 334  EAAHQTQNNSTSF--TKEQVQGILDLL 358


Top