BLASTX nr result

ID: Astragalus22_contig00029563 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00029563
         (1589 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX82129.1| hypothetical protein L195_g038157 [Trifolium prat...   458   e-155
gb|PNX71626.1| flavonol sulfotransferase-like protein, partial [...   461   e-154
dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subt...   448   e-142
gb|PNY08535.1| retrovirus-related Pol polyprotein from transposo...   443   e-140
gb|PNX55412.1| hypothetical protein L195_g049041, partial [Trifo...   384   e-127
ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798...   382   e-126
gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Gly...   364   e-117
gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Gly...   363   e-117
gb|PNX91084.1| hypothetical protein L195_g047213, partial [Trifo...   352   e-113
gb|KHN34741.1| Retrovirus-related Pol polyprotein from transposo...   340   e-107
gb|PNX71325.1| hypothetical protein L195_g027200, partial [Trifo...   323   2e-99
ref|XP_014621696.1| PREDICTED: uncharacterized protein LOC106795...   298   2e-94
ref|XP_014626210.1| PREDICTED: uncharacterized protein LOC106797...   296   7e-94
ref|XP_014627175.1| PREDICTED: uncharacterized protein LOC106797...   294   4e-93
ref|XP_014632403.1| PREDICTED: uncharacterized protein LOC106798...   276   5e-86
gb|KYP65733.1| hypothetical protein KK1_011995 [Cajanus cajan] >...   279   8e-85
dbj|GAU50616.1| hypothetical protein TSUD_290710 [Trifolium subt...   271   2e-82
ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662...   266   3e-80
ref|XP_019455138.1| PREDICTED: uncharacterized protein LOC109356...   273   9e-79
ref|XP_016673106.1| PREDICTED: uncharacterized protein LOC107892...   251   3e-75

>gb|PNX82129.1| hypothetical protein L195_g038157 [Trifolium pratense]
          Length = 392

 Score =  458 bits (1178), Expect = e-155
 Identities = 243/409 (59%), Positives = 283/409 (69%), Gaps = 14/409 (3%)
 Frame = +3

Query: 159  MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEE 314
            MAT +YSDFSTNSANPYYLHPNENPALIL        P L   N H         L  + 
Sbjct: 1    MATINYSDFSTNSANPYYLHPNENPALILVS------PPLDHKNYHTWARSMNIALISKN 54

Query: 315  QRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRT 479
            + K    + PK   TDP+Y PWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR 
Sbjct: 55   KDKFIDGSFPKPSITDPLYGPWIRCNTMVLAWIHRSISDSIARSVLWIDTAAGVWKNLRI 114

Query: 480  RFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGA 659
            RFSQGDIFRISDIQ+ELYKFRQG L+ISDYFTQLKV WDELE+YRP+P CKC+IACTCGA
Sbjct: 115  RFSQGDIFRISDIQEELYKFRQGTLDISDYFTQLKVLWDELENYRPIPHCKCSIACTCGA 174

Query: 660  VDSVKIYREQDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVLD 836
            +DS+ IYR+QDYVIRFLKGLND+FS TKSQIM M PLP+IDTVFSMLIQQEREI +SV+D
Sbjct: 175  IDSINIYRQQDYVIRFLKGLNDKFSHTKSQIMLMNPLPDIDTVFSMLIQQEREIGNSVID 234

Query: 837  PIVHDAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTI 1016
             IV+DAP+   S   LANS Y                     +KG+NR CTHC  TNH +
Sbjct: 235  SIVNDAPDKNSSNVFLANSSYGNFHGKYNSKGKGQHSG----SKGSNRFCTHCQGTNHIV 290

Query: 1017 DS*WVKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQG 1196
            ++ W+KHGYP G+KGKGKN FQ +Q+N+     S  Q DS   ++S K PFG TQEQY G
Sbjct: 291  ENCWIKHGYPIGYKGKGKNSFQSTQANSAAVPNSPMQLDS--TTSSTKPPFGFTQEQYHG 348

Query: 1197 ILSMVXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSDWYS 1343
            IL +                 N VST+PLA NSQSS+ ++  QGSDWYS
Sbjct: 349  ILGL-----FQQLKHQPTPASNSVSTSPLAFNSQSSNGNELYQGSDWYS 392


>gb|PNX71626.1| flavonol sulfotransferase-like protein, partial [Trifolium pratense]
          Length = 591

 Score =  461 bits (1186), Expect = e-154
 Identities = 240/403 (59%), Positives = 284/403 (70%), Gaps = 14/403 (3%)
 Frame = +3

Query: 171  SYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*RNPPKL 350
            +YSDFS+NSANPYYLHPNENPA+IL        P L   N H      Q  +  +N  K 
Sbjct: 3    TYSDFSSNSANPYYLHPNENPAVILVS------PPLDHKNYHTWSRSMQIALISKNKDKF 56

Query: 351  QN--------TDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQ 491
             +         DP+Y+PWIRCNTMVLAWIHRSLSESIA+S+      AG+WKNLRTRFSQ
Sbjct: 57   IDGTLVKPSPLDPLYSPWIRCNTMVLAWIHRSLSESIARSVLWIDSAAGLWKNLRTRFSQ 116

Query: 492  GDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSV 671
            GDIFRISD+Q+ELY+ RQGNL++SDYFT+LKV WDELE+YRP+P CKC+IACTCGA++S 
Sbjct: 117  GDIFRISDLQEELYRLRQGNLDVSDYFTKLKVLWDELENYRPIPFCKCSIACTCGAIESF 176

Query: 672  KIYREQDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVLDPIVH 848
            K+YREQDYVIRFLKGLNDRFS TKSQIM M PLP++DTVFSMLIQQEREI +S+LDPI H
Sbjct: 177  KVYREQDYVIRFLKGLNDRFSNTKSQIMLMNPLPDVDTVFSMLIQQEREIAYSILDPITH 236

Query: 849  DAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*W 1028
            DAPE + ST LLANS Y                      KG NR+CT+C  TNH + + W
Sbjct: 237  DAPEVDSSTALLANSHYRNQNGKTNYYGKGKGQAPNSAPKGYNRLCTYCKGTNHIVQNCW 296

Query: 1029 VKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSM 1208
            +K+GYPPG+K KGKN  Q   S+ V A +SSTQ DS Q+S +   PFGLTQ+QY GILSM
Sbjct: 297  IKYGYPPGYKNKGKNSSQ--PSHTVAAVDSSTQPDS-QSSTTATPPFGLTQDQYDGILSM 353

Query: 1209 VXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSDW 1337
            +                N VSTTPLAL+SQSS+ +DW QGS W
Sbjct: 354  I-----QQSKSQPTPTVNSVSTTPLALHSQSSTSNDWYQGSXW 391


>dbj|GAU41679.1| hypothetical protein TSUD_272630 [Trifolium subterraneum]
          Length = 1178

 Score =  448 bits (1152), Expect = e-142
 Identities = 236/402 (58%), Positives = 279/402 (69%), Gaps = 14/402 (3%)
 Frame = +3

Query: 171  SYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*RNPPKL 350
            +YSDFSTNSANPYYLHPNENPA+IL        P L   N H      Q  +  +N  K 
Sbjct: 3    TYSDFSTNSANPYYLHPNENPAVILVS------PPLDHKNYHTWSRSMQIALISKNKDKF 56

Query: 351  QN--------TDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQ 491
             +         DP+Y+PWIRCNTMVLAWIHRSLS+SIA+S+      A +WKNLRTRFSQ
Sbjct: 57   IDGTLVKPSPLDPLYSPWIRCNTMVLAWIHRSLSDSIARSVLWIDSAASLWKNLRTRFSQ 116

Query: 492  GDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSV 671
            GDIFRISD+Q+ELY+ RQGNL++SDYFT+L+V WDELE+YRP+PLCKC+IACTCGAV+S 
Sbjct: 117  GDIFRISDLQEELYRLRQGNLDVSDYFTKLQVLWDELENYRPIPLCKCSIACTCGAVESF 176

Query: 672  KIYREQDYVIRFLKGLNDRFSQTKSQIMM-KPLPEIDTVFSMLIQQEREITHSVLDPIVH 848
            K+YREQDYVIRFLKGLNDRFS TKSQIM+  PLP++DTVFSMLIQQEREI +S+LDPI H
Sbjct: 177  KLYREQDYVIRFLKGLNDRFSNTKSQIMLINPLPDVDTVFSMLIQQEREIAYSILDPITH 236

Query: 849  DAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*W 1028
            DAPE + ST LLANS Y                      KG NR+CTHC  TNH +   W
Sbjct: 237  DAPEVDFSTALLANSHYKNQNGKSNYYGKGRGQAPNSAPKGHNRLCTHCRGTNHIVQDCW 296

Query: 1029 VKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSM 1208
            +K+GYPPG+K   KN  Q   S+ V A +SSTQ+DS Q S +   PFGLTQ QY GI+SM
Sbjct: 297  IKYGYPPGYKNNRKNSSQ--PSHIVAAVDSSTQHDS-QFSNTATPPFGLTQVQYDGIISM 353

Query: 1209 VXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSD 1334
            +                N VSTTPLA +SQSS+ +DW QGSD
Sbjct: 354  I-----QQSKSQPTPTVNSVSTTPLAFHSQSSNSNDWYQGSD 390


>gb|PNY08535.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
            [Trifolium pratense]
          Length = 1205

 Score =  443 bits (1140), Expect = e-140
 Identities = 242/424 (57%), Positives = 287/424 (67%), Gaps = 22/424 (5%)
 Frame = +3

Query: 171  SYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*RNPPKL 350
            +YSDFSTNSANPYYLHPNENPA+IL        P L   N H      Q  +  +N  K 
Sbjct: 3    TYSDFSTNSANPYYLHPNENPAMILVS------PPLDHKNYHTWSRSMQIALISKNKDKF 56

Query: 351  QN--------TDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQ 491
             +         DP+++PWIRCNTMVLAW+HRS+SESIA+SI      AGVWKNLR RFSQ
Sbjct: 57   IDGTLVKPSPLDPLFSPWIRCNTMVLAWLHRSVSESIARSILWIDSAAGVWKNLRIRFSQ 116

Query: 492  GDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSV 671
            GDIFRISDIQ+ELY+FRQGNL+ISDYFT+LKV WDELE+YRP+PLCKC+I CTCGA+DS 
Sbjct: 117  GDIFRISDIQEELYRFRQGNLDISDYFTKLKVLWDELENYRPIPLCKCSIPCTCGAIDSF 176

Query: 672  KIYREQDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVLDPIVH 848
            K+YREQDYVIRFLKGLNDRFS TKSQIM M PLP++DTVFSMLIQQEREI +S+LDPI H
Sbjct: 177  KVYREQDYVIRFLKGLNDRFSNTKSQIMLMNPLPDVDTVFSMLIQQEREIAYSILDPITH 236

Query: 849  DAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*W 1028
            DAPE + ST LLANS                        KG +R+CT+C  TNH + + W
Sbjct: 237  DAPEVDSSTALLANSHSRNQNGKSNYYGKGKGQAPNSAPKGHDRLCTYCKGTNHVVQNCW 296

Query: 1029 VKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSM 1208
            +K+GYPPG+K KGKN  Q   S+ V A +SSTQ DS Q+S +   PFGLTQ+QY GILSM
Sbjct: 297  IKYGYPPGYKNKGKNSSQ--PSHTVAAVDSSTQLDS-QSSTTATPPFGLTQDQYDGILSM 353

Query: 1209 VXXXXXXXXXXXXXXXXNFVSTTPLALNSQSS-----SDHDWL---QGSDWYSWDAKGIV 1364
            +                N VSTTPLAL+SQSS     S + W+     +D  ++D K   
Sbjct: 354  I-----RQSKSQPTPTVNSVSTTPLALHSQSSTNNGKSSNFWILDTGATDHITYDIKTFN 408

Query: 1365 YLRH 1376
              RH
Sbjct: 409  SYRH 412


>gb|PNX55412.1| hypothetical protein L195_g049041, partial [Trifolium pratense]
          Length = 338

 Score =  384 bits (986), Expect = e-127
 Identities = 200/331 (60%), Positives = 234/331 (70%), Gaps = 9/331 (2%)
 Frame = +3

Query: 342  PKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFR 506
            PK   TDP+Y PWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNL+ RFSQGDIFR
Sbjct: 19   PKPSITDPLYGPWIRCNTMVLAWIHRSISDSIARSVLWIDTAAGVWKNLKIRFSQGDIFR 78

Query: 507  ISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSVKIYRE 686
            ISDIQ+ELYKFRQG L+ISDYFTQLKV WDELE+YRP+P CKC+IACTCGA+DS+ IYR+
Sbjct: 79   ISDIQEELYKFRQGTLDISDYFTQLKVLWDELENYRPIPHCKCSIACTCGAIDSINIYRQ 138

Query: 687  QDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPET 863
            QDYVIRFLKGLNDRFS TKSQIM M PLP+IDTVFSMLIQQEREI +SV+D IV+DAP+ 
Sbjct: 139  QDYVIRFLKGLNDRFSHTKSQIMLMNPLPDIDTVFSMLIQQEREIGNSVIDSIVNDAPDR 198

Query: 864  EVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGY 1043
              S  LLANS Y                     +KG NR CT+C  TNH +++ W+KHGY
Sbjct: 199  NSSNVLLANSYYGKYNSKGKGQNSG--------SKGGNRFCTYCKGTNHIVENCWIKHGY 250

Query: 1044 PPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQ---ASASNKAPFGLTQEQYQGILSMVX 1214
            P G+KGKGKN  Q +Q N+V A  +     S Q    ++S K  FG TQEQY GIL +  
Sbjct: 251  PIGYKGKGKNLSQSTQVNSVAAPNAVVPKSSLQLDSTTSSTKPLFGFTQEQYHGILGL-- 308

Query: 1215 XXXXXXXXXXXXXXXNFVSTTPLALNSQSSS 1307
                           N VST+PL  NSQSS+
Sbjct: 309  ---FQQLQSQPSPSSNSVSTSPLVFNSQSSN 336


>ref|XP_014630525.1| PREDICTED: uncharacterized protein LOC106798459 [Glycine max]
          Length = 389

 Score =  382 bits (981), Expect = e-126
 Identities = 209/413 (50%), Positives = 267/413 (64%), Gaps = 18/413 (4%)
 Frame = +3

Query: 159  MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEE 314
            MA  ++ DFSTNSANPYYLHPNENPAL+L        P L   N H         L  + 
Sbjct: 1    MALQNFVDFSTNSANPYYLHPNENPALVLVS------PSLTAKNYHTWSHSMHIALISKN 54

Query: 315  QRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRT 479
            + K    + PK   +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR 
Sbjct: 55   KDKFIDGSLPKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRI 114

Query: 480  RFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGA 659
            RFSQ DIFRISD+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P CKC+I C+CG 
Sbjct: 115  RFSQSDIFRISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGG 174

Query: 660  VDSVKIYREQDYVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQEREITHSVLD 836
            +DSV++YREQDYV+RFLKGLNDRFS +KSQI MM PLP+ID VFS++IQQERE+  S  D
Sbjct: 175  IDSVRVYREQDYVVRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSD 234

Query: 837  PIVHDAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTI 1016
             +     ++ ++  + +N                        +KG NRVCTHC +TNH +
Sbjct: 235  SVSEATSDSAMAMQVNSNQSNFNGKGGYYNKGKG-------SSKGGNRVCTHCGKTNHIV 287

Query: 1017 DS*WVKHGYPPGFK-GKGKNPFQQSQSN---NVNASESSTQNDSAQASASNKAPFGLTQE 1184
            D+ + K GYPPG+K  K KN    SQ+N   N +A ES+ Q  SAQ+S      F  TQE
Sbjct: 288  DNCFEKIGYPPGYKTNKSKNSSSSSQANNTSNASALESTQQGSSAQSS------FQFTQE 341

Query: 1185 QYQGILSMVXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSDWYS 1343
             YQGIL  +                N V+T+P AL+S SS+ ++   G+DWYS
Sbjct: 342  MYQGILEAL-----QQSKVGSQPKANSVTTSPFALHSPSSNPNESFSGNDWYS 389


>gb|KHN07990.1| hypothetical protein glysoja_045923, partial [Glycine soja]
          Length = 484

 Score =  364 bits (935), Expect = e-117
 Identities = 202/393 (51%), Positives = 254/393 (64%), Gaps = 18/393 (4%)
 Frame = +3

Query: 183  FSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEEQRKIC*RN 338
            FSTNSANPYYLHPNENPAL+L        P L   N H         L  + + K    +
Sbjct: 1    FSTNSANPYYLHPNENPALVLVS------PSLTAKNYHTWSRSMHIALISKNKDKFIDGS 54

Query: 339  PPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIF 503
             PK   +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR RFSQ DIF
Sbjct: 55   LPKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIF 114

Query: 504  RISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSVKIYR 683
            RISD+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P CKC+I C+CG +DSV++YR
Sbjct: 115  RISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYR 174

Query: 684  EQDYVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPE 860
            EQDYVIRFLKGLNDRFS +KSQI MM PLP+ID VFS++IQQERE+  S  D +     +
Sbjct: 175  EQDYVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSD 234

Query: 861  TEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHG 1040
            + ++  + +N                        +KG NRVCTHC +TNH +D+ + K G
Sbjct: 235  SAMAMQVNSNQSNFNGKGGYYNKGKG-------SSKGGNRVCTHCGKTNHIVDNCFEKIG 287

Query: 1041 YPPGFK-GKGKNPFQQSQSN---NVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSM 1208
            YPPG+K  K KN    SQ+N   N +A ES+ Q  SAQ+S      F  TQE YQGIL  
Sbjct: 288  YPPGYKTNKSKNSSSSSQANNTSNASALESTQQGSSAQSS------FQFTQEMYQGILEA 341

Query: 1209 VXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSS 1307
            +                N V+T+P AL+S SS+
Sbjct: 342  L-----QQSKVGSQPKANLVTTSPFALHSPSSN 369



 Score = 87.0 bits (214), Expect = 2e-14
 Identities = 42/89 (47%), Positives = 55/89 (61%)
 Frame = +1

Query: 1321 SKAVIGIAGMRRGLYILDIEDPXXXXXXXXXXXXXXXNVSHGDSQLWHLRLGHISDIGLK 1500
            S   IG A ++RGLY++D  D                ++S    +LWH RLGH+S+ G++
Sbjct: 404  SLETIGTAKLQRGLYVIDTAD----------MIRSCNSISSHSFELWHSRLGHVSNSGMQ 453

Query: 1501 TISKQFPFISSSNNMLPCDSCHFAKQKKL 1587
             ISKQFPFI   NNM PCDSCHF+KQK+L
Sbjct: 454  AISKQFPFIPCKNNMSPCDSCHFSKQKRL 482


>gb|KHN02608.1| hypothetical protein glysoja_043563, partial [Glycine soja]
          Length = 484

 Score =  363 bits (933), Expect = e-117
 Identities = 202/393 (51%), Positives = 254/393 (64%), Gaps = 18/393 (4%)
 Frame = +3

Query: 183  FSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEEQRKIC*RN 338
            FSTNSANPYYLHPNENPAL+L        P L   N H         L  + + K    +
Sbjct: 1    FSTNSANPYYLHPNENPALVLVS------PSLTAKNYHTWSRSMHIALISKNKDKFIDGS 54

Query: 339  PPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIF 503
             PK   +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR RFSQ DIF
Sbjct: 55   LPKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSQSDIF 114

Query: 504  RISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSVKIYR 683
            RISD+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P CKC+I C+CG +DSV++YR
Sbjct: 115  RISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGGIDSVRVYR 174

Query: 684  EQDYVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPE 860
            EQDYVIRFLKGLNDRFS +KSQI MM PLP+ID VFS++IQQERE+  S  D +     +
Sbjct: 175  EQDYVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSD 234

Query: 861  TEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHG 1040
            + ++  + +N                        +KG NRVCTHC +TNH +D+ + K G
Sbjct: 235  SAMAMQVNSNQSNFNGKGGYYNKGKG-------SSKGGNRVCTHCGKTNHIVDNCFEKIG 287

Query: 1041 YPPGFK-GKGKNPFQQSQSN---NVNASESSTQNDSAQASASNKAPFGLTQEQYQGILSM 1208
            YPPG+K  K KN    SQ+N   N +A ES+ Q  SAQ+S      F  TQE YQGIL  
Sbjct: 288  YPPGYKTNKSKNSSSSSQANNTSNASALESTQQGSSAQSS------FQFTQEMYQGILEA 341

Query: 1209 VXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSS 1307
            +                N V+T+P AL+S SS+
Sbjct: 342  L-----QQSKVGSQPKANSVTTSPFALHSPSSN 369



 Score = 87.0 bits (214), Expect = 2e-14
 Identities = 42/89 (47%), Positives = 55/89 (61%)
 Frame = +1

Query: 1321 SKAVIGIAGMRRGLYILDIEDPXXXXXXXXXXXXXXXNVSHGDSQLWHLRLGHISDIGLK 1500
            S   IG A ++RGLY++D  D                ++S    +LWH RLGH+S+ G++
Sbjct: 404  SLETIGTAKLQRGLYVIDTAD----------MIRSCNSISSHSFELWHSRLGHVSNSGMQ 453

Query: 1501 TISKQFPFISSSNNMLPCDSCHFAKQKKL 1587
             ISKQFPFI   NNM PCDSCHF+KQK+L
Sbjct: 454  AISKQFPFIPCKNNMSPCDSCHFSKQKRL 482


>gb|PNX91084.1| hypothetical protein L195_g047213, partial [Trifolium pratense]
          Length = 417

 Score =  352 bits (903), Expect = e-113
 Identities = 194/377 (51%), Positives = 252/377 (66%), Gaps = 19/377 (5%)
 Frame = +3

Query: 129  LRLKKPLLPIMATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--- 299
            +++++ L+  MA  +Y DF TNSANPYYLHPNENPAL+L        P L   N H    
Sbjct: 48   VKIRRLLVGTMALQNYIDFPTNSANPYYLHPNENPALVLVS------PSLTAKNYHTWSR 101

Query: 300  -----LDLEEQRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI----- 449
                 L  + + K    + PK   +DP+YAPWIRCNTMVLAWIHRS+SESIA+S+     
Sbjct: 102  SMHIALISKNKEKFIDGSLPKPPVSDPLYAPWIRCNTMVLAWIHRSISESIARSVLWIET 161

Query: 450  PAGVWKNLRTRFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLC 629
             AGVWKNLR RFSQ DIFRISD+Q+++Y+FRQG L++SDYFTQLKV+WDELE+YRPLP C
Sbjct: 162  AAGVWKNLRVRFSQSDIFRISDLQEDMYRFRQGTLDVSDYFTQLKVYWDELENYRPLPYC 221

Query: 630  KCAIACTCGAVDSVKIYREQDYVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQ 806
            KC+I C+CG +DSV+ YREQD+VIRFLKGLN+RFS +KSQI MM PLP+ID  FS++IQQ
Sbjct: 222  KCSIPCSCGVIDSVRAYREQDFVIRFLKGLNERFSHSKSQIMMMNPLPDIDRAFSLVIQQ 281

Query: 807  EREI-----THSVLDPIVHDAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKG 971
            ERE+     + SV +     A   +V++T  +NS                       ++ 
Sbjct: 282  EREMLSFNNSDSVSEATSDSAMVMQVNST-KSNSHGKKSFXYKEKGQG--------SSQS 332

Query: 972  TNRVCTHCNRTNHTIDS*WVKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASA 1151
             NRVCTHC +TNH +D+ + K GYPPG+K    N F +S S+ VN + S++  +S Q  +
Sbjct: 333  GNRVCTHCGKTNHIVDNCFEKIGYPPGYK---TNKF-KSSSSQVNNTSSASALESVQQGS 388

Query: 1152 SNKAPFGLTQEQYQGIL 1202
            S ++ F  TQE YQGIL
Sbjct: 389  SAQSNFQFTQEMYQGIL 405


>gb|KHN34741.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial
            [Glycine soja]
          Length = 495

 Score =  340 bits (871), Expect = e-107
 Identities = 182/346 (52%), Positives = 231/346 (66%), Gaps = 18/346 (5%)
 Frame = +3

Query: 192  NSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEEQRKIC*RNPPK 347
            NSANPYYLHPNENPAL+L        P L   N H         L  + + K    + PK
Sbjct: 1    NSANPYYLHPNENPALVLVS------PSLTAKNYHTWSRSMHIALISKNKDKFIDGSLPK 54

Query: 348  LQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFRIS 512
               +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR RFS  DIFRIS
Sbjct: 55   PPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRIRFSHSDIFRIS 114

Query: 513  DIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSVKIYREQD 692
            D+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P CKC+I C+CG +DSV++YREQD
Sbjct: 115  DLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPYCKCSIPCSCGGIDSVRVYREQD 174

Query: 693  YVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPETEV 869
            YVIRFLKGLNDRFS +KSQI MM PLP+ID VFS++IQQERE+  S  D +     ++ +
Sbjct: 175  YVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSDSVSEATSDSAM 234

Query: 870  STTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGYPP 1049
            +  + +N                        +KG NRVCTHC +TNH +D+ + K GYPP
Sbjct: 235  AMQVNSNQSNFNGKGGYYNKGKG-------SSKGGNRVCTHCGKTNHIVDNCFEKIGYPP 287

Query: 1050 GFK-GKGKNPFQQSQSN---NVNASESSTQNDSAQASASNKAPFGL 1175
            G+K  K KN    SQ+N   N +A ES+ Q  SAQ+  +  +PF L
Sbjct: 288  GYKTNKSKNSSSSSQANNTSNASALESTQQGSSAQSITT--SPFAL 331



 Score = 64.7 bits (156), Expect = 3e-07
 Identities = 26/39 (66%), Positives = 33/39 (84%)
 Frame = +1

Query: 1471 LGHISDIGLKTISKQFPFISSSNNMLPCDSCHFAKQKKL 1587
            +GH+S+ G++ ISKQFPFI   NNM PCDSCHF+KQK+L
Sbjct: 443  IGHVSNSGMQAISKQFPFIPCKNNMSPCDSCHFSKQKRL 481


>gb|PNX71325.1| hypothetical protein L195_g027200, partial [Trifolium pratense]
          Length = 655

 Score =  323 bits (828), Expect = 2e-99
 Identities = 189/450 (42%), Positives = 244/450 (54%), Gaps = 54/450 (12%)
 Frame = +3

Query: 156  IMATTSYSDFSTNSANPYYLHPNENPALILFL--------HH*IELPHLGKINAHRLDLE 311
            IMA  +Y+D+ TN +NP+YLHPNENP+++L          H+   L H+  I+ ++    
Sbjct: 224  IMAFPNYTDYLTNPSNPFYLHPNENPSVVLVTPLLDNKNYHNWARLMHIALISKNK---- 279

Query: 312  EQRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLR 476
               K       K    DPM+A WIRCN MVLAW HRS+SESIA+SI      AGVW +L+
Sbjct: 280  --EKFIDGTFSKPPTNDPMFAQWIRCNNMVLAWFHRSVSESIAKSILSISTAAGVWSDLK 337

Query: 477  TRFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCG 656
             RFSQGDIFRISDIQ+ELY+FRQGNL++SDYFT L+V+WDELE YRP+P CKC+IACTCG
Sbjct: 338  NRFSQGDIFRISDIQEELYRFRQGNLDVSDYFTGLRVYWDELEDYRPIPYCKCSIACTCG 397

Query: 657  AVDSVKIYREQDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVL 833
               S+K +REQDYVIRFLKGLN+RF+ TKS IM M PLP +   FS+++QQERE+  + +
Sbjct: 398  GYTSMKQFREQDYVIRFLKGLNERFTHTKSHIMAMDPLPTVSKAFSLVLQQERELLGNGI 457

Query: 834  DPIVHDAPETEVSTT----------------------------LLANSQYXXXXXXXXXX 929
                 D     ++                              +LAN             
Sbjct: 458  TTSQTDENAIALAANASRNASNYGSKNASNYGSGTSRNRGNPPVLANPSNFSGNNAANGH 517

Query: 930  XXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGYPPGFKGKGKNPFQQSQSNNVNA 1109
                         G NR+CT+C RTNH ID  +  HG+PPG+K KGK     SQ+N+   
Sbjct: 518  GRGKNFYANKGPSGQNRMCTYCGRTNHIIDGCFELHGFPPGYKPKGK-----SQANSAQT 572

Query: 1110 SESSTQNDSAQASASNKAPFGLTQEQYQGILSMV-XXXXXXXXXXXXXXXXNFVSTTPLA 1286
              S  Q+ + Q S       G TQEQ+QGIL+++                 N V T P A
Sbjct: 573  DASVAQHQAPQFS-------GFTQEQFQGILTLIQQSQQPHSGSTSAVHQSNSVMTHPFA 625

Query: 1287 LNSQSSS-----------DHDWLQGSDWYS 1343
             N  S+            D +  Q  DWYS
Sbjct: 626  FNCDSNKTSGKSPFVWILDTEQFQEDDWYS 655


>ref|XP_014621696.1| PREDICTED: uncharacterized protein LOC106795617 [Glycine max]
          Length = 275

 Score =  298 bits (762), Expect = 2e-94
 Identities = 151/258 (58%), Positives = 190/258 (73%), Gaps = 14/258 (5%)
 Frame = +3

Query: 159 MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEE 314
           MA  ++ DFSTNSANPYYLHPNENPAL+L        P L   N H         L  + 
Sbjct: 1   MALQNFVDFSTNSANPYYLHPNENPALVLVS------PSLTAKNYHTWSRSMHIALISKN 54

Query: 315 QRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRT 479
           + K    + PK   +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR 
Sbjct: 55  KDKFIDGSLPKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRI 114

Query: 480 RFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGA 659
           RFSQ DIFRISD+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P CKC+I C+CG 
Sbjct: 115 RFSQSDIFRISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGG 174

Query: 660 VDSVKIYREQDYVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQEREITHSVLD 836
           +DSV++Y EQDYVIRFLKGLNDRFS +KSQI MM PLP+ID VFS++IQQERE+  S  D
Sbjct: 175 IDSVRVYCEQDYVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSD 234

Query: 837 PIVHDAPETEVSTTLLAN 890
            +     ++ ++  + +N
Sbjct: 235 SVSEATSDSAMAMQVNSN 252


>ref|XP_014626210.1| PREDICTED: uncharacterized protein LOC106797041 [Glycine max]
          Length = 275

 Score =  296 bits (758), Expect = 7e-94
 Identities = 150/258 (58%), Positives = 190/258 (73%), Gaps = 14/258 (5%)
 Frame = +3

Query: 159 MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEE 314
           MA  +++DFSTNSANPYYLHPNENP L+L        P L   N H         L  + 
Sbjct: 1   MALQNFADFSTNSANPYYLHPNENPTLVLVS------PSLTAKNYHTWSRSMHIALISKN 54

Query: 315 QRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRT 479
           + K    + PK   +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR 
Sbjct: 55  KDKFIDGSLPKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRI 114

Query: 480 RFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGA 659
           RFSQ DIFRISD+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P  KC+I C+CG 
Sbjct: 115 RFSQSDIFRISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHYKCSIPCSCGG 174

Query: 660 VDSVKIYREQDYVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQEREITHSVLD 836
           +DSV++YREQDYVIRFLKGLNDRFS +KSQI MM PLP+ID VFS++IQQERE+  S  D
Sbjct: 175 IDSVRVYREQDYVIRFLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSD 234

Query: 837 PIVHDAPETEVSTTLLAN 890
            +     ++ ++  + +N
Sbjct: 235 SVSEATSDSAMAMQVNSN 252


>ref|XP_014627175.1| PREDICTED: uncharacterized protein LOC106797397 [Glycine max]
          Length = 275

 Score =  294 bits (753), Expect = 4e-93
 Identities = 150/258 (58%), Positives = 189/258 (73%), Gaps = 14/258 (5%)
 Frame = +3

Query: 159 MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEE 314
           MA  ++ DFSTNSANPYYLHPNENPAL+L        P L   N H         L  + 
Sbjct: 1   MALQNFVDFSTNSANPYYLHPNENPALVLVS------PSLTAKNYHTWSRSMHIALISKN 54

Query: 315 QRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRT 479
           + K    + PK   +DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR 
Sbjct: 55  KDKFIDGSLPKPPVSDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRI 114

Query: 480 RFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGA 659
           RFSQ DIFRISD+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P CKC+I  +CG 
Sbjct: 115 RFSQSDIFRISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPYSCGG 174

Query: 660 VDSVKIYREQDYVIRFLKGLNDRFSQTKSQI-MMKPLPEIDTVFSMLIQQEREITHSVLD 836
           +DSV++YREQDYVIR LKGLNDRFS +KSQI MM PLP+ID VFS++IQQERE+  S  D
Sbjct: 175 IDSVRVYREQDYVIRLLKGLNDRFSHSKSQIMMMNPLPDIDHVFSLVIQQEREMLGSNSD 234

Query: 837 PIVHDAPETEVSTTLLAN 890
            +     ++ ++  + +N
Sbjct: 235 SVSEATSDSAMAMQVNSN 252


>ref|XP_014632403.1| PREDICTED: uncharacterized protein LOC106798995 [Glycine max]
          Length = 277

 Score =  276 bits (706), Expect = 5e-86
 Identities = 138/219 (63%), Positives = 166/219 (75%), Gaps = 14/219 (6%)
 Frame = +3

Query: 159 MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHR--------LDLEE 314
           MA  ++ DFSTNSANPYYLHPNENPAL+L        P L   N H         L  + 
Sbjct: 1   MALQNFVDFSTNSANPYYLHPNENPALVLVS------PSLTAKNYHTWSRSMHIALISKN 54

Query: 315 QRKIC*RNPPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRT 479
           + K    + PK    DP+YAPWIRCNTMVLAWIHRS+S+SIA+S+      AGVWKNLR 
Sbjct: 55  KDKFIDGSLPKPPVFDPLYAPWIRCNTMVLAWIHRSISDSIAKSVLWIDTAAGVWKNLRI 114

Query: 480 RFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGA 659
           RFSQ DIFRISD+Q++LY+FRQG L++SDYFTQLK++WDELE+YRP+P CKC+I C+CG 
Sbjct: 115 RFSQSDIFRISDLQEDLYRFRQGTLDVSDYFTQLKIYWDELENYRPIPHCKCSIPCSCGG 174

Query: 660 VDSVKIYREQDYVIRFLKGLNDRFSQTKSQI-MMKPLPE 773
           +DSV++YREQDYVIRFLKGLNDRFS +KSQI MM PLP+
Sbjct: 175 IDSVRVYREQDYVIRFLKGLNDRFSHSKSQIMMMNPLPD 213


>gb|KYP65733.1| hypothetical protein KK1_011995 [Cajanus cajan]
 gb|KYP72745.1| hypothetical protein KK1_005345 [Cajanus cajan]
          Length = 445

 Score =  279 bits (713), Expect = 8e-85
 Identities = 158/404 (39%), Positives = 233/404 (57%), Gaps = 21/404 (5%)
 Frame = +3

Query: 159  MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*RN 338
            M   SY+DF+TN  NPYYLHPNE P+L+L         +  +  A R+ L  + K+   +
Sbjct: 1    MEDQSYADFTTNPYNPYYLHPNETPSLVLVTPLLDGKNYHTRARAMRMALMSKHKVKFID 60

Query: 339  ----PPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQ 491
                PP   ++  ++ PW RCNTMV++W+  S+SE I +SI      + +W++L+ RFSQ
Sbjct: 61   GTLTPP--HSSSILFEPWGRCNTMVISWLQHSISEKIVKSILWFDTASDIWQDLKARFSQ 118

Query: 492  GDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSV 671
            GD+FR++ +Q++LYKF QG+L++++YFTQLK  WDE+++ RPL  CKC+IAC+CGAVDS 
Sbjct: 119  GDVFRVAQLQEDLYKFHQGSLDVTEYFTQLKEMWDEIDNLRPLSRCKCSIACSCGAVDSS 178

Query: 672  KIYREQDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVLDPIVH 848
              YREQD VIRFL+GLND+++  +SQIM M PLP +   FS++ QQER +  S     +H
Sbjct: 179  YKYREQDAVIRFLRGLNDQYTHVRSQIMLMDPLPSLSKTFSLVGQQERHLNQSA----IH 234

Query: 849  DAPETEVSTTLLA-----------NSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHC 995
            D  +   +T+  +           + Q                        G+ ++CTHC
Sbjct: 235  DDTKVLAATSFGSLPQTPTTQQHQSPQQQQFGFRRGGYSHGRGRGRGGRTHGSIKICTHC 294

Query: 996  NRTNHTIDS*WVKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGL 1175
             R NHT+D+ + KHG+PPG++ KG      S +  VNA E  T + S+    SN   FG 
Sbjct: 295  GRNNHTVDTCYFKHGFPPGYQSKGGT----SANFTVNAVE--TTSPSSMVPESNNPNFGF 348

Query: 1176 TQEQYQGILSMVXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSS 1307
            TQEQ Q +LS++                N V ++PLA+N  S++
Sbjct: 349  TQEQCQELLSLL--QQSKTIPTPSSHSANSVVSSPLAMNFNSNA 390


>dbj|GAU50616.1| hypothetical protein TSUD_290710 [Trifolium subterraneum]
          Length = 404

 Score =  271 bits (693), Expect = 2e-82
 Identities = 161/417 (38%), Positives = 227/417 (54%), Gaps = 22/417 (5%)
 Frame = +3

Query: 159  MATTSYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*RN 338
            MA   Y+DF+TN  NPYY+HPNENP++IL        P L   N        +  +  +N
Sbjct: 1    MANQPYADFATNPTNPYYIHPNENPSIILVT------PLLDHKNYQTWSRSMKVALISKN 54

Query: 339  PPKLQN--------TDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRT 479
              K  +        +DP++ PWIRCN MVL+WI RS+SE+I +SI      A VWK L  
Sbjct: 55   KLKFVDGTLPLPHVSDPLHEPWIRCNNMVLSWIQRSISETIVKSIMWCDCAAVVWKCLER 114

Query: 480  RFSQGDIFRISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGA 659
            RF+ GDIFRI+DI +E+ +++QG L+IS YFT L   W+ELE++RPL  C CAI CTCGA
Sbjct: 115  RFAHGDIFRIADILEEIARYQQGTLDISSYFTHLTTLWEELENFRPLKDCSCAIPCTCGA 174

Query: 660  VDSVKIYREQDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVL- 833
               +K Y+EQD VI+FLKGLN++++  +SQIM + PLP+ID  FS+++QQER++   ++ 
Sbjct: 175  ASDLKKYKEQDKVIKFLKGLNEQYASVRSQIMLLDPLPDIDRCFSLVLQQERQMLIPIIT 234

Query: 834  -DPIVHDAPETEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKG-TNRVCTHCNRTN 1007
             + +   A   +V  T   + ++                      +G  NR CTHC R N
Sbjct: 235  DNSVDQQASIMQVRQTSYNHGKHYTSFSSTHHGGRGRGRGNHHGGRGPNNRTCTHCGRHN 294

Query: 1008 HTIDS*WVKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQ 1187
            H +D+ +  HGYPPG++ K       S+S NV A+ S+        + ++ A     QEQ
Sbjct: 295  HIVDTCFELHGYPPGYQHK------NSKSVNVAATASNATLKEGHINLTS-ATINTIQEQ 347

Query: 1188 YQGIL-----SMVXXXXXXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSDWYS 1343
            Y  IL     S +                N + + P ALNS SS   D+   SDW S
Sbjct: 348  YNQILQLLQHSALQASSTPSNPSPTQASANSIISLPTALNSSSSPTFDFNPNSDWCS 404


>ref|XP_006576053.1| PREDICTED: uncharacterized protein LOC102662412 [Glycine max]
          Length = 424

 Score =  266 bits (680), Expect = 3e-80
 Identities = 151/389 (38%), Positives = 224/389 (57%), Gaps = 14/389 (3%)
 Frame = +3

Query: 171  SYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*----RN 338
            SYSDF+TN +NPYY+HPNENP+LIL         +     + ++ L  + K+       +
Sbjct: 4    SYSDFATNPSNPYYMHPNENPSLILVQPVLDNKNYQIWCRSMKVALISKNKVKFVDGTLS 63

Query: 339  PPKLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIF 503
            PP +  +DP+Y PW+RCN +VL+W+ RS SE IA+S+      + VWK+L  RFSQGDIF
Sbjct: 64   PPPI--SDPLYEPWLRCNNLVLSWLQRSTSEEIAKSLLWCDRASFVWKSLENRFSQGDIF 121

Query: 504  RISDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSVKIYR 683
            R++DIQ+E+   +QG L+IS YFT+L   W+E+E++RP+  C CAI C+CGA   ++ ++
Sbjct: 122  RVADIQEEVACLQQGTLDISSYFTKLMTLWEEIENFRPIRDCTCAIPCSCGAATDLRKFK 181

Query: 684  EQDYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPE 860
            EQD VI+FLKGL D++S  +SQIM M PLP +D  F++++QQER+         +    +
Sbjct: 182  EQDKVIKFLKGLGDQYSHVRSQIMLMSPLPTLDNAFNLILQQERQFN-------LPSTTD 234

Query: 861  TEVSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHG 1040
            + +      N                         +G NR+CTHCNRTNHT+++ ++KHG
Sbjct: 235  SSIENQSSVNHFSQTPSRPSNNSGCGRGRGYSSGGRG-NRLCTHCNRTNHTVETCFIKHG 293

Query: 1041 YPPGFKGKGKNPF-QQSQSNNVNASESSTQNDSAQASAS---NKAPFGLTQEQYQGILSM 1208
            YPPGF+ +  N     S  N+V  + S+  + S+ AS S   + A     QEQY  IL +
Sbjct: 294  YPPGFQHRKSNSSGNASVVNSVQDAGSAHISSSSSASTSTNGSSASLSTIQEQYTQILQL 353

Query: 1209 VXXXXXXXXXXXXXXXXNFVSTTPLALNS 1295
            +                N  ST+P ++NS
Sbjct: 354  L-------------QQSNLQSTSPSSVNS 369


>ref|XP_019455138.1| PREDICTED: uncharacterized protein LOC109356267 [Lupinus
            angustifolius]
          Length = 834

 Score =  273 bits (698), Expect = 9e-79
 Identities = 156/367 (42%), Positives = 211/367 (57%), Gaps = 20/367 (5%)
 Frame = +3

Query: 171  SYSDFSTNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*RNP--P 344
            ++S  S    NP++LH NENPAL+L         +     A RL LE + K+   N   P
Sbjct: 3    NHSILSDQLNNPFFLHSNENPALVLVTPPMNTKDYHSWARAMRLTLESKNKLNFINGSLP 62

Query: 345  KLQNTDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFRI 509
            +    DP+Y PW+RCNTMVL+WI   + ESI +SI      A  WK+L  RFS GDIFRI
Sbjct: 63   RPSPKDPLYGPWVRCNTMVLSWIQHCVDESIVKSILWIDTTAEAWKDLHDRFSHGDIFRI 122

Query: 510  SDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSVKIYREQ 689
            + +Q E Y   QGNL+ISDYFT+LK  WDE+E +RP P CKC   C CGA+DS+K Y+EQ
Sbjct: 123  AALQKEFYHLDQGNLDISDYFTKLKTLWDEIEDFRPFPSCKCNTPCICGAMDSLKTYKEQ 182

Query: 690  DYVIRFLKGLNDRFSQTKSQIM-MKPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPETE 866
            DYVIRFL+GLN++F+  KSQIM M PLP I   F++LIQQER+    V   +  D     
Sbjct: 183  DYVIRFLEGLNEQFAHVKSQIMLMDPLPNITKAFALLIQQERQTQLPVPPSLEPDNRVMN 242

Query: 867  VSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKG------------TNRVCTHCNRTNH 1010
            VS+    +SQY                   +P +G             NR CT+C RTNH
Sbjct: 243  VSSR--QDSQY-----RNNSTNNSFRGRGIIPFRGRGNRAAGFGRGQNNRFCTYCERTNH 295

Query: 1011 TIDS*WVKHGYPPGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASASNKAPFGLTQEQY 1190
            TI++ ++KHGYPPG++    +      +    + ++ST N++A  + +N   F  T+EQ 
Sbjct: 296  TIETCYLKHGYPPGYQSTRSSKMVNHTTG--YSFDTSTNNEAAHQTQNNSTSF--TKEQV 351

Query: 1191 QGILSMV 1211
            QGIL ++
Sbjct: 352  QGILDLL 358


>ref|XP_016673106.1| PREDICTED: uncharacterized protein LOC107892548 [Gossypium hirsutum]
          Length = 366

 Score =  251 bits (642), Expect = 3e-75
 Identities = 147/401 (36%), Positives = 219/401 (54%), Gaps = 16/401 (3%)
 Frame = +3

Query: 189  TNSANPYYLHPNENPALILFLHH*IELPHLGKINAHRLDLEEQRKIC*RNPPKLQN---- 356
            T  ++PYYLHPNENPAL+L        P L   N H         +  +N  +  N    
Sbjct: 7    TLPSSPYYLHPNENPALVLVS------PVLSSSNYHSWSRAMTMALLSKNKLQFVNGTIT 60

Query: 357  ----TDPMYAPWIRCNTMVLAWIHRSLSESIAQSI-----PAGVWKNLRTRFSQGDIFRI 509
                TDP+Y+ W RCNTMVL+W+H S+S SI  S+      + VW++LR RFSQGD+FRI
Sbjct: 61   VPLRTDPLYSAWERCNTMVLSWLHHSISPSIMNSVLWLDFASDVWRDLRERFSQGDVFRI 120

Query: 510  SDIQDELYKFRQGNLEISDYFTQLKVFWDELESYRPLPLCKCAIACTCGAVDSVKIYREQ 689
            SD+Q+E+  F+Q +  ++DYFT+LK+ WDEL ++RP+P+C C  +C+CG   +++ Y + 
Sbjct: 121  SDLQEEINSFKQEDRSVTDYFTELKILWDELMNFRPIPVCSCPTSCSCGVFATLQKYHDN 180

Query: 690  DYVIRFLKGLNDRFSQTKSQIMM-KPLPEIDTVFSMLIQQEREITHSVLDPIVHDAPETE 866
            DYVIRFLKGL+DRF+  +SQIM+  PLP I+  FS++IQQER +       +   + +  
Sbjct: 181  DYVIRFLKGLHDRFAAVRSQIMLIDPLPTINKAFSLVIQQERHL-------LAASSSQLF 233

Query: 867  VSTTLLANSQYXXXXXXXXXXXXXXXXXXXLPAKGTNRVCTHCNRTNHTIDS*WVKHGYP 1046
            VS TL  +                        +   +R CT C ++ HT+D+ + KHGYP
Sbjct: 234  VSNTLRQHPSSRKSQP---------------KSASDSRQCTFCGKSRHTVDTCYEKHGYP 278

Query: 1047 PGFKGKGKNPFQQSQSNNVNASESSTQNDSAQASA--SNKAPFGLTQEQYQGILSMVXXX 1220
            PG+K +G+     S+++ V     +   D++Q+ A     +P  LTQ+Q Q +L+++   
Sbjct: 279  PGYKSRGRT----SRAHAVLTDGEAPSLDASQSVAILPPDSPVTLTQDQLQQLLTLL--- 331

Query: 1221 XXXXXXXXXXXXXNFVSTTPLALNSQSSSDHDWLQGSDWYS 1343
                         N  S+ PL     S S    +   DWYS
Sbjct: 332  ---PSSTSPTHVTNTASSLPL---QPSVSSGPSIFDDDWYS 366

