BLASTX nr result
ID: Astragalus24_contig00021072
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus24_contig00021072 (709 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003626840.1| NYN domain protein [Medicago truncatula] >gi... 333 e-108 ref|XP_003626882.2| NYN domain protein [Medicago truncatula] >gi... 331 e-107 ref|XP_013449336.1| endonuclease or glycosyl hydrolase [Medicago... 337 e-107 ref|XP_013457542.1| NYN domain protein [Medicago truncatula] >gi... 316 e-103 ref|XP_004502893.1| PREDICTED: uncharacterized protein LOC101500... 251 8e-74 gb|OIV89239.1| hypothetical protein TanjilG_24359 [Lupinus angus... 246 1e-71 gb|KYP69395.1| hypothetical protein KK1_008585 [Cajanus cajan] 238 7e-70 ref|XP_020210769.1| uncharacterized protein LOC109795685 [Cajanu... 238 3e-69 gb|PNY09478.1| limkain-B1, partial [Trifolium pratense] 220 8e-67 ref|XP_013461549.1| endonuclease or glycosyl hydrolase [Medicago... 232 1e-66 ref|XP_019417454.1| PREDICTED: uncharacterized protein LOC109328... 231 2e-66 ref|XP_003602534.1| endonuclease or glycosyl hydrolase [Medicago... 232 2e-66 ref|XP_019417452.1| PREDICTED: uncharacterized protein LOC109328... 231 3e-66 ref|XP_019417453.1| PREDICTED: uncharacterized protein LOC109328... 231 3e-66 ref|XP_019417451.1| PREDICTED: uncharacterized protein LOC109328... 231 3e-66 gb|PNY12010.1| limkain-B1 [Trifolium pratense] 221 1e-62 dbj|GAU13177.1| hypothetical protein TSUD_179200 [Trifolium subt... 215 2e-60 ref|XP_014630372.1| PREDICTED: uncharacterized protein LOC102660... 211 4e-59 ref|XP_014630954.1| PREDICTED: uncharacterized protein LOC106794... 198 2e-54 gb|KHN34394.1| hypothetical protein glysoja_016722 [Glycine soja] 195 1e-53 >ref|XP_003626840.1| NYN domain protein [Medicago truncatula] gb|AET01316.1| NYN domain protein [Medicago truncatula] Length = 643 Score = 333 bits (855), Expect = e-108 Identities = 171/238 (71%), Positives = 189/238 (79%), Gaps = 2/238 (0%) Frame = +2 Query: 2 ALCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQN 181 ALCHAATIMW+WSSMLKG+DLTGKHFN+PPDGP SWY NSN+PLENPFS V+ TSSQN Sbjct: 171 ALCHAATIMWEWSSMLKGDDLTGKHFNYPPDGPTYSWYENSNVPLENPFSVVELHTSSQN 230 Query: 182 VEIHKSYSDIKLGEVSKS--DIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDM 355 E E+ K DIKL + SKS S QVMKIL SHPNGISIGDLRAELT CDM Sbjct: 231 SE----------EEIYKPTLDIKLSQASKSFSSQVMKILCSHPNGISIGDLRAELTNCDM 280 Query: 356 PLGKHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLVPSTTSAVKNKQKDSAATQKVHNE 535 PL K FYG+KKFSNFL+S+ +VQLQYLG NFWV LVPSTTSAVKN Q D AATQK+ N+ Sbjct: 281 PLVKRFYGNKKFSNFLISMSYVQLQYLGGDNFWVCLVPSTTSAVKNNQNDGAATQKLPND 340 Query: 536 GNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSFPSFMESNVH 709 G MDRSAD +PKISS V+SEGD+ KSFQS PSQGKP+GEY DGKSS P FM+S VH Sbjct: 341 GKNMDRSADGVPKISSSCVNSEGDDLKSFQSIPSQGKPLGEYADGKSSTPLFMDSIVH 398 >ref|XP_003626882.2| NYN domain protein [Medicago truncatula] gb|AET01358.2| NYN domain protein [Medicago truncatula] Length = 617 Score = 331 bits (848), Expect = e-107 Identities = 170/238 (71%), Positives = 188/238 (78%), Gaps = 2/238 (0%) Frame = +2 Query: 2 ALCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQN 181 ALCHAATIMW+WSSMLKG+DLTGKHFN+PPDGP SWY NSN+PLENPFS V+ TSSQN Sbjct: 169 ALCHAATIMWEWSSMLKGDDLTGKHFNYPPDGPTYSWYENSNVPLENPFSVVELHTSSQN 228 Query: 182 VEIHKSYSDIKLGEVSKS--DIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDM 355 E E+ K D KL + SKS S QVMKIL SHPNGISIGDLRAELT CDM Sbjct: 229 SE----------EEIYKPTLDKKLSQASKSFSSQVMKILCSHPNGISIGDLRAELTNCDM 278 Query: 356 PLGKHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLVPSTTSAVKNKQKDSAATQKVHNE 535 PL K FYG+KKFSNFL+S+ +VQLQYLG NFWV LVPSTTSAVKN Q D AATQK+ N+ Sbjct: 279 PLVKRFYGNKKFSNFLISMSYVQLQYLGGNNFWVCLVPSTTSAVKNNQNDGAATQKLPND 338 Query: 536 GNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSFPSFMESNVH 709 G MDRSAD +PKISS V+SEGD+ KSFQS PSQGKP+GEY DGKSS P FM+S VH Sbjct: 339 GKNMDRSADGVPKISSSCVNSEGDDLKSFQSIPSQGKPLGEYADGKSSTPLFMDSIVH 396 >ref|XP_013449336.1| endonuclease or glycosyl hydrolase [Medicago truncatula] gb|KEH23363.1| endonuclease or glycosyl hydrolase [Medicago truncatula] Length = 853 Score = 337 bits (864), Expect = e-107 Identities = 174/238 (73%), Positives = 189/238 (79%), Gaps = 2/238 (0%) Frame = +2 Query: 2 ALCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQN 181 ALCHA TIMW WSS+LKGEDLTGKHFNHPPDGP NS YGNSN+PLENPFS VD TSSQN Sbjct: 101 ALCHAPTIMWDWSSLLKGEDLTGKHFNHPPDGPTNSRYGNSNVPLENPFSLVDFHTSSQN 160 Query: 182 VEIHKSYSDIKLGEVSKS--DIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDM 355 E E+ K DIKL E SKS SRQVMKIL SHPNGISIGDLRAELTKCD+ Sbjct: 161 AE-----------EIYKPTLDIKLCEASKSVSRQVMKILCSHPNGISIGDLRAELTKCDL 209 Query: 356 PLGKHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLVPSTTSAVKNKQKDSAATQKVHNE 535 PL K FYG+KKFS+FLVS+ +VQLQYLG NFWVRLVPSTTSAVKNKQKD TQ +H+E Sbjct: 210 PLDKRFYGNKKFSDFLVSMSYVQLQYLGGGNFWVRLVPSTTSAVKNKQKDCVLTQNLHDE 269 Query: 536 GNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSFPSFMESNVH 709 G MDRSAD +P+ISS VS EGD+ KSFQ PSQG P+GEY DGK SFPS +ESNVH Sbjct: 270 GKNMDRSADGVPRISSSCVSCEGDDLKSFQFLPSQGNPLGEYADGKPSFPS-LESNVH 326 Score = 157 bits (397), Expect = 2e-40 Identities = 91/180 (50%), Positives = 110/180 (61%), Gaps = 13/180 (7%) Frame = +2 Query: 5 LCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFN-----------SWYGNSNMPLENPFS 151 +C ATI+WQWSS+LKG+ LTGK+FNHPPD WY NS +P ENPFS Sbjct: 594 ICKVATIVWQWSSVLKGKYLTGKYFNHPPDWYIRKREQTPLEYPPEWYRNSKVPHENPFS 653 Query: 152 AVDPPTSSQNVEIHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLR 331 A + PTSSQN +I S IKL V +S SR++ K+LSS+PNGISIGDL Sbjct: 654 AAEEPTSSQNAKILD----------PSSYIKLIYVQQSVSRKIRKVLSSYPNGISIGDLT 703 Query: 332 AELTKCDMPLGKHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLVPSTTSAV--KNKQKD 505 L C G+ KK SN L SIP VQL Y+G+ NF VRL+PSTTS KN+Q+D Sbjct: 704 FHLGDC---FGRGLPDRKKLSNILASIPDVQLLYIGDDNFCVRLMPSTTSNAEEKNEQRD 760 >ref|XP_013457542.1| NYN domain protein [Medicago truncatula] gb|KEH31573.1| NYN domain protein [Medicago truncatula] Length = 516 Score = 316 bits (810), Expect = e-103 Identities = 162/227 (71%), Positives = 180/227 (79%) Frame = +2 Query: 2 ALCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQN 181 ALCHAATIMW+WSSML G+DLTGKHFN+PPDGP NS Y NSN PLENPFS V+ TSSQN Sbjct: 178 ALCHAATIMWEWSSMLNGDDLTGKHFNYPPDGPTNSCYENSNAPLENPFSVVELHTSSQN 237 Query: 182 VEIHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPL 361 E E+SK + + KS SRQVMKIL SHPNGISIGDLRAELTKCD+PL Sbjct: 238 SE-----------EISKPTLDI----KSFSRQVMKILCSHPNGISIGDLRAELTKCDVPL 282 Query: 362 GKHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLVPSTTSAVKNKQKDSAATQKVHNEGN 541 K FYG+KKFS+FL+SI +V+LQYLG NFWV LVPSTTSAVKN QKD A TQKV N+G Sbjct: 283 VKRFYGNKKFSDFLISISYVELQYLGGDNFWVCLVPSTTSAVKNNQKDGATTQKVRNDGK 342 Query: 542 TMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSF 682 MDRSAD +PKISS VSSEGD+ KSFQS PSQGKP+GEY DGKSS+ Sbjct: 343 NMDRSADGVPKISSSCVSSEGDDLKSFQSIPSQGKPLGEYADGKSSY 389 >ref|XP_004502893.1| PREDICTED: uncharacterized protein LOC101500766 [Cicer arietinum] Length = 1034 Score = 251 bits (642), Expect = 8e-74 Identities = 131/235 (55%), Positives = 157/235 (66%), Gaps = 13/235 (5%) Frame = +2 Query: 14 AATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQNVEIH 193 AATIMWQW S+LKGEDLTGKHFNHPPDGPF SWYGNS +PLE+PFS+ D TSS NVEIH Sbjct: 189 AATIMWQWPSLLKGEDLTGKHFNHPPDGPFGSWYGNSKVPLEDPFSSADQSTSSSNVEIH 248 Query: 194 KSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLGKHF 373 + G + KS R+V ILSSHPNGI+I DLR ELTKCD+ LGK Sbjct: 249 EP--------------SPGGIPKSFMRRVRHILSSHPNGIAISDLRTELTKCDVSLGKSM 294 Query: 374 YGHKKFSNFLVSIPHVQLQYLGEVNFWVRLV-------------PSTTSAVKNKQKDSAA 514 +G+K FS L+SIPHVQL++LG +F V LV PS+ SAV+N + Sbjct: 295 FGYKSFSRLLLSIPHVQLKHLGHGSFCVHLVTSEPPEAFERSTAPSSASAVENDESGYTI 354 Query: 515 TQKVHNEGNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSS 679 T K+HNEG +DR A P ISSL+ ++ D+SKS +S PSQGKPI E+V KSS Sbjct: 355 TPKLHNEGKNIDRDAHRTPLISSLHEKTDRDDSKSLKSIPSQGKPIKEFVSHKSS 409 >gb|OIV89239.1| hypothetical protein TanjilG_24359 [Lupinus angustifolius] Length = 1041 Score = 246 bits (627), Expect = 1e-71 Identities = 137/239 (57%), Positives = 154/239 (64%), Gaps = 13/239 (5%) Frame = +2 Query: 5 LCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQNV 184 LC AATI WQW S++KGEDLTGKH NHPPDGPF SWYGN MPLENPFSAV+ TSS V Sbjct: 179 LCSAATIAWQWPSLIKGEDLTGKHLNHPPDGPFGSWYGNYKMPLENPFSAVEQSTSSPAV 238 Query: 185 EIHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLG 364 EI+K + K G + KS +K +V IL+SHP GISI DLRAEL KCDM L Sbjct: 239 EIYKPTLESKPGVIPKSIVK----------KVRYILNSHPKGISIFDLRAELAKCDMHLD 288 Query: 365 KHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLV-------------PSTTSAVKNKQKD 505 K FYGHK FS FL+SIP++QLQ LG NF VRL+ PSTTSA+K+K K Sbjct: 289 KSFYGHKTFSRFLLSIPNIQLQSLGNGNFCVRLIHQRSSEPAKSIVLPSTTSAMKDKGKG 348 Query: 506 SAATQKVHNEGNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSF 682 AT K E M R ADE I+SL+ S D SKSFQ PS K EYVDGK SF Sbjct: 349 YGATPKSDGEVKNMVRDADETLSIASLHERSIVDGSKSFQQVPSLDKSSVEYVDGKPSF 407 >gb|KYP69395.1| hypothetical protein KK1_008585 [Cajanus cajan] Length = 795 Score = 238 bits (606), Expect = 7e-70 Identities = 133/244 (54%), Positives = 153/244 (62%), Gaps = 13/244 (5%) Frame = +2 Query: 5 LCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQNV 184 L AATIMWQWSS+LKGE+L+GKHFNHPPDGPF SWYGN +PLENPF A +P TS QNV Sbjct: 82 LFSAATIMWQWSSLLKGENLSGKHFNHPPDGPFGSWYGNFKVPLENPFLAAEPSTSLQNV 141 Query: 185 EIHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLG 364 EI++ D+K G VSKS RQV ILSSHP GISI DL AEL KCD L Sbjct: 142 EINE----------PSLDLKSGAVSKSLVRQVKHILSSHPKGISITDLHAELAKCDEHLD 191 Query: 365 KHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRL-------------VPSTTSAVKNKQKD 505 K YG + FS FL+SIPH+QLQ LG NF V L VP T+S+VKN+++ Sbjct: 192 KSLYGFRSFSRFLLSIPHIQLQPLGSANFRVFLLASESPEPFDSSVVPLTSSSVKNEERG 251 Query: 506 SAATQKVHNEGNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSFP 685 AD P ++ S D+SKSFQ PSQGK IGEYVDGKSSFP Sbjct: 252 G---------------YADGTPSNATFRARSMNDDSKSFQPVPSQGKTIGEYVDGKSSFP 296 Query: 686 SFME 697 S +E Sbjct: 297 SSVE 300 >ref|XP_020210769.1| uncharacterized protein LOC109795685 [Cajanus cajan] Length = 897 Score = 238 bits (606), Expect = 3e-69 Identities = 133/244 (54%), Positives = 153/244 (62%), Gaps = 13/244 (5%) Frame = +2 Query: 5 LCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQNV 184 L AATIMWQWSS+LKGE+L+GKHFNHPPDGPF SWYGN +PLENPF A +P TS QNV Sbjct: 169 LFSAATIMWQWSSLLKGENLSGKHFNHPPDGPFGSWYGNFKVPLENPFLAAEPSTSLQNV 228 Query: 185 EIHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLG 364 EI++ D+K G VSKS RQV ILSSHP GISI DL AEL KCD L Sbjct: 229 EINE----------PSLDLKSGAVSKSLVRQVKHILSSHPKGISITDLHAELAKCDEHLD 278 Query: 365 KHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRL-------------VPSTTSAVKNKQKD 505 K YG + FS FL+SIPH+QLQ LG NF V L VP T+S+VKN+++ Sbjct: 279 KSLYGFRSFSRFLLSIPHIQLQPLGSANFRVFLLASESPEPFDSSVVPLTSSSVKNEERG 338 Query: 506 SAATQKVHNEGNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSFP 685 AD P ++ S D+SKSFQ PSQGK IGEYVDGKSSFP Sbjct: 339 G---------------YADGTPSNATFRARSMNDDSKSFQPVPSQGKTIGEYVDGKSSFP 383 Query: 686 SFME 697 S +E Sbjct: 384 SSVE 387 >gb|PNY09478.1| limkain-B1, partial [Trifolium pratense] Length = 378 Score = 220 bits (560), Expect = 8e-67 Identities = 124/211 (58%), Positives = 135/211 (63%) Frame = +2 Query: 2 ALCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQN 181 ALCHAATIMW+WSSMLKGEDL GKHFN+PPDG PT+S Sbjct: 64 ALCHAATIMWEWSSMLKGEDLIGKHFNYPPDG----------------------PTNSW- 100 Query: 182 VEIHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPL 361 QVMKIL SHPNGISIGDLRAELTKCD+PL Sbjct: 101 -------------------------------QVMKILCSHPNGISIGDLRAELTKCDLPL 129 Query: 362 GKHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLVPSTTSAVKNKQKDSAATQKVHNEGN 541 K FYG+KKFSNFLVSI HVQLQYLG NFWV LVPST SAVKNKQ + AATQK+HNEG Sbjct: 130 VKRFYGNKKFSNFLVSISHVQLQYLGGDNFWVCLVPSTKSAVKNKQSNGAATQKLHNEGK 189 Query: 542 TMDRSADEIPKISSLYVSSEGDNSKSFQSNP 634 +DRSAD IP+ISS VSS+G+ S S NP Sbjct: 190 NLDRSADGIPRISSSCVSSDGEIS-SVDVNP 219 >ref|XP_013461549.1| endonuclease or glycosyl hydrolase [Medicago truncatula] gb|KEH35584.1| endonuclease or glycosyl hydrolase [Medicago truncatula] Length = 1027 Score = 232 bits (591), Expect = 1e-66 Identities = 131/235 (55%), Positives = 155/235 (65%), Gaps = 14/235 (5%) Frame = +2 Query: 14 AATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQNVEIH 193 AATIMWQW+S+LKGE+LTGKHFNHPPDG F SWYGNS +PLENPFSA TSSQNV+I Sbjct: 51 AATIMWQWTSLLKGENLTGKHFNHPPDGQFGSWYGNSKVPLENPFSATGQSTSSQNVQI- 109 Query: 194 KSYSDIKLGEVSKSDIKLGE-VSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLGKH 370 +++ E S SD+KL E V KS RQV ILSSHPNGIS DLRAEL K + LG+ Sbjct: 110 -----VEINEPS-SDLKLAEGVPKSVIRQVKDILSSHPNGISAIDLRAELAKRGVILGRS 163 Query: 371 FYGHKKFSNFLVSIPHVQLQYLGEVNFWV-------------RLVPSTTSAVKNKQKDSA 511 +G+++ S FL SIP V LQ LG+ NF V +VPSTT AVKN++KD Sbjct: 164 MFGYRRLSRFLSSIPDVHLQNLGDGNFCVCLIPSESPEPSEKSIVPSTTYAVKNEEKDYT 223 Query: 512 ATQKVHNEGNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKS 676 T K+H E +D P +SS + D+SKSFQS PSQ KPIGE V KS Sbjct: 224 TTPKLHGEDKELDGDKHRTPSMSSSHERIVEDDSKSFQSFPSQEKPIGEDVSHKS 278 >ref|XP_019417454.1| PREDICTED: uncharacterized protein LOC109328441 isoform X4 [Lupinus angustifolius] Length = 1013 Score = 231 bits (589), Expect = 2e-66 Identities = 129/247 (52%), Positives = 154/247 (62%), Gaps = 13/247 (5%) Frame = +2 Query: 5 LCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQNV 184 LC AATI WQWSS++KGEDL GKHFNHPPDGPF SWYGN MPLENPFS V+ TSS V Sbjct: 183 LCSAATITWQWSSLIKGEDLAGKHFNHPPDGPFGSWYGNYKMPLENPFSTVEQSTSSPAV 242 Query: 185 EIHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLG 364 EI++ + K G + KS R+V ILS H GISI DLRAEL KCD+ + Sbjct: 243 EIYE----------PTPESKPGVIPKSVLRRVRHILSLHTKGISISDLRAELAKCDVYVD 292 Query: 365 KHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLVP-------------STTSAVKNKQKD 505 K YGHK FS FL+SIP+VQL+ LG+ NF+VRL+ TTSAVK ++K Sbjct: 293 KSLYGHKTFSRFLLSIPNVQLRSLGDGNFFVRLIRPGSPEPAESTILLPTTSAVKGEEKG 352 Query: 506 SAATQKVHNEGNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSFP 685 AT K + + R ADE ISSL D+SKSFQ PS GEYVDGK+S+ Sbjct: 353 YVATLKSNGVVSDNARDADETHSISSLDERIMDDDSKSFQQVPSPDTSSGEYVDGKASYS 412 Query: 686 SFMESNV 706 +E +V Sbjct: 413 PSIEGHV 419 >ref|XP_003602534.1| endonuclease or glycosyl hydrolase [Medicago truncatula] gb|AES72785.1| endonuclease or glycosyl hydrolase [Medicago truncatula] Length = 1166 Score = 232 bits (591), Expect = 2e-66 Identities = 131/235 (55%), Positives = 155/235 (65%), Gaps = 14/235 (5%) Frame = +2 Query: 14 AATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQNVEIH 193 AATIMWQW+S+LKGE+LTGKHFNHPPDG F SWYGNS +PLENPFSA TSSQNV+I Sbjct: 190 AATIMWQWTSLLKGENLTGKHFNHPPDGQFGSWYGNSKVPLENPFSATGQSTSSQNVQI- 248 Query: 194 KSYSDIKLGEVSKSDIKLGE-VSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLGKH 370 +++ E S SD+KL E V KS RQV ILSSHPNGIS DLRAEL K + LG+ Sbjct: 249 -----VEINEPS-SDLKLAEGVPKSVIRQVKDILSSHPNGISAIDLRAELAKRGVILGRS 302 Query: 371 FYGHKKFSNFLVSIPHVQLQYLGEVNFWV-------------RLVPSTTSAVKNKQKDSA 511 +G+++ S FL SIP V LQ LG+ NF V +VPSTT AVKN++KD Sbjct: 303 MFGYRRLSRFLSSIPDVHLQNLGDGNFCVCLIPSESPEPSEKSIVPSTTYAVKNEEKDYT 362 Query: 512 ATQKVHNEGNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKS 676 T K+H E +D P +SS + D+SKSFQS PSQ KPIGE V KS Sbjct: 363 TTPKLHGEDKELDGDKHRTPSMSSSHERIVEDDSKSFQSFPSQEKPIGEDVSHKS 417 >ref|XP_019417452.1| PREDICTED: uncharacterized protein LOC109328441 isoform X2 [Lupinus angustifolius] Length = 1090 Score = 231 bits (589), Expect = 3e-66 Identities = 129/247 (52%), Positives = 154/247 (62%), Gaps = 13/247 (5%) Frame = +2 Query: 5 LCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQNV 184 LC AATI WQWSS++KGEDL GKHFNHPPDGPF SWYGN MPLENPFS V+ TSS V Sbjct: 183 LCSAATITWQWSSLIKGEDLAGKHFNHPPDGPFGSWYGNYKMPLENPFSTVEQSTSSPAV 242 Query: 185 EIHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLG 364 EI++ + K G + KS R+V ILS H GISI DLRAEL KCD+ + Sbjct: 243 EIYE----------PTPESKPGVIPKSVLRRVRHILSLHTKGISISDLRAELAKCDVYVD 292 Query: 365 KHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLVP-------------STTSAVKNKQKD 505 K YGHK FS FL+SIP+VQL+ LG+ NF+VRL+ TTSAVK ++K Sbjct: 293 KSLYGHKTFSRFLLSIPNVQLRSLGDGNFFVRLIRPGSPEPAESTILLPTTSAVKGEEKG 352 Query: 506 SAATQKVHNEGNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSFP 685 AT K + + R ADE ISSL D+SKSFQ PS GEYVDGK+S+ Sbjct: 353 YVATLKSNGVVSDNARDADETHSISSLDERIMDDDSKSFQQVPSPDTSSGEYVDGKASYS 412 Query: 686 SFMESNV 706 +E +V Sbjct: 413 PSIEGHV 419 >ref|XP_019417453.1| PREDICTED: uncharacterized protein LOC109328441 isoform X3 [Lupinus angustifolius] Length = 1090 Score = 231 bits (589), Expect = 3e-66 Identities = 129/247 (52%), Positives = 154/247 (62%), Gaps = 13/247 (5%) Frame = +2 Query: 5 LCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQNV 184 LC AATI WQWSS++KGEDL GKHFNHPPDGPF SWYGN MPLENPFS V+ TSS V Sbjct: 183 LCSAATITWQWSSLIKGEDLAGKHFNHPPDGPFGSWYGNYKMPLENPFSTVEQSTSSPAV 242 Query: 185 EIHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLG 364 EI++ + K G + KS R+V ILS H GISI DLRAEL KCD+ + Sbjct: 243 EIYE----------PTPESKPGVIPKSVLRRVRHILSLHTKGISISDLRAELAKCDVYVD 292 Query: 365 KHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLVP-------------STTSAVKNKQKD 505 K YGHK FS FL+SIP+VQL+ LG+ NF+VRL+ TTSAVK ++K Sbjct: 293 KSLYGHKTFSRFLLSIPNVQLRSLGDGNFFVRLIRPGSPEPAESTILLPTTSAVKGEEKG 352 Query: 506 SAATQKVHNEGNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSFP 685 AT K + + R ADE ISSL D+SKSFQ PS GEYVDGK+S+ Sbjct: 353 YVATLKSNGVVSDNARDADETHSISSLDERIMDDDSKSFQQVPSPDTSSGEYVDGKASYS 412 Query: 686 SFMESNV 706 +E +V Sbjct: 413 PSIEGHV 419 >ref|XP_019417451.1| PREDICTED: uncharacterized protein LOC109328441 isoform X1 [Lupinus angustifolius] gb|OIV96993.1| hypothetical protein TanjilG_26770 [Lupinus angustifolius] Length = 1105 Score = 231 bits (589), Expect = 3e-66 Identities = 129/247 (52%), Positives = 154/247 (62%), Gaps = 13/247 (5%) Frame = +2 Query: 5 LCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQNV 184 LC AATI WQWSS++KGEDL GKHFNHPPDGPF SWYGN MPLENPFS V+ TSS V Sbjct: 183 LCSAATITWQWSSLIKGEDLAGKHFNHPPDGPFGSWYGNYKMPLENPFSTVEQSTSSPAV 242 Query: 185 EIHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLG 364 EI++ + K G + KS R+V ILS H GISI DLRAEL KCD+ + Sbjct: 243 EIYE----------PTPESKPGVIPKSVLRRVRHILSLHTKGISISDLRAELAKCDVYVD 292 Query: 365 KHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLVP-------------STTSAVKNKQKD 505 K YGHK FS FL+SIP+VQL+ LG+ NF+VRL+ TTSAVK ++K Sbjct: 293 KSLYGHKTFSRFLLSIPNVQLRSLGDGNFFVRLIRPGSPEPAESTILLPTTSAVKGEEKG 352 Query: 506 SAATQKVHNEGNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSFP 685 AT K + + R ADE ISSL D+SKSFQ PS GEYVDGK+S+ Sbjct: 353 YVATLKSNGVVSDNARDADETHSISSLDERIMDDDSKSFQQVPSPDTSSGEYVDGKASYS 412 Query: 686 SFMESNV 706 +E +V Sbjct: 413 PSIEGHV 419 >gb|PNY12010.1| limkain-B1 [Trifolium pratense] Length = 1102 Score = 221 bits (562), Expect = 1e-62 Identities = 119/241 (49%), Positives = 154/241 (63%), Gaps = 13/241 (5%) Frame = +2 Query: 5 LCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQNV 184 LC AATIMWQW+S+LKGE+LTGKHFNHPPDG F SWYGNS +PLENPFSA + SSQ V Sbjct: 186 LCSAATIMWQWTSLLKGENLTGKHFNHPPDGQFGSWYGNSKVPLENPFSAAEQSPSSQKV 245 Query: 185 EIHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLG 364 +I +++ E S G KS R + IL+ HPNGI++ DLRAELT+C++ LG Sbjct: 246 QI------VEINEPSSDLQPAGGNPKSVVRIIKHILNLHPNGIAVSDLRAELTECNVSLG 299 Query: 365 KHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLVP-------------STTSAVKNKQKD 505 ++ G+K+ FL SIP+V LQ +G NF V+L+P ST SAVKN+++ Sbjct: 300 RNICGYKRLYRFLSSIPNVHLQKVGNGNFCVKLLPSESPEPSESSTTLSTASAVKNEERG 359 Query: 506 SAATQKVHNEGNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSFP 685 T K+H+E MD+ + P + SL D+SKS QS PSQG+PI E V +SS Sbjct: 360 YTTTPKLHSEDKDMDKDIYKNPSLYSLQERIIEDDSKSLQSIPSQGRPIEEDVPHESSLG 419 Query: 686 S 688 S Sbjct: 420 S 420 >dbj|GAU13177.1| hypothetical protein TSUD_179200 [Trifolium subterraneum] Length = 1090 Score = 215 bits (547), Expect = 2e-60 Identities = 119/243 (48%), Positives = 154/243 (63%), Gaps = 15/243 (6%) Frame = +2 Query: 5 LCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQNV 184 LC A+TIMWQWSS+LKGE+LTGKHFNHPPDG F SWYGNS +PLENPFSA + S Q V Sbjct: 185 LCSASTIMWQWSSLLKGENLTGKHFNHPPDGHFGSWYGNSKVPLENPFSAAEQSPSPQKV 244 Query: 185 EIHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLG 364 +I +++ E S G KS R + IL+ HPNGISI DLRAELTKC++ LG Sbjct: 245 QI------VEINEPSSDLKSAGGEPKSVIRIIKHILNLHPNGISISDLRAELTKCNVSLG 298 Query: 365 KHFYGHKKFSNFLVSIPHVQLQYL--GEVNFWVRLVP-------------STTSAVKNKQ 499 ++ +G+K+ FL SIP+V LQ + G NF V+L+P ST SA+KN++ Sbjct: 299 RNVFGYKRLYRFLSSIPNVHLQNVGNGNGNFCVKLLPSEFPEPSESSTTLSTASAIKNEE 358 Query: 500 KDSAATQKVHNEGNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSS 679 + T K+H+E MD+ P + SL D+SKSFQS+PSQ +PI E + +SS Sbjct: 359 RGYTTTPKLHSEDKDMDKDEYRSP-LYSLQEGINEDDSKSFQSSPSQERPIEEDMPHESS 417 Query: 680 FPS 688 S Sbjct: 418 IHS 420 >ref|XP_014630372.1| PREDICTED: uncharacterized protein LOC102660946 [Glycine max] gb|KRH64104.1| hypothetical protein GLYMA_04G216400 [Glycine max] Length = 1387 Score = 211 bits (538), Expect = 4e-59 Identities = 116/231 (50%), Positives = 148/231 (64%) Frame = +2 Query: 5 LCHAATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMPLENPFSAVDPPTSSQNV 184 L AATIMWQW +LKGE+L GKH NHPPDGP+ SWYGN +PLENP A + TS ++V Sbjct: 173 LSTAATIMWQWYKLLKGENLMGKHVNHPPDGPYGSWYGNFRVPLENPSPATEHSTSLEDV 232 Query: 185 EIHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLG 364 EI++ D+KLGEV KS ++V IL+SH GISI DL A+L KCD+ L Sbjct: 233 EIYE----------PSLDLKLGEVPKSVVQKVKHILTSHSKGISITDLHADLAKCDVHLD 282 Query: 365 KHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLVPSTTSAVKNKQKDSAATQKVHNEGNT 544 ++ YG + S FL+SIPHVQLQ LG+ NF V LVPS + + +A+ V NE Sbjct: 283 QNLYGFQSVSCFLLSIPHVQLQPLGDGNFCVCLVPSGSPEPFDSSVVPSASSVVKNEERG 342 Query: 545 MDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSFPSFME 697 + +E P IS+++ S D+S SFQ P Q K IGEYV+ KSSFPS +E Sbjct: 343 YE---NETPSISTVHARSMNDDSTSFQPVPPQVKTIGEYVNSKSSFPSLVE 390 Score = 82.8 bits (203), Expect = 2e-14 Identities = 52/127 (40%), Positives = 67/127 (52%), Gaps = 13/127 (10%) Frame = +2 Query: 227 SKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLGKHFYGHKKFSNFLV 406 S D+KLG V +S RQ+ IL SHP GISI LR EL K ++ + FYG+K FS FL Sbjct: 861 SDLDLKLGTVPQSIIRQIRCILRSHPKGISITGLREELKKSNVCFYQSFYGYKTFSRFLS 920 Query: 407 SIPHVQLQYLGEVNFWVRL-------------VPSTTSAVKNKQKDSAATQKVHNEGNTM 547 SIPHVQLQ LG F V L V S TSA K + ++ + ++ N + Sbjct: 921 SIPHVQLQPLGHGKFCVHLVSLESPETFESNDVQSITSAAKIDESNAKSDDLAAHQNNVV 980 Query: 548 DRSADEI 568 D + Sbjct: 981 SHLEDSM 987 >ref|XP_014630954.1| PREDICTED: uncharacterized protein LOC106794208 [Glycine max] gb|KRH57196.1| hypothetical protein GLYMA_05G045500 [Glycine max] gb|KRH57197.1| hypothetical protein GLYMA_05G045500 [Glycine max] Length = 1126 Score = 198 bits (503), Expect = 2e-54 Identities = 118/246 (47%), Positives = 144/246 (58%), Gaps = 15/246 (6%) Frame = +2 Query: 14 AATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMP--LENPFSAVDPPTSSQNVE 187 AATI WQWSSMLKGE+LTGK FNHPPDG + SWYG+ + LE P P +Q Sbjct: 186 AATIAWQWSSMLKGENLTGKCFNHPPDGRYRSWYGSYKLTACLEKP-----APVVAQEAS 240 Query: 188 IHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLGK 367 S +++ E S S LG V KS +QV ILS HP GI I LRAEL KC + L K Sbjct: 241 AAASLPNVEAYEPSSSG--LGSVPKSVVKQVRHILSLHPEGIGISVLRAELAKCGVRLDK 298 Query: 368 HFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLVP-------------STTSAVKNKQKDS 508 F+GHK+FS FL+S+PHVQLQ G+ NF V LVP ST S +++K S Sbjct: 299 GFFGHKRFSRFLLSLPHVQLQPSGDANFSVHLVPWEFPEPCESIPVASTMSGANSEEKGS 358 Query: 509 AATQKVHNEGNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSFPS 688 AAT KV+ E R ADE I+S+ S DN K Q SQG+ EY+DG+SS P Sbjct: 359 AATPKVNGEDKRKVRVADEKFSITSILERSSDDNLKPVQPGLSQGRSNEEYMDGESSSPV 418 Query: 689 FMESNV 706 +E +V Sbjct: 419 LVEKHV 424 >gb|KHN34394.1| hypothetical protein glysoja_016722 [Glycine soja] Length = 1123 Score = 195 bits (496), Expect = 1e-53 Identities = 118/247 (47%), Positives = 143/247 (57%), Gaps = 16/247 (6%) Frame = +2 Query: 14 AATIMWQWSSMLKGEDLTGKHFNHPPDGPFNSWYGNSNMP--LENPFSAVDPPTSS-QNV 184 AATI WQWSSMLKGE+LTGK FNHPPDG + SWYG+ + LE P V +S NV Sbjct: 186 AATIAWQWSSMLKGENLTGKCFNHPPDGRYRSWYGSYKLTACLEKPAPGVASAAASLPNV 245 Query: 185 EIHKSYSDIKLGEVSKSDIKLGEVSKSASRQVMKILSSHPNGISIGDLRAELTKCDMPLG 364 E ++ S LG V KS +QV ILS HP GI I LRAEL KC + L Sbjct: 246 EAYEPSSS-----------GLGSVPKSVVKQVRHILSLHPEGIGISVLRAELAKCGVRLD 294 Query: 365 KHFYGHKKFSNFLVSIPHVQLQYLGEVNFWVRLVP-------------STTSAVKNKQKD 505 K F+GHK+FS FL+S+PHVQLQ G+ NF V LVP ST S +++K Sbjct: 295 KGFFGHKRFSRFLLSLPHVQLQPSGDANFSVHLVPWEFPEPCESIPVASTMSGANSEEKG 354 Query: 506 SAATQKVHNEGNTMDRSADEIPKISSLYVSSEGDNSKSFQSNPSQGKPIGEYVDGKSSFP 685 SAAT KV+ E R ADE I+S+ S DN K Q SQG+ EY+DG+SS P Sbjct: 355 SAATPKVNGEDKRKVRVADEKFSITSILERSSDDNLKPVQPGLSQGRSNEEYMDGESSSP 414 Query: 686 SFMESNV 706 +E +V Sbjct: 415 VLVEKHV 421