BLASTX nr result
ID: Astragalus24_contig00025880
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00025880
         (628 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_007141970.1| hypothetical protein PHAVU_008G241400g [Phas...  368   e-123
ref|XP_004490774.1| PREDICTED: protein SET DOMAIN GROUP 40 [Cice...  367   e-123
gb|KHN39503.1| Protein SET DOMAIN GROUP 40 [Glycine soja]            366   e-123
ref|XP_006596494.1| PREDICTED: protein SET DOMAIN GROUP 40 isofo...  366   e-123
gb|ACU19071.1| unknown [Glycine max]                                 359   e-120
ref|XP_017428330.1| PREDICTED: protein SET DOMAIN GROUP 40 isofo...  358   e-120
gb|KYP48362.1| Protein SET DOMAIN GROUP 40 [Cajanus cajan]           355   e-119
ref|XP_020234126.1| protein SET DOMAIN GROUP 40 isoform X1 [Caja...  355   e-119
ref|XP_014503667.1| histone-lysine N-methyltransferase setd3 [Vi...  355   e-118
ref|XP_019432595.1| PREDICTED: protein SET DOMAIN GROUP 40 [Lupi...  349   e-116
ref|XP_003616150.2| SET domain group 40 protein [Medicago trunca...  341   e-113
ref|XP_017428331.1| PREDICTED: protein SET DOMAIN GROUP 40 isofo...  336   e-111
ref|XP_006596495.1| PREDICTED: protein SET DOMAIN GROUP 40 isofo...  335   e-111
ref|XP_015931772.1| protein SET DOMAIN GROUP 40 isoform X1 [Arac...  334   e-110
ref|XP_015931636.2| LOW QUALITY PROTEIN: protein SET DOMAIN GROU...  331   e-109
ref|XP_016167158.1| protein SET DOMAIN GROUP 40-like isoform X1 ...  329   e-108
dbj|GAU32040.1| hypothetical protein TSUD_213950 [Trifolium subt...  328   e-108
ref|XP_020234127.1| protein SET DOMAIN GROUP 40 isoform X2 [Caja...
323 e-106 ref|XP_016166123.2| LOW QUALITY PROTEIN: protein SET DOMAIN GROU... 307 e-101 gb|PNY14759.1| protein SET domain group 40-like [Trifolium prate... 291 3e-95 >ref|XP_007141970.1| hypothetical protein PHAVU_008G241400g [Phaseolus vulgaris] gb|ESW13964.1| hypothetical protein PHAVU_008G241400g [Phaseolus vulgaris] Length = 497 Score = 368 bits (944), Expect = e-123 Identities = 172/208 (82%), Positives = 190/208 (91%) Frame = +3 Query: 3 ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ R++VMEDKKLC AVNRHS LSS QILIVCLLYEV KGKTSRWHPYLMHLPH YD+L Sbjct: 74 ALMTRENVMEDKKLCFAVNRHSCLSSAQILIVCLLYEVCKGKTSRWHPYLMHLPHTYDIL 133 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 A F EFEK+ALQVDEA+WVTEKA+LKAKSEWKEAHALM+DL+F+PQ LTFKAWVWAAATI Sbjct: 134 AMFDEFEKRALQVDEAVWVTEKAILKAKSEWKEAHALMEDLMFRPQFLTFKAWVWAAATI 193 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLH+PWDEAGCLCPVGDLFNYDAPG E S ED+EHLLSNSSIHD L +GD+NI+V Sbjct: 194 SSRTLHVPWDEAGCLCPVGDLFNYDAPGEESSDIEDLEHLLSNSSIHDTNLLNGDKNIVV 253 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 DAE+ DSHSQRLTDGGF+E+ NAYCFYA Sbjct: 254 DAEQLDSHSQRLTDGGFEENVNAYCFYA 281 >ref|XP_004490774.1| PREDICTED: protein SET DOMAIN GROUP 40 [Cicer arietinum] Length = 494 Score = 367 bits (941), Expect = e-123 Identities = 175/208 (84%), Positives = 190/208 (91%) Frame = +3 Query: 3 ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ R+SVMEDKKLC+AVN+H SLSSVQIL VCLLYEVGKGKTSRWHPYLMHLP +YDVL Sbjct: 70 ALMTRESVMEDKKLCIAVNKHPSLSSVQILTVCLLYEVGKGKTSRWHPYLMHLPQSYDVL 129 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 A FGEFEK ALQVDEA+W+TEKAVLKAKSEWKEAHALM+DL+FKPQLLTFKAWVWAAATI Sbjct: 130 AMFGEFEKNALQVDEAIWITEKAVLKAKSEWKEAHALMEDLMFKPQLLTFKAWVWAAATI 189 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLHIPWDEAGCLCPVGDLFNYDAPG E S EDV++ LSNSSI LS+GD+NI+V Sbjct: 190 
SSRTLHIPWDEAGCLCPVGDLFNYDAPGEELSGIEDVDNFLSNSSIPVTTLSNGDKNIVV 249 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 D E+ D HSQRLTDGGFDEDANAYCFYA Sbjct: 250 DEEQVDFHSQRLTDGGFDEDANAYCFYA 277 >gb|KHN39503.1| Protein SET DOMAIN GROUP 40 [Glycine soja] Length = 497 Score = 366 bits (940), Expect = e-123 Identities = 173/208 (83%), Positives = 189/208 (90%) Frame = +3 Query: 3 ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ R++VMEDKKLC AVNRHSSLSS QILIVCLLYE+GKGKTSRWHPYLMHLPH YDVL Sbjct: 74 ALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKGKTSRWHPYLMHLPHTYDVL 133 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 A FGEFEK ALQVDEA+WVTEKA+LKAKSEWKEAH+LM DL+FKPQ TFKAWVWAAATI Sbjct: 134 AMFGEFEKHALQVDEAMWVTEKAMLKAKSEWKEAHSLMQDLMFKPQFFTFKAWVWAAATI 193 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLHIPWDEAGCLCPVGDLFNYDAPG EPS ED++ LLSN+SI D + +GD+NIMV Sbjct: 194 SSRTLHIPWDEAGCLCPVGDLFNYDAPGIEPSGIEDLDRLLSNTSIPDTIVLNGDKNIMV 253 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 DAE+ DSHS RLTDGGF+EDANAYCFYA Sbjct: 254 DAEQLDSHSWRLTDGGFEEDANAYCFYA 281 >ref|XP_006596494.1| PREDICTED: protein SET DOMAIN GROUP 40 isoform X1 [Glycine max] gb|KRH17268.1| hypothetical protein GLYMA_14G209800 [Glycine max] Length = 497 Score = 366 bits (940), Expect = e-123 Identities = 173/208 (83%), Positives = 189/208 (90%) Frame = +3 Query: 3 ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ R++VMEDKKLC AVNRHSSLSS QILIVCLLYE+GKGKTSRWHPYLMHLPH YDVL Sbjct: 74 ALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKGKTSRWHPYLMHLPHTYDVL 133 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 A FGEFEK ALQVDEA+WVTEKA+LKAKSEWKEAH+LM DL+FKPQ TFKAWVWAAATI Sbjct: 134 AMFGEFEKHALQVDEAMWVTEKAMLKAKSEWKEAHSLMQDLMFKPQFFTFKAWVWAAATI 193 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLHIPWDEAGCLCPVGDLFNYDAPG EPS ED++ LLSN+SI D + +GD+NIMV Sbjct: 194 
SSRTLHIPWDEAGCLCPVGDLFNYDAPGIEPSGIEDLDRLLSNTSIPDTIVLNGDKNIMV 253 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 DAE+ DSHS RLTDGGF+EDANAYCFYA Sbjct: 254 DAEQLDSHSWRLTDGGFEEDANAYCFYA 281 >gb|ACU19071.1| unknown [Glycine max] Length = 497 Score = 359 bits (922), Expect = e-120 Identities = 171/208 (82%), Positives = 188/208 (90%) Frame = +3 Query: 3 ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ R++VMEDKKLC AVNRHSSLSS QILIVCLLYE+GKGKTSRWHPYLMHLPH YDVL Sbjct: 74 ALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKGKTSRWHPYLMHLPHTYDVL 133 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 A FGEFEK ALQVDEA+WVTEKA+LKAKSEWKEAH+LM DL+FKPQ TFKAWV AAATI Sbjct: 134 AMFGEFEKHALQVDEAMWVTEKAMLKAKSEWKEAHSLMQDLMFKPQFFTFKAWVRAAATI 193 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLHIPWDEAGCLCPVGDLFNYDAPG EPS ED++ LLSN+SI D + +GD+NI+V Sbjct: 194 SSRTLHIPWDEAGCLCPVGDLFNYDAPGIEPSGIEDLDRLLSNTSIPDTIVLNGDKNIVV 253 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 DAE+ DSHS RLTDGGF+EDANAYCFYA Sbjct: 254 DAEQLDSHSWRLTDGGFEEDANAYCFYA 281 >ref|XP_017428330.1| PREDICTED: protein SET DOMAIN GROUP 40 isoform X1 [Vigna angularis] gb|KOM47098.1| hypothetical protein LR48_Vigan07g080200 [Vigna angularis] dbj|BAT81312.1| hypothetical protein VIGAN_03100100 [Vigna angularis var. 
angularis] Length = 497 Score = 358 bits (920), Expect = e-120 Identities = 169/208 (81%), Positives = 186/208 (89%) Frame = +3 Query: 3 ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ R+SVMED+KLC AV+RHS LSS Q+LIVCLLYE+GKGKTSRWHPYLMHLPH YD+L Sbjct: 74 ALMTRESVMEDEKLCFAVSRHSCLSSAQVLIVCLLYEMGKGKTSRWHPYLMHLPHTYDIL 133 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 A FGEFEK+ALQVDEA+WVTEKA+LKAKSEWKEA ALM+DL+F+PQ LT KAWVWAAATI Sbjct: 134 AMFGEFEKRALQVDEAVWVTEKAILKAKSEWKEALALMEDLMFRPQFLTLKAWVWAAATI 193 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLHIPWDEAGCLCPVGDLFNYDAPG E S E +EHL SNSSIHD L +G NIMV Sbjct: 194 SSRTLHIPWDEAGCLCPVGDLFNYDAPGEESSDIEGLEHLPSNSSIHDPNLLNGGNNIMV 253 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 DAE+ DSHSQRLTDGGF+ED NAYCFYA Sbjct: 254 DAEQFDSHSQRLTDGGFEEDGNAYCFYA 281 >gb|KYP48362.1| Protein SET DOMAIN GROUP 40 [Cajanus cajan] Length = 453 Score = 355 bits (912), Expect = e-119 Identities = 165/208 (79%), Positives = 188/208 (90%) Frame = +3 Query: 3 ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ R+SVMEDK+L +AVN+HS+LSS Q+LIVCLLYE+GKGKTSRWHPYLMHLPH YDVL Sbjct: 73 ALMTRESVMEDKRLSVAVNKHSALSSAQMLIVCLLYEMGKGKTSRWHPYLMHLPHTYDVL 132 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 A FGEFEK+ALQVDEA+WVTEKA +KA+SEWKEAHALM+DL+FKPQ LTFKAW+WAAATI Sbjct: 133 AMFGEFEKRALQVDEAIWVTEKATVKARSEWKEAHALMEDLMFKPQFLTFKAWIWAAATI 192 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLHIPWDEAGCLCPVGDLFNYDAPG EPS ED+EHLLSN+SI D+ + D +IM Sbjct: 193 SSRTLHIPWDEAGCLCPVGDLFNYDAPGEEPSDVEDLEHLLSNTSIPDSIKLNVDNDIMA 252 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 +AE+ D HSQRLTDGGF+ED NAYCFYA Sbjct: 253 EAEQLDPHSQRLTDGGFEEDMNAYCFYA 280 >ref|XP_020234126.1| protein SET DOMAIN GROUP 40 isoform X1 [Cajanus cajan] Length = 496 Score = 355 bits (912), Expect = e-119 Identities = 165/208 (79%), Positives = 188/208 (90%) Frame = +3 
Query: 3 ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ R+SVMEDK+L +AVN+HS+LSS Q+LIVCLLYE+GKGKTSRWHPYLMHLPH YDVL Sbjct: 73 ALMTRESVMEDKRLSVAVNKHSALSSAQMLIVCLLYEMGKGKTSRWHPYLMHLPHTYDVL 132 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 A FGEFEK+ALQVDEA+WVTEKA +KA+SEWKEAHALM+DL+FKPQ LTFKAW+WAAATI Sbjct: 133 AMFGEFEKRALQVDEAIWVTEKATVKARSEWKEAHALMEDLMFKPQFLTFKAWIWAAATI 192 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLHIPWDEAGCLCPVGDLFNYDAPG EPS ED+EHLLSN+SI D+ + D +IM Sbjct: 193 SSRTLHIPWDEAGCLCPVGDLFNYDAPGEEPSDVEDLEHLLSNTSIPDSIKLNVDNDIMA 252 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 +AE+ D HSQRLTDGGF+ED NAYCFYA Sbjct: 253 EAEQLDPHSQRLTDGGFEEDMNAYCFYA 280 >ref|XP_014503667.1| histone-lysine N-methyltransferase setd3 [Vigna radiata var. radiata] Length = 497 Score = 355 bits (910), Expect = e-118 Identities = 166/208 (79%), Positives = 185/208 (88%) Frame = +3 Query: 3 ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ R+SVMED+KLC V+RHS LSS Q+LIVCLLYE+GKGKTSRWHPYLMHLPH YD+L Sbjct: 74 ALMTRESVMEDEKLCFVVSRHSCLSSAQVLIVCLLYEMGKGKTSRWHPYLMHLPHTYDIL 133 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 A FGEFEK+ALQVDEA+WVTEKA+LKAKSEWKEA ALM+DL+F+PQ LT KAW+WAAATI Sbjct: 134 AMFGEFEKRALQVDEAVWVTEKAILKAKSEWKEALALMEDLMFRPQFLTLKAWLWAAATI 193 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLHIPWDEAGCLCPVGDLFNYDAPG E S E +EH SNSSIHD L +G +NIMV Sbjct: 194 SSRTLHIPWDEAGCLCPVGDLFNYDAPGEESSDIEGLEHFPSNSSIHDPNLLNGGKNIMV 253 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 DAE+ DSHSQRLTDGGF+ED NAYCFYA Sbjct: 254 DAEQFDSHSQRLTDGGFEEDVNAYCFYA 281 >ref|XP_019432595.1| PREDICTED: protein SET DOMAIN GROUP 40 [Lupinus angustifolius] gb|OIW16121.1| hypothetical protein TanjilG_18836 [Lupinus angustifolius] Length = 490 Score = 349 bits (895), Expect = e-116 Identities = 165/208 (79%), Positives = 182/208 (87%) Frame = +3 Query: 3 
ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ RDSVMEDKKLC +VN HSSLS QIL VCLLYE+GKGKTSRWHPYLMHLP +YD+L Sbjct: 68 ALMTRDSVMEDKKLCFSVNNHSSLSPTQILAVCLLYEMGKGKTSRWHPYLMHLPQSYDIL 127 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 A FGEFEK ALQVDEA+WVTEKAVLKAKSEWKEA ALM++L KP+LLTFKAWVWAAATI Sbjct: 128 AMFGEFEKHALQVDEAVWVTEKAVLKAKSEWKEAQALMEELKLKPRLLTFKAWVWAAATI 187 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLH+PWDEAGCLCPVGDLFNYDAPG EP D E LLSNSS+H LS+G ++V Sbjct: 188 SSRTLHVPWDEAGCLCPVGDLFNYDAPGDEPCSIGDGEDLLSNSSVHVTDLSNGGNTMLV 247 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 D+E+ DSHSQRLTDGGF+EDANAYCFYA Sbjct: 248 DSEQLDSHSQRLTDGGFEEDANAYCFYA 275 >ref|XP_003616150.2| SET domain group 40 protein [Medicago truncatula] gb|AES99108.2| SET domain group 40 protein [Medicago truncatula] Length = 488 Score = 341 bits (874), Expect = e-113 Identities = 167/209 (79%), Positives = 182/209 (87%), Gaps = 1/209 (0%) Frame = +3 Query: 3 ALLNRDSV-MEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDV 179 AL+ +SV MEDKKLCLAVNRHSSLSSVQIL VCLLYEVGKGKTSRWHPYL+HLP +YD+ Sbjct: 74 ALMTSESVIMEDKKLCLAVNRHSSLSSVQILTVCLLYEVGKGKTSRWHPYLVHLPQSYDL 133 Query: 180 LATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAAT 359 LA FGEFEKQALQVDEA+WVTEKAV KAKSEWKEAHALM+DL+FKPQLLTFKAWVWAAAT Sbjct: 134 LAMFGEFEKQALQVDEAMWVTEKAVQKAKSEWKEAHALMEDLMFKPQLLTFKAWVWAAAT 193 Query: 360 ISSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIM 539 ISSRTLHIPWDEAGCLCPVGDLFNYDAPG E S EDV+H LSN GD N++ Sbjct: 194 ISSRTLHIPWDEAGCLCPVGDLFNYDAPGEELSGVEDVDHFLSN----------GDMNVV 243 Query: 540 VDAEENDSHSQRLTDGGFDEDANAYCFYA 626 +D + D +SQRLTDGGF+EDANAYCFYA Sbjct: 244 IDEGQIDFNSQRLTDGGFEEDANAYCFYA 272 >ref|XP_017428331.1| PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Vigna angularis] Length = 486 Score = 336 bits (861), Expect = e-111 Identities = 162/208 (77%), Positives = 177/208 (85%) Frame = +3 Query: 3 
ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ R+SVMED+KLC AV LIVCLLYE+GKGKTSRWHPYLMHLPH YD+L Sbjct: 74 ALMTRESVMEDEKLCFAV-----------LIVCLLYEMGKGKTSRWHPYLMHLPHTYDIL 122 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 A FGEFEK+ALQVDEA+WVTEKA+LKAKSEWKEA ALM+DL+F+PQ LT KAWVWAAATI Sbjct: 123 AMFGEFEKRALQVDEAVWVTEKAILKAKSEWKEALALMEDLMFRPQFLTLKAWVWAAATI 182 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLHIPWDEAGCLCPVGDLFNYDAPG E S E +EHL SNSSIHD L +G NIMV Sbjct: 183 SSRTLHIPWDEAGCLCPVGDLFNYDAPGEESSDIEGLEHLPSNSSIHDPNLLNGGNNIMV 242 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 DAE+ DSHSQRLTDGGF+ED NAYCFYA Sbjct: 243 DAEQFDSHSQRLTDGGFEEDGNAYCFYA 270 >ref|XP_006596495.1| PREDICTED: protein SET DOMAIN GROUP 40 isoform X2 [Glycine max] Length = 483 Score = 335 bits (858), Expect = e-111 Identities = 161/208 (77%), Positives = 177/208 (85%) Frame = +3 Query: 3 ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ R++VMEDKKLC AVNRHSSLSS QILIVCLLYE+GKGKTSRWHPYLMHLPH YDV Sbjct: 74 ALMTRETVMEDKKLCDAVNRHSSLSSAQILIVCLLYEMGKGKTSRWHPYLMHLPHTYDV- 132 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 DEA+WVTEKA+LKAKSEWKEAH+LM DL+FKPQ TFKAWVWAAATI Sbjct: 133 -------------DEAMWVTEKAMLKAKSEWKEAHSLMQDLMFKPQFFTFKAWVWAAATI 179 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLHIPWDEAGCLCPVGDLFNYDAPG EPS ED++ LLSN+SI D + +GD+NIMV Sbjct: 180 SSRTLHIPWDEAGCLCPVGDLFNYDAPGIEPSGIEDLDRLLSNTSIPDTIVLNGDKNIMV 239 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 DAE+ DSHS RLTDGGF+EDANAYCFYA Sbjct: 240 DAEQLDSHSWRLTDGGFEEDANAYCFYA 267 >ref|XP_015931772.1| protein SET DOMAIN GROUP 40 isoform X1 [Arachis duranensis] Length = 494 Score = 334 bits (857), Expect = e-110 Identities = 160/208 (76%), Positives = 179/208 (86%) Frame = +3 Query: 3 ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ DSVM+D L A+NRH SLSS QIL VCLLYEVGK K SRW+PYL+HLP +YD+L 
Sbjct: 71 ALMTTDSVMQDTNLSQALNRHPSLSSTQILNVCLLYEVGKVKASRWYPYLVHLPKSYDIL 130 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 A FGEFEK ALQVDEA+WVTEKAVLKAKSEWK+AHALM+DL FKPQLLTFKAWVWA+ATI Sbjct: 131 AMFGEFEKTALQVDEAIWVTEKAVLKAKSEWKQAHALMEDLKFKPQLLTFKAWVWASATI 190 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLHIPWD AGCLCPVGDLFNYDAPG EPS D+E LLS+SSIHD +LS+ D + Sbjct: 191 SSRTLHIPWDSAGCLCPVGDLFNYDAPGKEPSDIGDLEDLLSSSSIHDGSLSNEDNTTVA 250 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 DAE+ DS SQRLTDGGF++DANAYCFYA Sbjct: 251 DAEQLDSQSQRLTDGGFEDDANAYCFYA 278 >ref|XP_015931636.2| LOW QUALITY PROTEIN: protein SET DOMAIN GROUP 40-like [Arachis duranensis] Length = 514 Score = 331 bits (849), Expect = e-109 Identities = 157/208 (75%), Positives = 178/208 (85%) Frame = +3 Query: 3 ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ DSVM+D L A+NRH SLSS QIL VCLLYEVGKGK SRW+PYLMHLP +YD+L Sbjct: 91 ALITSDSVMQDTNLSQALNRHPSLSSTQILNVCLLYEVGKGKASRWYPYLMHLPKSYDIL 150 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 A FGEFEK ALQVDEA+WVTEKAVLK KS+W++AHALM+DL FKP+LLTFKAWVWA+ATI Sbjct: 151 AMFGEFEKTALQVDEAIWVTEKAVLKTKSDWQQAHALMEDLKFKPRLLTFKAWVWASATI 210 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLHIPWD AGCLCPVGDLFNYDAPG EPS D+E LLS+SSIHD +LS+ D + Sbjct: 211 SSRTLHIPWDSAGCLCPVGDLFNYDAPGKEPSDIGDLEDLLSSSSIHDGSLSNEDNTTVA 270 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 + E+ DS SQRLTDGGF+EDANAYCFYA Sbjct: 271 NTEQLDSQSQRLTDGGFEEDANAYCFYA 298 >ref|XP_016167158.1| protein SET DOMAIN GROUP 40-like isoform X1 [Arachis ipaensis] Length = 494 Score = 329 bits (844), Expect = e-108 Identities = 157/208 (75%), Positives = 177/208 (85%) Frame = +3 Query: 3 ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ SVM+ L A+NRH SLSS QIL VCLLYEVGKGK SRW+PYLMHLP +YD+L Sbjct: 71 ALMTTHSVMQHTNLSQALNRHPSLSSTQILNVCLLYEVGKGKASRWYPYLMHLPKSYDIL 130 Query: 183 
ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 A FGEFEK ALQVDEA+WVTEKAVLK KS+W++AHALM+DL FKP+LLTFKAWVWA+ATI Sbjct: 131 AMFGEFEKTALQVDEAIWVTEKAVLKTKSDWQQAHALMEDLKFKPRLLTFKAWVWASATI 190 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLHIPWD AGCLCPVGDLFNYDAPG EPS D+E LLS+SSIHD +LS+ D + Sbjct: 191 SSRTLHIPWDSAGCLCPVGDLFNYDAPGKEPSDIGDLEDLLSSSSIHDGSLSNEDNTTVA 250 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 DAE+ DS SQRLTDGGF+EDANAYCFYA Sbjct: 251 DAEQLDSQSQRLTDGGFEEDANAYCFYA 278 >dbj|GAU32040.1| hypothetical protein TSUD_213950 [Trifolium subterraneum] Length = 485 Score = 328 bits (841), Expect = e-108 Identities = 162/209 (77%), Positives = 181/209 (86%), Gaps = 1/209 (0%) Frame = +3 Query: 3 ALLNRDSVM-EDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDV 179 ALL +S+M EDKKLCLAVNRHSSLSSVQIL VCLLYEVGKGKTSRWHPYL+HLP +YD+ Sbjct: 70 ALLTSESIMQEDKKLCLAVNRHSSLSSVQILTVCLLYEVGKGKTSRWHPYLVHLPQSYDL 129 Query: 180 LATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAAT 359 LA FGEFEKQALQVDEA+WVTEKAV KAKSEWKEA ALM+DL+FKPQLLTFKAWVWAAAT Sbjct: 130 LAMFGEFEKQALQVDEAMWVTEKAVQKAKSEWKEALALMEDLIFKPQLLTFKAWVWAAAT 189 Query: 360 ISSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIM 539 ISSRTLHIPWDEAGCLCP+GDLFNYDA G E S E+V ALS+GD++I+ Sbjct: 190 ISSRTLHIPWDEAGCLCPIGDLFNYDASGEELSGIENV-----------TALSNGDKSIV 238 Query: 540 VDAEENDSHSQRLTDGGFDEDANAYCFYA 626 VD ++ D +SQRLTDGGF+ED+NAYCFYA Sbjct: 239 VDEDQIDFYSQRLTDGGFEEDSNAYCFYA 267 >ref|XP_020234127.1| protein SET DOMAIN GROUP 40 isoform X2 [Cajanus cajan] Length = 482 Score = 323 bits (829), Expect = e-106 Identities = 153/208 (73%), Positives = 175/208 (84%) Frame = +3 Query: 3 ALLNRDSVMEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVL 182 AL+ R+SVMEDK+L +AVN+HS+LSS Q+LIVCLLYE+GKGKTSRWHPYLMHLPH YDV Sbjct: 73 ALMTRESVMEDKRLSVAVNKHSALSSAQMLIVCLLYEMGKGKTSRWHPYLMHLPHTYDV- 131 Query: 183 ATFGEFEKQALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATI 362 DEA+WVTEKA +KA+SEWKEAHALM+DL+FKPQ 
LTFKAW+WAAATI Sbjct: 132 -------------DEAIWVTEKATVKARSEWKEAHALMEDLMFKPQFLTFKAWIWAAATI 178 Query: 363 SSRTLHIPWDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMV 542 SSRTLHIPWDEAGCLCPVGDLFNYDAPG EPS ED+EHLLSN+SI D+ + D +IM Sbjct: 179 SSRTLHIPWDEAGCLCPVGDLFNYDAPGEEPSDVEDLEHLLSNTSIPDSIKLNVDNDIMA 238 Query: 543 DAEENDSHSQRLTDGGFDEDANAYCFYA 626 +AE+ D HSQRLTDGGF+ED NAYCFYA Sbjct: 239 EAEQLDPHSQRLTDGGFEEDMNAYCFYA 266 >ref|XP_016166123.2| LOW QUALITY PROTEIN: protein SET DOMAIN GROUP 40 [Arachis ipaensis] Length = 411 Score = 307 bits (786), Expect = e-101 Identities = 150/200 (75%), Positives = 166/200 (83%) Frame = +3 Query: 27 MEDKKLCLAVNRHSSLSSVQILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVLATFGEFEK 206 M+ L A+N H SLSS QIL VCLLYEVGKGK SRW+PYLMHLP +YD+LA FGEFEK Sbjct: 1 MQHTNLSQALNSHPSLSSTQILNVCLLYEVGKGKASRWYPYLMHLPKSYDILAMFGEFEK 60 Query: 207 QALQVDEALWVTEKAVLKAKSEWKEAHALMDDLVFKPQLLTFKAWVWAAATISSRTLHIP 386 ALQVDEA+WVTEKAVLK +AHALM+DL FKPQLLTFKAWVWA+ATISSRTLHIP Sbjct: 61 TALQVDEAIWVTEKAVLKX-----QAHALMEDLKFKPQLLTFKAWVWASATISSRTLHIP 115 Query: 387 WDEAGCLCPVGDLFNYDAPGAEPSVTEDVEHLLSNSSIHDAALSSGDENIMVDAEENDSH 566 WD AGCLCPVGDLFNYDAPG EPS D+E LLS+SSIHD +LS+ D + DAE+ DS Sbjct: 116 WDSAGCLCPVGDLFNYDAPGKEPSDIGDLEDLLSSSSIHDGSLSNEDNTTVADAEQLDSQ 175 Query: 567 SQRLTDGGFDEDANAYCFYA 626 SQRLTDGGF+EDANAYCFYA Sbjct: 176 SQRLTDGGFEEDANAYCFYA 195 >gb|PNY14759.1| protein SET domain group 40-like [Trifolium pratense] Length = 373 Score = 291 bits (745), Expect = 3e-95 Identities = 141/180 (78%), Positives = 155/180 (86%) Frame = +3 Query: 87 ILIVCLLYEVGKGKTSRWHPYLMHLPHNYDVLATFGEFEKQALQVDEALWVTEKAVLKAK 266 IL VCLLYEVGKGKTSRWHPYL+HLP +YD+LA FGEFEKQALQVDEA+WVTEKAV KAK Sbjct: 8 ILTVCLLYEVGKGKTSRWHPYLVHLPQSYDLLAMFGEFEKQALQVDEAMWVTEKAVQKAK 67 Query: 267 SEWKEAHALMDDLVFKPQLLTFKAWVWAAATISSRTLHIPWDEAGCLCPVGDLFNYDAPG 446 SEWKEA ALM+DL+FKPQLLTFKAWVWAAATISSRTLHIPWDEAGCLCP+GDLFNYDAPG Sbjct: 68 SEWKEALALMEDLMFKPQLLTFKAWVWAAATISSRTLHIPWDEAGCLCPIGDLFNYDAPG 127 Query: 447 
AEPSVTEDVEHLLSNSSIHDAALSSGDENIMVDAEENDSHSQRLTDGGFDEDANAYCFYA 626 E S E+V ALS+GD++I VD E+ D HSQRLTDGGF+ED+NAYCFYA Sbjct: 128 EELSGIENV-----------TALSNGDKSIDVDEEQIDFHSQRLTDGGFEEDSNAYCFYA 176