BLASTX nr result

ID: Astragalus23_contig00017243 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00017243
         (956 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_013447332.1| cytosolic aldehyde dehydrogenase RF2C [Medic...   493   e-173
ref|XP_003629648.1| cytosolic aldehyde dehydrogenase RF2C [Medic...   493   e-171
ref|XP_016188885.1| aldehyde dehydrogenase family 2 member C4 is...   484   e-167
ref|XP_015954311.1| aldehyde dehydrogenase family 2 member C4 is...   481   e-166
gb|KRH41588.1| hypothetical protein GLYMA_08G039200 [Glycine max]     469   e-164
gb|KRH41585.1| hypothetical protein GLYMA_08G039200 [Glycine max]     469   e-163
ref|XP_004503432.1| PREDICTED: aldehyde dehydrogenase family 2 m...   472   e-163
gb|KRH41587.1| hypothetical protein GLYMA_08G039200 [Glycine max]     469   e-163
ref|XP_020233237.1| aldehyde dehydrogenase family 2 member C4 is...   466   e-162
ref|XP_003530494.1| PREDICTED: aldehyde dehydrogenase family 2 m...   469   e-161
ref|XP_020221618.1| aldehyde dehydrogenase family 2 member C4-li...   468   e-161
gb|KHN20692.1| Aldehyde dehydrogenase family 2 member C4 [Glycin...   467   e-161
ref|XP_003528912.1| PREDICTED: aldehyde dehydrogenase family 2 m...   467   e-161
ref|XP_020233236.1| aldehyde dehydrogenase family 2 member C4 is...   466   e-160
ref|XP_003530501.1| PREDICTED: aldehyde dehydrogenase family 2 m...   466   e-160
ref|XP_016203683.1| aldehyde dehydrogenase family 2 member C4 is...   462   e-160
gb|KHN21546.1| Aldehyde dehydrogenase family 2 member C4 [Glycin...   465   e-160
ref|XP_020221518.1| aldehyde dehydrogenase family 2 member C4-li...   464   e-160
ref|NP_001235519.2| aldehyde dehydrogenase superfamily protein [...   465   e-159
ref|XP_016203682.1| aldehyde dehydrogenase family 2 member C4 is...   462   e-159

>ref|XP_013447332.1| cytosolic aldehyde dehydrogenase RF2C [Medicago truncatula]
 gb|KEH21359.1| cytosolic aldehyde dehydrogenase RF2C [Medicago truncatula]
          Length = 373

 Score =  493 bits (1268), Expect = e-173
 Identities = 255/315 (80%), Positives = 277/315 (87%), Gaps = 4/315 (1%)
 Frame = -1

Query: 935 MTDLLNSTNG----LPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTK 768
           MTDL NS+NG    L  MPTIK+NKLFING+FVDSVSG TFETIDPRTG+VIARISEG K
Sbjct: 1   MTDL-NSSNGDNSSLFKMPTIKYNKLFINGDFVDSVSGSTFETIDPRTGDVIARISEGAK 59

Query: 767 EDIDIAVKAARKAFDLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSL 588
           EDI+IAVKAAR+AFD GPWPRM GVERAKIM+K+AEL+DEN EE+A LDAIDAG++Y   
Sbjct: 60  EDIEIAVKAAREAFDSGPWPRMSGVERAKIMMKFAELIDENIEELATLDAIDAGKVYFIN 119

Query: 587 KAMEVPSSANTLRYYAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFT 408
           KA E+PS+ANTLRYYAGAADKIHG+VLK+ GQ HAYTL+EPIGVVGHIIPWNA T +FFT
Sbjct: 120 KAFEIPSAANTLRYYAGAADKIHGEVLKSSGQFHAYTLMEPIGVVGHIIPWNAPTMVFFT 179

Query: 407 KVSPSLAAGCTMVLKPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMD 228
           KVSPSLAAGCTMVLKPAEQTPLSALF+AHLAKLAGIP+GVLNVVPGFGPTAGAAISSHMD
Sbjct: 180 KVSPSLAAGCTMVLKPAEQTPLSALFYAHLAKLAGIPNGVLNVVPGFGPTAGAAISSHMD 239

Query: 227 IDKVSFTGSVEIGREIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFN 48
           ID VSFTGSVE+GREIMQAAA+SNLKHVSLELGGKSPLIIF             LGIL N
Sbjct: 240 IDVVSFTGSVEVGREIMQAAAKSNLKHVSLELGGKSPLIIFDDADIDKAVELALLGILAN 299

Query: 47  KGEICCASSRVFVQE 3
           KGEIC A SRVFVQE
Sbjct: 300 KGEICVACSRVFVQE 314


>ref|XP_003629648.1| cytosolic aldehyde dehydrogenase RF2C [Medicago truncatula]
 gb|AET04124.1| cytosolic aldehyde dehydrogenase RF2C [Medicago truncatula]
          Length = 503

 Score =  493 bits (1268), Expect = e-171
 Identities = 255/315 (80%), Positives = 277/315 (87%), Gaps = 4/315 (1%)
 Frame = -1

Query: 935 MTDLLNSTNG----LPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTK 768
           MTDL NS+NG    L  MPTIK+NKLFING+FVDSVSG TFETIDPRTG+VIARISEG K
Sbjct: 1   MTDL-NSSNGDNSSLFKMPTIKYNKLFINGDFVDSVSGSTFETIDPRTGDVIARISEGAK 59

Query: 767 EDIDIAVKAARKAFDLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSL 588
           EDI+IAVKAAR+AFD GPWPRM GVERAKIM+K+AEL+DEN EE+A LDAIDAG++Y   
Sbjct: 60  EDIEIAVKAAREAFDSGPWPRMSGVERAKIMMKFAELIDENIEELATLDAIDAGKVYFIN 119

Query: 587 KAMEVPSSANTLRYYAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFT 408
           KA E+PS+ANTLRYYAGAADKIHG+VLK+ GQ HAYTL+EPIGVVGHIIPWNA T +FFT
Sbjct: 120 KAFEIPSAANTLRYYAGAADKIHGEVLKSSGQFHAYTLMEPIGVVGHIIPWNAPTMVFFT 179

Query: 407 KVSPSLAAGCTMVLKPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMD 228
           KVSPSLAAGCTMVLKPAEQTPLSALF+AHLAKLAGIP+GVLNVVPGFGPTAGAAISSHMD
Sbjct: 180 KVSPSLAAGCTMVLKPAEQTPLSALFYAHLAKLAGIPNGVLNVVPGFGPTAGAAISSHMD 239

Query: 227 IDKVSFTGSVEIGREIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFN 48
           ID VSFTGSVE+GREIMQAAA+SNLKHVSLELGGKSPLIIF             LGIL N
Sbjct: 240 IDVVSFTGSVEVGREIMQAAAKSNLKHVSLELGGKSPLIIFDDADIDKAVELALLGILAN 299

Query: 47  KGEICCASSRVFVQE 3
           KGEIC A SRVFVQE
Sbjct: 300 KGEICVACSRVFVQE 314


>ref|XP_016188885.1| aldehyde dehydrogenase family 2 member C4 isoform X1 [Arachis
           ipaensis]
          Length = 499

 Score =  484 bits (1245), Expect = e-167
 Identities = 242/311 (77%), Positives = 269/311 (86%)
 Frame = -1

Query: 935 MTDLLNSTNGLPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDID 756
           MT+L N+     ++ T+KF KLFING+F+DSVSG TFETIDPRTGEVI RISEGT+EDID
Sbjct: 1   MTELTNNGRDCSSV-TVKFTKLFINGQFLDSVSGKTFETIDPRTGEVITRISEGTREDID 59

Query: 755 IAVKAARKAFDLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAME 576
           IAVKAAR AFD GPWPRMP  ERAKIM+KWAEL+DEN EE+A LD IDAG+LYH  KA++
Sbjct: 60  IAVKAARHAFDFGPWPRMPPSERAKIMMKWAELIDENVEELAALDTIDAGKLYHMCKAVD 119

Query: 575 VPSSANTLRYYAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVSP 396
           +P+++NTLRYYAGAADKIHG+VLK  G+ HAYTL+EPIGVVGHIIPWN  T +FF KVSP
Sbjct: 120 IPTASNTLRYYAGAADKIHGEVLKMSGKFHAYTLMEPIGVVGHIIPWNFPTTMFFLKVSP 179

Query: 395 SLAAGCTMVLKPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDKV 216
           SLAAGCTMVLKPAEQTPLSALF+AHLAKLAG+PDGVLNVVPGFGPTAGAAISSHMDIDKV
Sbjct: 180 SLAAGCTMVLKPAEQTPLSALFYAHLAKLAGVPDGVLNVVPGFGPTAGAAISSHMDIDKV 239

Query: 215 SFTGSVEIGREIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGEI 36
           SFTGS E+GREIMQAAA+SNLK VSLELGGKSPLIIF             LGILFNKGE+
Sbjct: 240 SFTGSTEVGREIMQAAAKSNLKQVSLELGGKSPLIIFDDADIDKAADLALLGILFNKGEV 299

Query: 35  CCASSRVFVQE 3
           C ASSRVFVQE
Sbjct: 300 CVASSRVFVQE 310


>ref|XP_015954311.1| aldehyde dehydrogenase family 2 member C4 isoform X1 [Arachis
           duranensis]
          Length = 499

 Score =  481 bits (1237), Expect = e-166
 Identities = 241/311 (77%), Positives = 267/311 (85%)
 Frame = -1

Query: 935 MTDLLNSTNGLPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDID 756
           MT+L N+     ++ T+ F KLFING+F+ SVSG TFETIDPRTGEVI RISEGTKEDID
Sbjct: 1   MTELANNGRDCSSV-TVNFTKLFINGQFLHSVSGKTFETIDPRTGEVITRISEGTKEDID 59

Query: 755 IAVKAARKAFDLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAME 576
           IAVKAAR AFD GPWPRMP  ERAKI++KWAEL+DEN EE+A LD IDAG+LYH  KA++
Sbjct: 60  IAVKAARHAFDFGPWPRMPPSERAKILMKWAELIDENVEELAALDTIDAGKLYHMCKAVD 119

Query: 575 VPSSANTLRYYAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVSP 396
           +P+++NTLRYYAGAADKIHG+VLK  GQ HAYTL+EPIGVVGHIIPWN  T +FF KVSP
Sbjct: 120 IPTASNTLRYYAGAADKIHGEVLKMSGQFHAYTLMEPIGVVGHIIPWNFPTTMFFLKVSP 179

Query: 395 SLAAGCTMVLKPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDKV 216
           SLAAGCTMVLKPAEQTPLSALF+AHLAKLAG+PDGVLNVVPGFGPTAGAAISSHMDIDKV
Sbjct: 180 SLAAGCTMVLKPAEQTPLSALFYAHLAKLAGVPDGVLNVVPGFGPTAGAAISSHMDIDKV 239

Query: 215 SFTGSVEIGREIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGEI 36
           SFTGS E+GREIMQAAA+SNLK VSLELGGKSPLIIF             LGILFNKGE+
Sbjct: 240 SFTGSTEVGREIMQAAAKSNLKQVSLELGGKSPLIIFDDADIDKAADLALLGILFNKGEV 299

Query: 35  CCASSRVFVQE 3
           C ASSRVFVQE
Sbjct: 300 CVASSRVFVQE 310


>gb|KRH41588.1| hypothetical protein GLYMA_08G039200 [Glycine max]
          Length = 349

 Score =  469 bits (1207), Expect = e-164
 Identities = 235/312 (75%), Positives = 265/312 (84%), Gaps = 1/312 (0%)
 Frame = -1

Query: 935 MTDLLNSTNG-LPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDI 759
           MT L N   G L  +PTIKF KLFING+FVDS+SG TFETIDPRTG+VIARISEG KEDI
Sbjct: 1   MTSLTNGDAGSLNKVPTIKFTKLFINGDFVDSLSGKTFETIDPRTGDVIARISEGDKEDI 60

Query: 758 DIAVKAARKAFDLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAM 579
           DIAVKAAR AFD GPWPR+PG ERA+I++KWAE+++ENAEE+A LDAIDAG+LYH  + +
Sbjct: 61  DIAVKAARHAFDNGPWPRLPGSERARILLKWAEIIEENAEELAALDAIDAGKLYHMCRNV 120

Query: 578 EVPSSANTLRYYAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVS 399
           EVP++ANTLRYYAGAADKIHG+VLK   + HAYTLLEP+GVVGHI PWN    +F+ KV+
Sbjct: 121 EVPAAANTLRYYAGAADKIHGEVLKMSREFHAYTLLEPLGVVGHITPWNFPNTMFYIKVA 180

Query: 398 PSLAAGCTMVLKPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDK 219
           PSLAAGCTMVLKPAEQTPLSALF AHLAKLAGIPDGV+NVVPGFGPTAGAA+SSHMD+DK
Sbjct: 181 PSLAAGCTMVLKPAEQTPLSALFSAHLAKLAGIPDGVINVVPGFGPTAGAALSSHMDVDK 240

Query: 218 VSFTGSVEIGREIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGE 39
           VSFTGS + GR IMQAAA+SNLK VSLELGGKSPLIIF             LGIL+NKGE
Sbjct: 241 VSFTGSTQTGRVIMQAAAKSNLKQVSLELGGKSPLIIFDDADIDKATELALLGILYNKGE 300

Query: 38  ICCASSRVFVQE 3
           +C ASSRVFVQE
Sbjct: 301 VCVASSRVFVQE 312


>gb|KRH41585.1| hypothetical protein GLYMA_08G039200 [Glycine max]
          Length = 389

 Score =  469 bits (1207), Expect = e-163
 Identities = 235/312 (75%), Positives = 265/312 (84%), Gaps = 1/312 (0%)
 Frame = -1

Query: 935 MTDLLNSTNG-LPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDI 759
           MT L N   G L  +PTIKF KLFING+FVDS+SG TFETIDPRTG+VIARISEG KEDI
Sbjct: 1   MTSLTNGDAGSLNKVPTIKFTKLFINGDFVDSLSGKTFETIDPRTGDVIARISEGDKEDI 60

Query: 758 DIAVKAARKAFDLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAM 579
           DIAVKAAR AFD GPWPR+PG ERA+I++KWAE+++ENAEE+A LDAIDAG+LYH  + +
Sbjct: 61  DIAVKAARHAFDNGPWPRLPGSERARILLKWAEIIEENAEELAALDAIDAGKLYHMCRNV 120

Query: 578 EVPSSANTLRYYAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVS 399
           EVP++ANTLRYYAGAADKIHG+VLK   + HAYTLLEP+GVVGHI PWN    +F+ KV+
Sbjct: 121 EVPAAANTLRYYAGAADKIHGEVLKMSREFHAYTLLEPLGVVGHITPWNFPNTMFYIKVA 180

Query: 398 PSLAAGCTMVLKPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDK 219
           PSLAAGCTMVLKPAEQTPLSALF AHLAKLAGIPDGV+NVVPGFGPTAGAA+SSHMD+DK
Sbjct: 181 PSLAAGCTMVLKPAEQTPLSALFSAHLAKLAGIPDGVINVVPGFGPTAGAALSSHMDVDK 240

Query: 218 VSFTGSVEIGREIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGE 39
           VSFTGS + GR IMQAAA+SNLK VSLELGGKSPLIIF             LGIL+NKGE
Sbjct: 241 VSFTGSTQTGRVIMQAAAKSNLKQVSLELGGKSPLIIFDDADIDKATELALLGILYNKGE 300

Query: 38  ICCASSRVFVQE 3
           +C ASSRVFVQE
Sbjct: 301 VCVASSRVFVQE 312


>ref|XP_004503432.1| PREDICTED: aldehyde dehydrogenase family 2 member C4-like [Cicer
           arietinum]
          Length = 480

 Score =  472 bits (1215), Expect = e-163
 Identities = 235/307 (76%), Positives = 265/307 (86%), Gaps = 3/307 (0%)
 Frame = -1

Query: 914 TNGLP---NMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDIDIAVK 744
           TNG P   N+PTIKF KLFING+FVD++SG TFETIDPR GEVIARISEG+KEDID+AV+
Sbjct: 2   TNGEPSVTNLPTIKFTKLFINGDFVDAISGKTFETIDPRRGEVIARISEGSKEDIDVAVE 61

Query: 743 AARKAFDLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAMEVPSS 564
           AAR AFD GPWPR+ G ERAKIM+K+AEL+DEN EE+A LDAIDAG+LYH  KA+++P++
Sbjct: 62  AARHAFDSGPWPRLSGAERAKIMMKFAELIDENIEELAALDAIDAGKLYHMCKALDIPAA 121

Query: 563 ANTLRYYAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVSPSLAA 384
           ANTLRYYAGAADKIHG+VLK   + HAYTL+EPIGVVGHIIPWN  T +FF KVSP LAA
Sbjct: 122 ANTLRYYAGAADKIHGEVLKVAREFHAYTLMEPIGVVGHIIPWNFPTSMFFLKVSPCLAA 181

Query: 383 GCTMVLKPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDKVSFTG 204
           GCTM++KPAEQTPLSALF+AHLAKLAGIP+GVLNVVPGFGPTAGAA+SSHMDID VSFTG
Sbjct: 182 GCTMIIKPAEQTPLSALFYAHLAKLAGIPNGVLNVVPGFGPTAGAAVSSHMDIDAVSFTG 241

Query: 203 SVEIGREIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGEICCAS 24
           S + GREIMQAAA+SNLKHVSLELGGKSPLIIF             LGIL NKGE+C AS
Sbjct: 242 STQTGREIMQAAAKSNLKHVSLELGGKSPLIIFDDADIDKATHLALLGILLNKGEVCVAS 301

Query: 23  SRVFVQE 3
           SRVFVQE
Sbjct: 302 SRVFVQE 308


>gb|KRH41587.1| hypothetical protein GLYMA_08G039200 [Glycine max]
          Length = 406

 Score =  469 bits (1207), Expect = e-163
 Identities = 235/312 (75%), Positives = 265/312 (84%), Gaps = 1/312 (0%)
 Frame = -1

Query: 935 MTDLLNSTNG-LPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDI 759
           MT L N   G L  +PTIKF KLFING+FVDS+SG TFETIDPRTG+VIARISEG KEDI
Sbjct: 1   MTSLTNGDAGSLNKVPTIKFTKLFINGDFVDSLSGKTFETIDPRTGDVIARISEGDKEDI 60

Query: 758 DIAVKAARKAFDLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAM 579
           DIAVKAAR AFD GPWPR+PG ERA+I++KWAE+++ENAEE+A LDAIDAG+LYH  + +
Sbjct: 61  DIAVKAARHAFDNGPWPRLPGSERARILLKWAEIIEENAEELAALDAIDAGKLYHMCRNV 120

Query: 578 EVPSSANTLRYYAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVS 399
           EVP++ANTLRYYAGAADKIHG+VLK   + HAYTLLEP+GVVGHI PWN    +F+ KV+
Sbjct: 121 EVPAAANTLRYYAGAADKIHGEVLKMSREFHAYTLLEPLGVVGHITPWNFPNTMFYIKVA 180

Query: 398 PSLAAGCTMVLKPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDK 219
           PSLAAGCTMVLKPAEQTPLSALF AHLAKLAGIPDGV+NVVPGFGPTAGAA+SSHMD+DK
Sbjct: 181 PSLAAGCTMVLKPAEQTPLSALFSAHLAKLAGIPDGVINVVPGFGPTAGAALSSHMDVDK 240

Query: 218 VSFTGSVEIGREIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGE 39
           VSFTGS + GR IMQAAA+SNLK VSLELGGKSPLIIF             LGIL+NKGE
Sbjct: 241 VSFTGSTQTGRVIMQAAAKSNLKQVSLELGGKSPLIIFDDADIDKATELALLGILYNKGE 300

Query: 38  ICCASSRVFVQE 3
           +C ASSRVFVQE
Sbjct: 301 VCVASSRVFVQE 312


>ref|XP_020233237.1| aldehyde dehydrogenase family 2 member C4 isoform X2 [Cajanus
           cajan]
          Length = 407

 Score =  466 bits (1200), Expect = e-162
 Identities = 228/301 (75%), Positives = 263/301 (87%)
 Frame = -1

Query: 905 LPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDIDIAVKAARKAF 726
           + N+ T+KF KLFING+FVDS+SG  FE+IDPRTGEVIARI+EG+KEDID+AVKA+R AF
Sbjct: 1   MTNLNTVKFTKLFINGDFVDSLSGREFESIDPRTGEVIARIAEGSKEDIDVAVKASRVAF 60

Query: 725 DLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAMEVPSSANTLRY 546
           D GPWPRM GVERA+IM+KWA+L+DENAEEIAKLDAIDAG+LYH  KA E+P++ANT+RY
Sbjct: 61  DHGPWPRMTGVERARIMMKWADLIDENAEEIAKLDAIDAGKLYHRCKAFEIPAAANTIRY 120

Query: 545 YAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVSPSLAAGCTMVL 366
           YAGAADKIHG+VLK   + HAYTL+EPIGVVGHIIPWN  + +F +KV+PSLAAGCTMVL
Sbjct: 121 YAGAADKIHGEVLKPAREFHAYTLMEPIGVVGHIIPWNFPSSMFVSKVAPSLAAGCTMVL 180

Query: 365 KPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDKVSFTGSVEIGR 186
           KPAEQTPLSALF+AHLAKLAGIPDGVLNVVPGFGPTAGAAI SHM+IDKVSFTGS E+GR
Sbjct: 181 KPAEQTPLSALFYAHLAKLAGIPDGVLNVVPGFGPTAGAAICSHMEIDKVSFTGSTEVGR 240

Query: 185 EIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGEICCASSRVFVQ 6
           E+M+AAA SNLK VSLELGGKSPL++F             LG+L+NKGEIC A SRVFVQ
Sbjct: 241 EVMRAAANSNLKPVSLELGGKSPLVVFDDADLDKAVDLALLGVLYNKGEICVAGSRVFVQ 300

Query: 5   E 3
           E
Sbjct: 301 E 301


>ref|XP_003530494.1| PREDICTED: aldehyde dehydrogenase family 2 member C4-like [Glycine
           max]
 gb|KHN16598.1| Aldehyde dehydrogenase family 2 member C4 [Glycine soja]
 gb|KRH41586.1| hypothetical protein GLYMA_08G039200 [Glycine max]
          Length = 501

 Score =  469 bits (1207), Expect = e-161
 Identities = 235/312 (75%), Positives = 265/312 (84%), Gaps = 1/312 (0%)
 Frame = -1

Query: 935 MTDLLNSTNG-LPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDI 759
           MT L N   G L  +PTIKF KLFING+FVDS+SG TFETIDPRTG+VIARISEG KEDI
Sbjct: 1   MTSLTNGDAGSLNKVPTIKFTKLFINGDFVDSLSGKTFETIDPRTGDVIARISEGDKEDI 60

Query: 758 DIAVKAARKAFDLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAM 579
           DIAVKAAR AFD GPWPR+PG ERA+I++KWAE+++ENAEE+A LDAIDAG+LYH  + +
Sbjct: 61  DIAVKAARHAFDNGPWPRLPGSERARILLKWAEIIEENAEELAALDAIDAGKLYHMCRNV 120

Query: 578 EVPSSANTLRYYAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVS 399
           EVP++ANTLRYYAGAADKIHG+VLK   + HAYTLLEP+GVVGHI PWN    +F+ KV+
Sbjct: 121 EVPAAANTLRYYAGAADKIHGEVLKMSREFHAYTLLEPLGVVGHITPWNFPNTMFYIKVA 180

Query: 398 PSLAAGCTMVLKPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDK 219
           PSLAAGCTMVLKPAEQTPLSALF AHLAKLAGIPDGV+NVVPGFGPTAGAA+SSHMD+DK
Sbjct: 181 PSLAAGCTMVLKPAEQTPLSALFSAHLAKLAGIPDGVINVVPGFGPTAGAALSSHMDVDK 240

Query: 218 VSFTGSVEIGREIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGE 39
           VSFTGS + GR IMQAAA+SNLK VSLELGGKSPLIIF             LGIL+NKGE
Sbjct: 241 VSFTGSTQTGRVIMQAAAKSNLKQVSLELGGKSPLIIFDDADIDKATELALLGILYNKGE 300

Query: 38  ICCASSRVFVQE 3
           +C ASSRVFVQE
Sbjct: 301 VCVASSRVFVQE 312


>ref|XP_020221618.1| aldehyde dehydrogenase family 2 member C4-like [Cajanus cajan]
 gb|KYP61605.1| Aldehyde dehydrogenase family 2 member C4 [Cajanus cajan]
          Length = 505

 Score =  468 bits (1205), Expect = e-161
 Identities = 233/303 (76%), Positives = 256/303 (84%)
 Frame = -1

Query: 911 NGLPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDIDIAVKAARK 732
           N    +PTI F KLFING FV S+SG TFETIDPRT EVIAR+SEG KEDIDIAVKAAR+
Sbjct: 14  NSFLKIPTINFTKLFINGHFVHSISGRTFETIDPRTEEVIARVSEGDKEDIDIAVKAARE 73

Query: 731 AFDLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAMEVPSSANTL 552
           AFD GPWPR+PG ERAKI++KWAEL++EN EE+A LD ID G+L+   KA+E+PS+ N L
Sbjct: 74  AFDSGPWPRLPGSERAKILMKWAELIEENIEELAALDTIDGGKLHFFNKAVEIPSATNAL 133

Query: 551 RYYAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVSPSLAAGCTM 372
           RYYAGAADKIHG VLK  G+ HAYTLLEP+GVVGHIIPWNA +  FF KVSPSLAAGCTM
Sbjct: 134 RYYAGAADKIHGDVLKMNGEFHAYTLLEPVGVVGHIIPWNAPSLTFFIKVSPSLAAGCTM 193

Query: 371 VLKPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDKVSFTGSVEI 192
           VLKPAEQTPLSALF+AHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDID VSFTGS+E+
Sbjct: 194 VLKPAEQTPLSALFYAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDAVSFTGSIEV 253

Query: 191 GREIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGEICCASSRVF 12
           GRE+MQAAARSNLK VSLELGGKSPLIIF              GILFNKGE+C ASSRVF
Sbjct: 254 GREVMQAAARSNLKPVSLELGGKSPLIIFDDADIDKAAELALFGILFNKGEVCVASSRVF 313

Query: 11  VQE 3
           VQE
Sbjct: 314 VQE 316


>gb|KHN20692.1| Aldehyde dehydrogenase family 2 member C4 [Glycine soja]
          Length = 487

 Score =  467 bits (1202), Expect = e-161
 Identities = 231/298 (77%), Positives = 257/298 (86%)
 Frame = -1

Query: 896 MPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDIDIAVKAARKAFDLG 717
           MP+IKF KLFINGEFVDS+SG  FETIDPRTGEVI RI+EG KEDID+AVKAAR AFD G
Sbjct: 1   MPSIKFTKLFINGEFVDSLSGKEFETIDPRTGEVITRIAEGAKEDIDVAVKAARDAFDYG 60

Query: 716 PWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAMEVPSSANTLRYYAG 537
           PWPRMPG ERAKIM+KWA+L+D+N EEIA LDAIDAG+LYH  KA+++P++ANT+RYYAG
Sbjct: 61  PWPRMPGAERAKIMMKWADLIDQNIEEIAALDAIDAGKLYHWCKAVDIPAAANTIRYYAG 120

Query: 536 AADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVSPSLAAGCTMVLKPA 357
           AADKIHG+VLK   + HAYTLLEPIGVVGHIIPWN  + +F  KVSPSLAAGCTMVLKPA
Sbjct: 121 AADKIHGEVLKASREFHAYTLLEPIGVVGHIIPWNFPSTMFVAKVSPSLAAGCTMVLKPA 180

Query: 356 EQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDKVSFTGSVEIGREIM 177
           EQTPLSALF+AHLAKLAGIPDGVLNVVPGFG TAGAAISSHMDIDKVSFTGS E+GRE+M
Sbjct: 181 EQTPLSALFYAHLAKLAGIPDGVLNVVPGFGQTAGAAISSHMDIDKVSFTGSTEVGREVM 240

Query: 176 QAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGEICCASSRVFVQE 3
           +AAA SNLK VSLELGGKSP+I+F             +GILFNKGEIC A SRV VQE
Sbjct: 241 RAAANSNLKPVSLELGGKSPVIVFDDADVDKAAGLALMGILFNKGEICVAGSRVLVQE 298


>ref|XP_003528912.1| PREDICTED: aldehyde dehydrogenase family 2 member C4-like isoform
           X1 [Glycine max]
 gb|KRH48412.1| hypothetical protein GLYMA_07G087500 [Glycine max]
          Length = 501

 Score =  467 bits (1202), Expect = e-161
 Identities = 231/298 (77%), Positives = 257/298 (86%)
 Frame = -1

Query: 896 MPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDIDIAVKAARKAFDLG 717
           MP+IKF KLFINGEFVDS+SG  FETIDPRTGEVI RI+EG KEDID+AVKAAR AFD G
Sbjct: 15  MPSIKFTKLFINGEFVDSLSGKEFETIDPRTGEVITRIAEGAKEDIDVAVKAARDAFDYG 74

Query: 716 PWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAMEVPSSANTLRYYAG 537
           PWPRMPG ERAKIM+KWA+L+D+N EEIA LDAIDAG+LYH  KA+++P++ANT+RYYAG
Sbjct: 75  PWPRMPGAERAKIMMKWADLIDQNIEEIAALDAIDAGKLYHWCKAVDIPAAANTIRYYAG 134

Query: 536 AADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVSPSLAAGCTMVLKPA 357
           AADKIHG+VLK   + HAYTLLEPIGVVGHIIPWN  + +F  KVSPSLAAGCTMVLKPA
Sbjct: 135 AADKIHGEVLKASREFHAYTLLEPIGVVGHIIPWNFPSTMFVAKVSPSLAAGCTMVLKPA 194

Query: 356 EQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDKVSFTGSVEIGREIM 177
           EQTPLSALF+AHLAKLAGIPDGVLNVVPGFG TAGAAISSHMDIDKVSFTGS E+GRE+M
Sbjct: 195 EQTPLSALFYAHLAKLAGIPDGVLNVVPGFGQTAGAAISSHMDIDKVSFTGSTEVGREVM 254

Query: 176 QAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGEICCASSRVFVQE 3
           +AAA SNLK VSLELGGKSP+I+F             +GILFNKGEIC A SRV VQE
Sbjct: 255 RAAANSNLKPVSLELGGKSPVIVFDDADVDKAAGLALMGILFNKGEICVAGSRVLVQE 312


>ref|XP_020233236.1| aldehyde dehydrogenase family 2 member C4 isoform X1 [Cajanus
           cajan]
          Length = 490

 Score =  466 bits (1200), Expect = e-160
 Identities = 228/301 (75%), Positives = 263/301 (87%)
 Frame = -1

Query: 905 LPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDIDIAVKAARKAF 726
           + N+ T+KF KLFING+FVDS+SG  FE+IDPRTGEVIARI+EG+KEDID+AVKA+R AF
Sbjct: 1   MTNLNTVKFTKLFINGDFVDSLSGREFESIDPRTGEVIARIAEGSKEDIDVAVKASRVAF 60

Query: 725 DLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAMEVPSSANTLRY 546
           D GPWPRM GVERA+IM+KWA+L+DENAEEIAKLDAIDAG+LYH  KA E+P++ANT+RY
Sbjct: 61  DHGPWPRMTGVERARIMMKWADLIDENAEEIAKLDAIDAGKLYHRCKAFEIPAAANTIRY 120

Query: 545 YAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVSPSLAAGCTMVL 366
           YAGAADKIHG+VLK   + HAYTL+EPIGVVGHIIPWN  + +F +KV+PSLAAGCTMVL
Sbjct: 121 YAGAADKIHGEVLKPAREFHAYTLMEPIGVVGHIIPWNFPSSMFVSKVAPSLAAGCTMVL 180

Query: 365 KPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDKVSFTGSVEIGR 186
           KPAEQTPLSALF+AHLAKLAGIPDGVLNVVPGFGPTAGAAI SHM+IDKVSFTGS E+GR
Sbjct: 181 KPAEQTPLSALFYAHLAKLAGIPDGVLNVVPGFGPTAGAAICSHMEIDKVSFTGSTEVGR 240

Query: 185 EIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGEICCASSRVFVQ 6
           E+M+AAA SNLK VSLELGGKSPL++F             LG+L+NKGEIC A SRVFVQ
Sbjct: 241 EVMRAAANSNLKPVSLELGGKSPLVVFDDADLDKAVDLALLGVLYNKGEICVAGSRVFVQ 300

Query: 5   E 3
           E
Sbjct: 301 E 301


>ref|XP_003530501.1| PREDICTED: aldehyde dehydrogenase family 2 member C4-like [Glycine
           max]
 gb|KHN16597.1| Aldehyde dehydrogenase family 2 member C4 [Glycine soja]
 gb|KRH41589.1| hypothetical protein GLYMA_08G039300 [Glycine max]
          Length = 505

 Score =  466 bits (1200), Expect = e-160
 Identities = 233/303 (76%), Positives = 256/303 (84%)
 Frame = -1

Query: 911 NGLPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDIDIAVKAARK 732
           N    MP IKF KLFING+FVDS+SG TFETIDPRT EVIAR+SEG KEDIDIAVKAAR+
Sbjct: 14  NSFLQMPPIKFTKLFINGDFVDSLSGRTFETIDPRTEEVIARVSEGDKEDIDIAVKAARQ 73

Query: 731 AFDLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAMEVPSSANTL 552
           AFD GPWPR+P  ERAKIM+KWA+L+DEN EE+A LD +DAG+L +  K +E+PS+ N L
Sbjct: 74  AFDSGPWPRLPASERAKIMMKWADLIDENIEELAALDTVDAGKLNYINKVVEIPSATNAL 133

Query: 551 RYYAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVSPSLAAGCTM 372
           RYYAGAADKIHG+VLK  G  HAYTLLEPIGVVGHIIPWNA +  FF KVSPSLAAGCTM
Sbjct: 134 RYYAGAADKIHGEVLKMNGDFHAYTLLEPIGVVGHIIPWNAPSLSFFIKVSPSLAAGCTM 193

Query: 371 VLKPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDKVSFTGSVEI 192
           VLKPAEQTPLSALF+AHLAKLAGIPDGVLN+VPGFGPTAGAAISSHMDID VSFTGS+E+
Sbjct: 194 VLKPAEQTPLSALFYAHLAKLAGIPDGVLNIVPGFGPTAGAAISSHMDIDVVSFTGSIEV 253

Query: 191 GREIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGEICCASSRVF 12
           GRE+MQAAARSNLK VSLELGGKSPLIIF              GI+ NKGEIC ASSRVF
Sbjct: 254 GREVMQAAARSNLKPVSLELGGKSPLIIFNDADIDKAAQLALFGIMSNKGEICVASSRVF 313

Query: 11  VQE 3
           VQE
Sbjct: 314 VQE 316


>ref|XP_016203683.1| aldehyde dehydrogenase family 2 member C4 isoform X2 [Arachis
           ipaensis]
          Length = 421

 Score =  462 bits (1190), Expect = e-160
 Identities = 228/296 (77%), Positives = 257/296 (86%)
 Frame = -1

Query: 890 TIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDIDIAVKAARKAFDLGPW 711
           T+KF KLFING+FVDS+SG  FETIDPRTGEVIARISEG KEDID AV+AAR+AFD GPW
Sbjct: 15  TVKFTKLFINGQFVDSLSGSEFETIDPRTGEVIARISEGRKEDIDAAVEAAREAFDTGPW 74

Query: 710 PRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAMEVPSSANTLRYYAGAA 531
           PRMPG ERAKIM+KWA+L+D+NA E+A LDAIDAG+LY+  KA E+P++AN LRYYAGAA
Sbjct: 75  PRMPGAERAKIMLKWADLIDQNASELAALDAIDAGKLYNRCKAHEIPAAANMLRYYAGAA 134

Query: 530 DKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVSPSLAAGCTMVLKPAEQ 351
           DKIHG+VLK  G+ HAYTL+EP+GVVGHIIPWN  + +FF KVSP+LAAGCTMVLKPAEQ
Sbjct: 135 DKIHGEVLKASGEFHAYTLMEPVGVVGHIIPWNFPSNMFFIKVSPALAAGCTMVLKPAEQ 194

Query: 350 TPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDKVSFTGSVEIGREIMQA 171
           TPLSALF+AHLAK AGIPDGVLNVVPGFGPTAGAAISSHM++DKVSFTGS E+GREIM A
Sbjct: 195 TPLSALFYAHLAKQAGIPDGVLNVVPGFGPTAGAAISSHMNVDKVSFTGSTEVGREIMHA 254

Query: 170 AARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGEICCASSRVFVQE 3
           AA SNLK VSLELGGKSPL+IF             LGI++NKGEIC ASSRVFVQE
Sbjct: 255 AASSNLKQVSLELGGKSPLLIFDDADVDKAASLALLGIVYNKGEICVASSRVFVQE 310


>gb|KHN21546.1| Aldehyde dehydrogenase family 2 member C4 [Glycine soja]
          Length = 504

 Score =  465 bits (1196), Expect = e-160
 Identities = 236/315 (74%), Positives = 259/315 (82%), Gaps = 4/315 (1%)
 Frame = -1

Query: 935 MTDLLNST----NGLPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTK 768
           M+ L NS+    N    MP IKF KLFING+FVDS+SG TFETIDPR  EVIAR+SEG K
Sbjct: 1   MSALSNSSSSHGNSFLKMPAIKFTKLFINGDFVDSISGRTFETIDPRKEEVIARVSEGDK 60

Query: 767 EDIDIAVKAARKAFDLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSL 588
           EDIDIAVKAAR+AFD GPWPR+PG ERAKIM+KWA+L+DEN EE+A LD IDAG+LY+  
Sbjct: 61  EDIDIAVKAARQAFDSGPWPRLPGSERAKIMMKWADLVDENIEELAALDTIDAGKLYYIN 120

Query: 587 KAMEVPSSANTLRYYAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFT 408
           K  E+PS+ N LRYYAGAADKIHG VLK  G  HAYTLLEPIGVVGHIIPWNA +  FF 
Sbjct: 121 KVAEIPSATNALRYYAGAADKIHGDVLKMNGDFHAYTLLEPIGVVGHIIPWNAPSLSFFI 180

Query: 407 KVSPSLAAGCTMVLKPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMD 228
           KVSPSLAAGCTMVLKPAEQTPLSALF+AHLAKLAGIPDGVLN+VPGFGPTAGAAISSHMD
Sbjct: 181 KVSPSLAAGCTMVLKPAEQTPLSALFYAHLAKLAGIPDGVLNIVPGFGPTAGAAISSHMD 240

Query: 227 IDKVSFTGSVEIGREIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFN 48
           ID VSFTGS+E+GRE++QAAA SNLK VSLELGGKSPLIIF              GI+ N
Sbjct: 241 IDAVSFTGSIEVGREVLQAAAWSNLKPVSLELGGKSPLIIFNDADIDKASELALFGIMSN 300

Query: 47  KGEICCASSRVFVQE 3
           KGEIC A SRVFVQE
Sbjct: 301 KGEICVAGSRVFVQE 315


>ref|XP_020221518.1| aldehyde dehydrogenase family 2 member C4-like [Cajanus cajan]
 gb|KYP61606.1| Aldehyde dehydrogenase family 2 member C4 [Cajanus cajan]
          Length = 503

 Score =  464 bits (1195), Expect = e-160
 Identities = 233/315 (73%), Positives = 265/315 (84%), Gaps = 4/315 (1%)
 Frame = -1

Query: 935 MTDLLNSTNG----LPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTK 768
           MT L N TNG    + N+PT+ F KLFING+FVDSVSG TFETIDPRTGEVIA ISEG K
Sbjct: 1   MTSLTN-TNGDAASVNNLPTVTFTKLFINGDFVDSVSGKTFETIDPRTGEVIALISEGEK 59

Query: 767 EDIDIAVKAARKAFDLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSL 588
           EDIDIAVKAAR AFD GPWPR+PG ERAKI++KWA+L+DE+AEE+A LD IDAG+LYH  
Sbjct: 60  EDIDIAVKAARHAFDSGPWPRLPGAERAKILMKWAQLIDEHAEELAALDTIDAGKLYHMC 119

Query: 587 KAMEVPSSANTLRYYAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFT 408
           + +EVP++A+TLRYYAGAADKIHG+VLK   + HAYTLLEP+GVVGHIIPWN    +F+ 
Sbjct: 120 RNVEVPAAASTLRYYAGAADKIHGEVLKMSREFHAYTLLEPVGVVGHIIPWNFPNTMFYI 179

Query: 407 KVSPSLAAGCTMVLKPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMD 228
           KV PSLAAGCTMVLKPAEQTPLSAL+ AHLAKLAG+PDGVLNVVPGFGPTAGAA+SSHMD
Sbjct: 180 KVGPSLAAGCTMVLKPAEQTPLSALYSAHLAKLAGLPDGVLNVVPGFGPTAGAALSSHMD 239

Query: 227 IDKVSFTGSVEIGREIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFN 48
           +D VSFTGS + GREIMQAAA+SNLK V+LELGGKSPLIIF             +GIL+N
Sbjct: 240 VDAVSFTGSTKTGREIMQAAAKSNLKQVTLELGGKSPLIIFDDADIDKAAELALIGILYN 299

Query: 47  KGEICCASSRVFVQE 3
           KGE+C ASSRVFVQE
Sbjct: 300 KGEVCVASSRVFVQE 314


>ref|NP_001235519.2| aldehyde dehydrogenase superfamily protein [Glycine max]
 gb|KRH60295.1| hypothetical protein GLYMA_05G231900 [Glycine max]
          Length = 538

 Score =  465 bits (1196), Expect = e-159
 Identities = 236/315 (74%), Positives = 259/315 (82%), Gaps = 4/315 (1%)
 Frame = -1

Query: 935 MTDLLNST----NGLPNMPTIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTK 768
           M+ L NS+    N    MP IKF KLFING+FVDS+SG TFETIDPR  EVIAR+SEG K
Sbjct: 35  MSALSNSSSSHGNSFLKMPAIKFTKLFINGDFVDSISGRTFETIDPRKEEVIARVSEGDK 94

Query: 767 EDIDIAVKAARKAFDLGPWPRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSL 588
           EDIDIAVKAAR+AFD GPWPR+PG ERAKIM+KWA+L+DEN EE+A LD IDAG+LY+  
Sbjct: 95  EDIDIAVKAARQAFDSGPWPRLPGSERAKIMMKWADLVDENIEELAALDTIDAGKLYYIN 154

Query: 587 KAMEVPSSANTLRYYAGAADKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFT 408
           K  E+PS+ N LRYYAGAADKIHG VLK  G  HAYTLLEPIGVVGHIIPWNA +  FF 
Sbjct: 155 KVAEIPSATNALRYYAGAADKIHGDVLKMNGDFHAYTLLEPIGVVGHIIPWNAPSLSFFI 214

Query: 407 KVSPSLAAGCTMVLKPAEQTPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMD 228
           KVSPSLAAGCTMVLKPAEQTPLSALF+AHLAKLAGIPDGVLN+VPGFGPTAGAAISSHMD
Sbjct: 215 KVSPSLAAGCTMVLKPAEQTPLSALFYAHLAKLAGIPDGVLNIVPGFGPTAGAAISSHMD 274

Query: 227 IDKVSFTGSVEIGREIMQAAARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFN 48
           ID VSFTGS+E+GRE++QAAA SNLK VSLELGGKSPLIIF              GI+ N
Sbjct: 275 IDAVSFTGSIEVGREVLQAAAWSNLKPVSLELGGKSPLIIFNDADIDKASELALFGIMSN 334

Query: 47  KGEICCASSRVFVQE 3
           KGEIC A SRVFVQE
Sbjct: 335 KGEICVAGSRVFVQE 349


>ref|XP_016203682.1| aldehyde dehydrogenase family 2 member C4 isoform X1 [Arachis
           ipaensis]
          Length = 499

 Score =  462 bits (1190), Expect = e-159
 Identities = 228/296 (77%), Positives = 257/296 (86%)
 Frame = -1

Query: 890 TIKFNKLFINGEFVDSVSGMTFETIDPRTGEVIARISEGTKEDIDIAVKAARKAFDLGPW 711
           T+KF KLFING+FVDS+SG  FETIDPRTGEVIARISEG KEDID AV+AAR+AFD GPW
Sbjct: 15  TVKFTKLFINGQFVDSLSGSEFETIDPRTGEVIARISEGRKEDIDAAVEAAREAFDTGPW 74

Query: 710 PRMPGVERAKIMIKWAELLDENAEEIAKLDAIDAGRLYHSLKAMEVPSSANTLRYYAGAA 531
           PRMPG ERAKIM+KWA+L+D+NA E+A LDAIDAG+LY+  KA E+P++AN LRYYAGAA
Sbjct: 75  PRMPGAERAKIMLKWADLIDQNASELAALDAIDAGKLYNRCKAHEIPAAANMLRYYAGAA 134

Query: 530 DKIHGKVLKTQGQLHAYTLLEPIGVVGHIIPWNAATFLFFTKVSPSLAAGCTMVLKPAEQ 351
           DKIHG+VLK  G+ HAYTL+EP+GVVGHIIPWN  + +FF KVSP+LAAGCTMVLKPAEQ
Sbjct: 135 DKIHGEVLKASGEFHAYTLMEPVGVVGHIIPWNFPSNMFFIKVSPALAAGCTMVLKPAEQ 194

Query: 350 TPLSALFHAHLAKLAGIPDGVLNVVPGFGPTAGAAISSHMDIDKVSFTGSVEIGREIMQA 171
           TPLSALF+AHLAK AGIPDGVLNVVPGFGPTAGAAISSHM++DKVSFTGS E+GREIM A
Sbjct: 195 TPLSALFYAHLAKQAGIPDGVLNVVPGFGPTAGAAISSHMNVDKVSFTGSTEVGREIMHA 254

Query: 170 AARSNLKHVSLELGGKSPLIIFXXXXXXXXXXXXXLGILFNKGEICCASSRVFVQE 3
           AA SNLK VSLELGGKSPL+IF             LGI++NKGEIC ASSRVFVQE
Sbjct: 255 AASSNLKQVSLELGGKSPLLIFDDADVDKAASLALLGIVYNKGEICVASSRVFVQE 310


Top