BLASTX nr result
ID: Wisteria21_contig00009164
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Wisteria21_contig00009164 (1700 letters) Database: ./nr 77,306,371 sequences; 28,104,191,420 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003540247.1| PREDICTED: uncharacterized protein LOC100810... 679 0.0 ref|XP_014497137.1| PREDICTED: uncharacterized protein LOC106758... 672 0.0 gb|KOM44076.1| hypothetical protein LR48_Vigan05g168100 [Vigna a... 668 0.0 ref|XP_007150070.1| hypothetical protein PHAVU_005G123900g [Phas... 637 e-180 gb|KHN32876.1| Myb family transcription factor APL [Glycine soja... 635 e-179 ref|XP_007132437.1| hypothetical protein PHAVU_011G094300g [Phas... 558 e-156 gb|KHN21148.1| Myb family transcription factor APL [Glycine soja] 551 e-154 ref|XP_003596817.1| myb-like transcription factor family protein... 548 e-153 ref|XP_003539830.1| PREDICTED: uncharacterized protein LOC100805... 547 e-153 ref|XP_004487498.1| PREDICTED: uncharacterized protein LOC101506... 529 e-147 ref|XP_004487497.1| PREDICTED: uncharacterized protein LOC101506... 528 e-147 gb|KOM50550.1| hypothetical protein LR48_Vigan08g137700 [Vigna a... 527 e-146 ref|XP_012092686.1| PREDICTED: uncharacterized protein LOC105650... 524 e-145 ref|XP_002525443.1| transcription factor, putative [Ricinus comm... 520 e-144 ref|XP_014491719.1| PREDICTED: uncharacterized protein LOC106754... 515 e-143 ref|XP_007030697.1| Homeodomain-like superfamily protein isoform... 508 e-141 ref|XP_007030696.1| Homeodomain-like superfamily protein isoform... 508 e-141 ref|XP_002325408.2| myb family transcription factor family prote... 507 e-140 ref|XP_003539165.1| PREDICTED: uncharacterized protein LOC100781... 504 e-140 ref|XP_011034346.1| PREDICTED: uncharacterized protein LOC105132... 503 e-139 >ref|XP_003540247.1| PREDICTED: uncharacterized protein LOC100810396 [Glycine max] gi|734392541|gb|KHN27648.1| Myb family transcription factor APL [Glycine soja] gi|947077798|gb|KRH26638.1| hypothetical protein GLYMA_12G184700 [Glycine max] Length = 420 Score = 679 bits (1753), Expect = 0.0 Identities = 361/443 (81%), Positives = 386/443 (87%), Gaps = 9/443 (2%) Frame = -1 Query: 1547 MYHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFT 1368 MYHHHQHQ KNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARF Sbjct: 1 MYHHHQHQGKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFI 60 Query: 1367 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKI--SAG 1194 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVT+KI SA Sbjct: 61 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTYKITTSAS 120 Query: 1193 TGERLPETNGTHMNKLSLGPQTNKDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIEAQG 1014 TGERL ETNGTHMNKLSLGPQ NKDLHISEALQMQIEVQRRLNEQLEVQRHLQ+RIEAQG Sbjct: 121 TGERLSETNGTHMNKLSLGPQANKDLHISEALQMQIEVQRRLNEQLEVQRHLQLRIEAQG 180 Query: 1013 KYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQ 834 KYLQSVLEKAQETLGRQNLG+VG+EAAKVQLSELVSKVSSQCLNSAF+E K+LQGF PQQ Sbjct: 181 KYLQSVLEKAQETLGRQNLGVVGIEAAKVQLSELVSKVSSQCLNSAFTEPKDLQGFFPQQ 240 Query: 833 TQTNQPNDCSMDSSCLTSCDRSQKEQEIQNGGIALRHFNSHMFMERXXXXXXXXXXXXQA 654 TQTN PNDCSMD SCLTS DRSQKEQEIQNG LRHFNSH+FME +A Sbjct: 241 TQTNPPNDCSMD-SCLTSSDRSQKEQEIQNG---LRHFNSHVFMEH-----------KEA 285 Query: 653 TDAPIHNNLRNPELMWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNLSMTIGLERER 474 T+AP NNLRNPEL WC++ KK NTFL PL SKNEER +YA ESSP+NLSM+IGLERE Sbjct: 286 TEAP--NNLRNPELKWCEDGKK-NTFLAPL--SKNEERRNYAAESSPNNLSMSIGLERET 340 Query: 473 ENGISMYPERVITENHHSDGEFQHRNTARTEAMKPVED------RFP-SYFAAPRLDLNT 315 ENGI++YPER+ITE+ SDGEFQHRN + E +KPV++ R P SYFAA RLDLNT Sbjct: 341 ENGINLYPERLITES-QSDGEFQHRNRIKPETLKPVDEKVSQDYRLPASYFAAARLDLNT 399 Query: 314 RGDNEAAAASSCKQLDLNRFSWN 246 GDNE AA++CKQLDLNRFSW+ Sbjct: 400 HGDNE--AATTCKQLDLNRFSWS 420 >ref|XP_014497137.1| PREDICTED: uncharacterized protein LOC106758692 [Vigna radiata var. radiata] Length = 420 Score = 672 bits (1735), Expect = 0.0 Identities = 358/443 (80%), Positives = 380/443 (85%), Gaps = 9/443 (2%) Frame = -1 Query: 1547 MYHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFT 1368 MYHH QHQ KNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWT DLHARF Sbjct: 1 MYHHPQHQGKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTADLHARFI 60 Query: 1367 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKI--SAG 1194 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHK+ SA Sbjct: 61 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKMTTSAA 120 Query: 1193 TGERLPETNGTHMNKLSLGPQTNKDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIEAQG 1014 TGERL ET+GTHMNKLSLGPQ NKDLHISEALQMQIEVQRRLNEQLEVQRHLQ+RIEAQG Sbjct: 121 TGERLSETSGTHMNKLSLGPQANKDLHISEALQMQIEVQRRLNEQLEVQRHLQLRIEAQG 180 Query: 1013 KYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQ 834 KYLQSVLEKAQETLGRQNLG+VGLE AKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQ Sbjct: 181 KYLQSVLEKAQETLGRQNLGIVGLETAKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQ 240 Query: 833 TQTNQPNDCSMDSSCLTSCDRSQKEQEIQNGGIALRHFNSHMFMERXXXXXXXXXXXXQA 654 T TNQPNDCS+D SCLTSCDRSQKEQEIQNG RHFNSHMFME+ +A Sbjct: 241 THTNQPNDCSVD-SCLTSCDRSQKEQEIQNG---FRHFNSHMFMEQ-----------KEA 285 Query: 653 TDAPIHNNLRNPELMWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNLSMTIGLERER 474 T+AP NNLRN EL WCD+ KK NTFL PL S+NEER +YA E+ P NLSM+IGLERE Sbjct: 286 TEAP--NNLRNCELKWCDDGKK-NTFLAPL--SRNEERRNYAAETGPGNLSMSIGLERET 340 Query: 473 ENGISMYPERVITENHHSDGEFQHRNTARTEAMKPVED------RFP-SYFAAPRLDLNT 315 EN SMYPER+ITEN ++ EFQHRN +TE MKPV++ R P SYF APRLDLN Sbjct: 341 ENRSSMYPERLITEN-QTEVEFQHRNRIKTETMKPVDEKVSQDYRMPASYFVAPRLDLNN 399 Query: 314 RGDNEAAAASSCKQLDLNRFSWN 246 GDNE AA++CKQLDLNRFSW+ Sbjct: 400 HGDNE--AATTCKQLDLNRFSWS 420 >gb|KOM44076.1| hypothetical protein LR48_Vigan05g168100 [Vigna angularis] Length = 436 Score = 668 bits (1724), Expect = 0.0 Identities = 358/457 (78%), Positives = 379/457 (82%), Gaps = 23/457 (5%) Frame = -1 Query: 1547 MYHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFT 1368 MYHHHQHQ KNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWT DLHARF Sbjct: 1 MYHHHQHQGKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTADLHARFI 60 Query: 1367 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKI--SAG 1194 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSN VTHK+ SA Sbjct: 61 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNTVTHKMTTSAA 120 Query: 1193 TGERLPETNGTHMNKLSLGPQTNKDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIEAQG 1014 TGERL ET+GTHMNKLSLGPQ NKDLHISEALQMQIEVQRRLNEQLEVQRHLQ+RIEAQG Sbjct: 121 TGERLSETSGTHMNKLSLGPQANKDLHISEALQMQIEVQRRLNEQLEVQRHLQLRIEAQG 180 Query: 1013 KYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQ 834 KYLQSVLEKAQETLGRQNLG+VGLE AKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQ Sbjct: 181 KYLQSVLEKAQETLGRQNLGIVGLETAKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQ 240 Query: 833 TQTNQPNDCSMDSSCLTSCDRSQKEQEIQNGGIALRHFNSHMFMERXXXXXXXXXXXXQA 654 T TNQPNDCS+D SCLTSCDRSQKEQEIQNG RHFNSHMFME+ +A Sbjct: 241 THTNQPNDCSVD-SCLTSCDRSQKEQEIQNG---FRHFNSHMFMEQ-----------KEA 285 Query: 653 TDAPIHNNLRNPELMWCDEVKKNNTFLTPL--------------GKSKNEERSSYAVESS 516 T+AP NNLRN EL WCD+ KK NTFL PL +NEER +YA E+ Sbjct: 286 TEAP--NNLRNCELKWCDDGKK-NTFLAPLXXXXXXXXXXXXXXXXXRNEERRNYAAETG 342 Query: 515 PSNLSMTIGLERERENGISMYPERVITENHHSDGEFQHRNTARTEAMKPVED------RF 354 P NLSM+IGLERE EN SMYPER+ITEN S+GEFQHRN +TE MKPV++ R Sbjct: 343 PGNLSMSIGLERETENRSSMYPERLITEN-QSEGEFQHRNRIKTETMKPVDEKVSQDYRM 401 Query: 353 P-SYFAAPRLDLNTRGDNEAAAASSCKQLDLNRFSWN 246 P SYF A RLDLN GDNE AA++CKQLDLNRFSW+ Sbjct: 402 PASYFVAARLDLNNHGDNE--AATTCKQLDLNRFSWS 436 >ref|XP_007150070.1| hypothetical protein PHAVU_005G123900g [Phaseolus vulgaris] gi|561023334|gb|ESW22064.1| hypothetical protein PHAVU_005G123900g [Phaseolus vulgaris] Length = 430 Score = 637 bits (1643), Expect = e-180 Identities = 344/453 (75%), Positives = 372/453 (82%), Gaps = 19/453 (4%) Frame = -1 Query: 1547 MYHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFT 1368 MYHHH+HQ KNIHS+SRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARF Sbjct: 1 MYHHHRHQGKNIHSTSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFI 60 Query: 1367 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKI--SAG 1194 EAVNQLGGADKATPKTVMKLMGI GLTLYHLKSHLQKYRLSKNLHGQSNNVTHK+ SA Sbjct: 61 EAVNQLGGADKATPKTVMKLMGISGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKMTTSAT 120 Query: 1193 TGERLPETNGTHMNKLSLGPQTN----------KDLHISEALQMQIEVQRRLNEQLEVQR 1044 TGERL ET+GTHM+KLSLGPQ N KDLHI EALQMQIEVQRRLNEQLEVQ+ Sbjct: 121 TGERLSETSGTHMSKLSLGPQANNHANFQCLLSKDLHIGEALQMQIEVQRRLNEQLEVQK 180 Query: 1043 HLQVRIEAQGKYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSEL 864 HLQ+RIEAQGKYLQSVLEKAQ+TLGRQNLG++GLE AKVQLSELVSKVSSQCLNSAFSEL Sbjct: 181 HLQLRIEAQGKYLQSVLEKAQDTLGRQNLGIIGLETAKVQLSELVSKVSSQCLNSAFSEL 240 Query: 863 KELQGFCPQQTQTNQPNDCSMDSSCLTSCDRSQKEQEIQNGGIALRHFNSHMFMERXXXX 684 KELQGFCPQQT TNQPNDCSMD SCLTSCD QKEQ+IQN +LR FNSH+FME+ Sbjct: 241 KELQGFCPQQTHTNQPNDCSMD-SCLTSCDILQKEQKIQN---SLRQFNSHVFMEQ---- 292 Query: 683 XXXXXXXXQATDAPIHNNLRNPELMWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNL 504 ++TDA NNLRN EL WCD+ KK NTFL PL SK EER YA E+ P NL Sbjct: 293 -------KESTDA--RNNLRNSELKWCDDGKK-NTFLAPL--SKTEERRKYAAETGPGNL 340 Query: 503 SMTIGLERERENGISMYPERVITENHHSDGEFQHRNTARTEAMKPVED------RFP-SY 345 SM+IGLERE EN SMYPE +I E+ S+GEFQHRN +TE MK V++ R P SY Sbjct: 341 SMSIGLERETENRSSMYPESLIKES-QSEGEFQHRNRIKTETMKAVDEKVCQDYRMPASY 399 Query: 344 FAAPRLDLNTRGDNEAAAASSCKQLDLNRFSWN 246 F A RLDLN GDNE AA++CKQLDLNRFSW+ Sbjct: 400 FVATRLDLNNHGDNE--AATTCKQLDLNRFSWS 430 >gb|KHN32876.1| Myb family transcription factor APL [Glycine soja] gi|947073788|gb|KRH22679.1| hypothetical protein GLYMA_13G316600 [Glycine max] gi|947073789|gb|KRH22680.1| hypothetical protein GLYMA_13G316600 [Glycine max] Length = 400 Score = 635 bits (1638), Expect = e-179 Identities = 342/425 (80%), Positives = 361/425 (84%), Gaps = 9/425 (2%) Frame = -1 Query: 1547 MYHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFT 1368 MYHHHQHQ KNIHSSSRMPIPSERHMFLQ GNGSGDSGLVLSTDAKPRLKWTPDLHARF Sbjct: 1 MYHHHQHQGKNIHSSSRMPIPSERHMFLQAGNGSGDSGLVLSTDAKPRLKWTPDLHARFI 60 Query: 1367 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKI--SAG 1194 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKI SA Sbjct: 61 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKITTSAT 120 Query: 1193 TGERLPETNGTHMNKLSLGPQTNKDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIEAQG 1014 TGERL ETNGTHMNKLSLGPQ NKDLHISEALQMQIEVQRRLNEQLEVQRHLQ+RIEAQG Sbjct: 121 TGERLSETNGTHMNKLSLGPQANKDLHISEALQMQIEVQRRLNEQLEVQRHLQLRIEAQG 180 Query: 1013 KYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQ 834 KYLQSVLEKAQETLGRQNLG+VGLEAAKVQLSELVSKVSSQC NSAF+ELK+LQGFCPQQ Sbjct: 181 KYLQSVLEKAQETLGRQNLGIVGLEAAKVQLSELVSKVSSQCFNSAFTELKDLQGFCPQQ 240 Query: 833 TQTNQPNDCSMDSSCLTSCDRSQKEQEIQNGGIALRHFNSHMFMERXXXXXXXXXXXXQA 654 QTN PNDCSMD SC+TSCDRSQKEQEIQNG LRHF+SHMFME+ +A Sbjct: 241 PQTNPPNDCSMD-SCITSCDRSQKEQEIQNG---LRHFSSHMFMEQ-----------KEA 285 Query: 653 TDAPIHNNLRNPELMWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNLSMTIGLERER 474 +AP NNLRNPE+ W D+ KK NTFL PL SKNEER +YA E SPSNLSM+IGLERE Sbjct: 286 KEAP--NNLRNPEIKWYDDGKK-NTFLAPL--SKNEERRNYAAECSPSNLSMSIGLERET 340 Query: 473 ENGISMYPERVITENHHSDGEFQHRNTARTEAMKPVED------RFP-SYFAAPRLDLNT 315 ENG SMYPER+ITE+ SDG + + MKPV++ R P SYFAA RLDLNT Sbjct: 341 ENGSSMYPERLITES-PSDGRI------KPQTMKPVDEKVSQDYRLPTSYFAAARLDLNT 393 Query: 314 RGDNE 300 GDNE Sbjct: 394 HGDNE 398 >ref|XP_007132437.1| hypothetical protein PHAVU_011G094300g [Phaseolus vulgaris] gi|561005437|gb|ESW04431.1| hypothetical protein PHAVU_011G094300g [Phaseolus vulgaris] Length = 394 Score = 558 bits (1437), Expect = e-156 Identities = 308/445 (69%), Positives = 334/445 (75%), Gaps = 11/445 (2%) Frame = -1 Query: 1547 MYHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFT 1368 MY H QHQ KNIHSSSRMPIPSERHMFL TGNGSGDSGLVLSTDAKPRLKWTPDLHARF Sbjct: 1 MYTHLQHQGKNIHSSSRMPIPSERHMFLHTGNGSGDSGLVLSTDAKPRLKWTPDLHARFI 60 Query: 1367 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKI----S 1200 EAV QLGGADKATPKTVMKLMGI GLTLYHLKSHLQKYRLSK+LHGQSNN THKI Sbjct: 61 EAVQQLGGADKATPKTVMKLMGISGLTLYHLKSHLQKYRLSKSLHGQSNNATHKICINPG 120 Query: 1199 AGTGERLPETNGTHMNKLSLGPQT-NKDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIE 1023 A T ERL E NGTH+N L+L PQ+ NKDLHISEALQMQIEVQRRLNEQLEVQR LQ+RIE Sbjct: 121 AATDERLRENNGTHVNNLNLAPQSNNKDLHISEALQMQIEVQRRLNEQLEVQRLLQLRIE 180 Query: 1022 AQGKYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQGFC 843 AQGKYLQ+VLEKAQETLGRQNLG+VGLEAAK+QLS+LVSKVSSQCLNSAF E+KELQGF Sbjct: 181 AQGKYLQAVLEKAQETLGRQNLGVVGLEAAKLQLSDLVSKVSSQCLNSAFLEMKELQGFS 240 Query: 842 PQQTQTNQPNDCSMDSSCLTSCDRSQKEQEIQNGGIALRHFNSHMFMERXXXXXXXXXXX 663 P QTQTNQPNDCSMD SCLTSC+ SQKEQEIQNGG++LR FN H FMER Sbjct: 241 PHQTQTNQPNDCSMD-SCLTSCEVSQKEQEIQNGGMSLRPFNVHTFMER----------- 288 Query: 662 XQATDAPIHNNLRNPELMWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNLSMTIGLE 483 + + P NNL N +L WCD VK NTFLTPL + SPSNLSM+IGLE Sbjct: 289 KEVIEGPNLNNLPNTDLKWCDPVK--NTFLTPLSR------------RSPSNLSMSIGLE 334 Query: 482 RERENGISMYPERVITENHHSDGEFQHRNTARTEAMKPVEDR------FPSYFAAPRLDL 321 E ENG +T R E++KPV ++ PS+FAAP+LDL Sbjct: 335 GETENG----------------------STIRKESVKPVAEKVSQDYGLPSFFAAPKLDL 372 Query: 320 NTRGDNEAAAASSCKQLDLNRFSWN 246 T N SCK+LDLN FSWN Sbjct: 373 TTEDKN---TRKSCKELDLNGFSWN 394 >gb|KHN21148.1| Myb family transcription factor APL [Glycine soja] Length = 401 Score = 551 bits (1420), Expect = e-154 Identities = 312/446 (69%), Positives = 340/446 (76%), Gaps = 12/446 (2%) Frame = -1 Query: 1547 MYHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFT 1368 MY H QHQ KNIHSSSRMPIPSER MFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARF Sbjct: 1 MYTHQQHQGKNIHSSSRMPIPSERQMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFI 60 Query: 1367 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKISAGTG 1188 EAV QLGGADKATPKTVMKL+GIPGLTLYHLKSHLQKYRLSK+LHGQSNN+THKISA T Sbjct: 61 EAVQQLGGADKATPKTVMKLIGIPGLTLYHLKSHLQKYRLSKSLHGQSNNMTHKISAATD 120 Query: 1187 ERLPETNGTHMNKLSLGPQT-NKDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIEAQGK 1011 ERL E NGTHMN L+L PQ+ NKDL+ISEAL MQIE QRRLNEQLEVQR LQ+RIEAQGK Sbjct: 121 ERLRENNGTHMNSLNLAPQSNNKDLYISEALHMQIEEQRRLNEQLEVQRLLQLRIEAQGK 180 Query: 1010 YLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSE-LKELQGFCP-Q 837 YLQ+VLEKAQETLGRQNLG VGLEA K+QLSELVSKVSSQCLNSAFS+ LKE+QG P Q Sbjct: 181 YLQAVLEKAQETLGRQNLGAVGLEATKLQLSELVSKVSSQCLNSAFSDRLKEIQGCSPHQ 240 Query: 836 QTQTNQP--NDCSMDSSCLTSCDRSQKEQEIQNGGIALRHFNSHMFMERXXXXXXXXXXX 663 QTQTNQP NDCSMD SCLTSC+ SQKEQEIQNGG++LR FN H FMER Sbjct: 241 QTQTNQPNTNDCSMD-SCLTSCEGSQKEQEIQNGGMSLRPFNVHTFMER----------- 288 Query: 662 XQATDAPIHNNLRNPELMWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNLSMTIGLE 483 + + P NNL N +L WCD VKK NTFLTPL S +A + SPSNLSM+IGLE Sbjct: 289 KEVIEGPNLNNLPNTDLNWCDPVKK-NTFLTPL--------SMHADKRSPSNLSMSIGLE 339 Query: 482 RERENGISMYPERVITENHHSDGEFQHRNTARTEAMKPVEDR------FPS-YFAAPRLD 324 E ENG +T RTE++KPV D+ PS YFAA +LD Sbjct: 340 GETENG----------------------STIRTESVKPVADKVSQDYGLPSNYFAASKLD 377 Query: 323 LNTRGDNEAAAASSCKQLDLNRFSWN 246 L T + + +SCKQLDLN FSWN Sbjct: 378 LTTEDNKD--TKTSCKQLDLNGFSWN 401 >ref|XP_003596817.1| myb-like transcription factor family protein [Medicago truncatula] gi|355485865|gb|AES67068.1| myb-like transcription factor family protein [Medicago truncatula] gi|388517363|gb|AFK46743.1| unknown [Medicago truncatula] Length = 389 Score = 548 bits (1413), Expect = e-153 Identities = 319/449 (71%), Positives = 338/449 (75%), Gaps = 15/449 (3%) Frame = -1 Query: 1547 MYHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFT 1368 MYHHH HQ KNIHSSSRM IPSERHMFLQTGNGS DSGLVLSTDAKPRLKWTPDLHARF Sbjct: 1 MYHHH-HQGKNIHSSSRMSIPSERHMFLQTGNGSSDSGLVLSTDAKPRLKWTPDLHARFI 59 Query: 1367 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ-SNNVTHKI---- 1203 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ S+NVTHKI Sbjct: 60 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSSSNVTHKINTHA 119 Query: 1202 SAGTGERLPETNGTHMNKLSLGPQT--NKDLHISEALQMQIEVQRRLNEQLEVQRHLQVR 1029 ++ + ERL ETNGTHMNKL+LGPQT NKDLHISEALQMQIEVQRRLNEQLEVQRHLQ+R Sbjct: 120 TSVSDERLSETNGTHMNKLTLGPQTNNNKDLHISEALQMQIEVQRRLNEQLEVQRHLQLR 179 Query: 1028 IEAQGKYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQG 849 IEAQGKYLQSVLEKAQETLGRQNLG+VGLEAAKVQLSELVSKVSSQCLNS FSE+KELQG Sbjct: 180 IEAQGKYLQSVLEKAQETLGRQNLGIVGLEAAKVQLSELVSKVSSQCLNSTFSEMKELQG 239 Query: 848 FCPQQTQTNQPNDCSMDSSCLTSCDRSQKEQE-IQNGGIALRHF----NSHMFMERXXXX 684 FCP QPND SMDSSCLTS DRSQKEQE IQNGG LRHF N+H+FMER Sbjct: 240 FCP------QPNDGSMDSSCLTSSDRSQKEQEIIQNGGFGLRHFNNNNNNHVFMER---- 289 Query: 683 XXXXXXXXQATD-APIHNNLRNPE-LMWC-DEVKKNNTFLTPLGKSKNEERSSYAVESSP 513 QAT+ A NLRN E L WC +EVKKN+ FLTPLG + ER+ Sbjct: 290 -----KEQQATELAGSVQNLRNNEVLKWCVEEVKKNSNFLTPLGNNNELERNH------- 337 Query: 512 SNLSMTIGLERERENGISMYPERVITENHHSDGEFQHRNTARTEAMKPVEDRFPSYFAAP 333 NLSM IG+ ENH GEFQ RNTA Sbjct: 338 GNLSMNIGV-----------------ENHLDIGEFQQRNTA------------------- 361 Query: 332 RLDLNTRGDNEAAAASSCKQLDLNRFSWN 246 RLDLN+RGDN A++CKQLDLNRFSWN Sbjct: 362 RLDLNSRGDNN-EGATTCKQLDLNRFSWN 389 >ref|XP_003539830.1| PREDICTED: uncharacterized protein LOC100805237 isoformX1 [Glycine max] gi|571492729|ref|XP_006592327.1| PREDICTED: uncharacterized protein LOC100805237 isoform X2 [Glycine max] gi|947076393|gb|KRH25233.1| hypothetical protein GLYMA_12G089100 [Glycine max] gi|947076394|gb|KRH25234.1| hypothetical protein GLYMA_12G089100 [Glycine max] Length = 405 Score = 547 bits (1410), Expect = e-153 Identities = 312/450 (69%), Positives = 341/450 (75%), Gaps = 16/450 (3%) Frame = -1 Query: 1547 MYHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFT 1368 MY H QHQ KNIHSSSRMPIPSER MFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARF Sbjct: 1 MYTHQQHQGKNIHSSSRMPIPSERQMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFI 60 Query: 1367 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKIS---- 1200 EAV QLGGADKATPKTVMKL+GIPGLTLYHLKSHLQKYRLSK+LHGQSNN+THKI+ Sbjct: 61 EAVQQLGGADKATPKTVMKLIGIPGLTLYHLKSHLQKYRLSKSLHGQSNNMTHKITINSG 120 Query: 1199 AGTGERLPETNGTHMNKLSLGPQT-NKDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIE 1023 A T ERL E NGTHMN L+L PQ+ NKDL+ISEAL MQIE QRRLNEQLEVQR LQ+RIE Sbjct: 121 AATDERLRENNGTHMNSLNLAPQSNNKDLYISEALHMQIEEQRRLNEQLEVQRLLQLRIE 180 Query: 1022 AQGKYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSE-LKELQGF 846 AQGKYLQ+VLEKAQETLGRQNLG VGLEA K+QLSELVSKVSSQCLNSAFS+ LKE+QGF Sbjct: 181 AQGKYLQAVLEKAQETLGRQNLGAVGLEATKLQLSELVSKVSSQCLNSAFSDRLKEIQGF 240 Query: 845 CP-QQTQTNQP--NDCSMDSSCLTSCDRSQKEQEIQNGGIALRHFNSHMFMERXXXXXXX 675 P QQTQTNQP NDCSMD SCLTSC+ SQKEQEIQNGG++LR FN H FMER Sbjct: 241 SPHQQTQTNQPNTNDCSMD-SCLTSCEGSQKEQEIQNGGMSLRPFNVHTFMER------- 292 Query: 674 XXXXXQATDAPIHNNLRNPELMWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNLSMT 495 + + P NNL N +L WCD VKK NTFLTPL S +A + SPSNLSM+ Sbjct: 293 ----KEVIEGPNLNNLPNTDLNWCDPVKK-NTFLTPL--------SMHADKRSPSNLSMS 339 Query: 494 IGLERERENGISMYPERVITENHHSDGEFQHRNTARTEAMKPVEDR------FPS-YFAA 336 IGLE E ENG +T RTE++KPV D+ PS YFAA Sbjct: 340 IGLEGETENG----------------------STIRTESVKPVADKVSQDYGLPSNYFAA 377 Query: 335 PRLDLNTRGDNEAAAASSCKQLDLNRFSWN 246 +LDL T + + +SCKQLDLN FSWN Sbjct: 378 SKLDLTTEDNKD--TKTSCKQLDLNGFSWN 405 >ref|XP_004487498.1| PREDICTED: uncharacterized protein LOC101506129 isoform X3 [Cicer arietinum] Length = 409 Score = 529 bits (1362), Expect = e-147 Identities = 315/458 (68%), Positives = 335/458 (73%), Gaps = 26/458 (5%) Frame = -1 Query: 1541 HHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFTEA 1362 HHHQHQ KNIHSSSRM IPSERHMFLQTGNGS DSGLVLSTDAKPRLKWTPDLHARF EA Sbjct: 5 HHHQHQGKNIHSSSRMSIPSERHMFLQTGNGSSDSGLVLSTDAKPRLKWTPDLHARFIEA 64 Query: 1361 VNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQS--NNVTHKI----S 1200 VNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQS NNVTHKI + Sbjct: 65 VNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNNNVTHKINTHAT 124 Query: 1199 AGTGER-LPETNGTHMNKL-SLGPQT-NKDLHISEALQMQIEVQRRLNEQLEVQRHLQVR 1029 +GT ER L ETNGTHMNKL +LGPQT NKD+HISEALQMQIEVQRRLNEQLEVQRHLQ+R Sbjct: 125 SGTDERSLSETNGTHMNKLITLGPQTNNKDIHISEALQMQIEVQRRLNEQLEVQRHLQLR 184 Query: 1028 IEAQGKYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQG 849 IEAQGKYLQSVLEKAQETLGRQNLG+VGLEAAKVQLSELVSKVSSQ LNSAFSE+KELQG Sbjct: 185 IEAQGKYLQSVLEKAQETLGRQNLGIVGLEAAKVQLSELVSKVSSQSLNSAFSEMKELQG 244 Query: 848 F--CPQQTQ--TNQPNDCSMDSSCLTSCDRSQKEQE--IQN-GGIALRHF---NSHMFME 699 F QQTQ TNQPNDCSMDSSCLTS DRSQKEQ+ IQN GG LRHF N+H+FME Sbjct: 245 FNNPQQQTQITTNQPNDCSMDSSCLTSSDRSQKEQQEIIQNGGGFGLRHFNNNNNHLFME 304 Query: 698 RXXXXXXXXXXXXQATDAPIHNNLRNPE-LMWC-DEVKKNNTFLTPLGKSKNEERSSYAV 525 R + N+RN E L WC DEVKKN + Sbjct: 305 R------------KELQQVTETNIRNSEVLKWCSDEVKKNTS------------------ 334 Query: 524 ESSPSNLSMTIGLERERENGISMYPERVITENHHSDGEFQHRNTARTEAMKPVEDRFP-- 351 + NLSM+IG+E HSDGEFQ R + + MK D Sbjct: 335 TNFHGNLSMSIGVE------------------SHSDGEFQ-RTSRTDQTMKLAMDDHKMQ 375 Query: 350 ---SYFAAPRLDLNTRGDNEAAAASSCKQLDLNRFSWN 246 SYF+ RLDLN+RGDN CKQLDLNRFSWN Sbjct: 376 PSNSYFSTSRLDLNSRGDNN----EGCKQLDLNRFSWN 409 >ref|XP_004487497.1| PREDICTED: uncharacterized protein LOC101506129 isoform X1 [Cicer arietinum] gi|828290007|ref|XP_012572462.1| PREDICTED: uncharacterized protein LOC101506129 isoform X2 [Cicer arietinum] Length = 410 Score = 528 bits (1361), Expect = e-147 Identities = 315/459 (68%), Positives = 335/459 (72%), Gaps = 27/459 (5%) Frame = -1 Query: 1541 HHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFTEA 1362 HHHQHQ KNIHSSSRM IPSERHMFLQTGNGS DSGLVLSTDAKPRLKWTPDLHARF EA Sbjct: 5 HHHQHQGKNIHSSSRMSIPSERHMFLQTGNGSSDSGLVLSTDAKPRLKWTPDLHARFIEA 64 Query: 1361 VNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQS--NNVTHKI----- 1203 VNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQS NNVTHKI Sbjct: 65 VNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNNNVTHKITDTHA 124 Query: 1202 SAGTGER-LPETNGTHMNKL-SLGPQT-NKDLHISEALQMQIEVQRRLNEQLEVQRHLQV 1032 ++GT ER L ETNGTHMNKL +LGPQT NKD+HISEALQMQIEVQRRLNEQLEVQRHLQ+ Sbjct: 125 TSGTDERSLSETNGTHMNKLITLGPQTNNKDIHISEALQMQIEVQRRLNEQLEVQRHLQL 184 Query: 1031 RIEAQGKYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQ 852 RIEAQGKYLQSVLEKAQETLGRQNLG+VGLEAAKVQLSELVSKVSSQ LNSAFSE+KELQ Sbjct: 185 RIEAQGKYLQSVLEKAQETLGRQNLGIVGLEAAKVQLSELVSKVSSQSLNSAFSEMKELQ 244 Query: 851 GF--CPQQTQ--TNQPNDCSMDSSCLTSCDRSQKEQE--IQN-GGIALRHF---NSHMFM 702 GF QQTQ TNQPNDCSMDSSCLTS DRSQKEQ+ IQN GG LRHF N+H+FM Sbjct: 245 GFNNPQQQTQITTNQPNDCSMDSSCLTSSDRSQKEQQEIIQNGGGFGLRHFNNNNNHLFM 304 Query: 701 ERXXXXXXXXXXXXQATDAPIHNNLRNPE-LMWC-DEVKKNNTFLTPLGKSKNEERSSYA 528 ER + N+RN E L WC DEVKKN + Sbjct: 305 ER------------KELQQVTETNIRNSEVLKWCSDEVKKNTS----------------- 335 Query: 527 VESSPSNLSMTIGLERERENGISMYPERVITENHHSDGEFQHRNTARTEAMKPVEDRFP- 351 + NLSM+IG+E HSDGEFQ R + + MK D Sbjct: 336 -TNFHGNLSMSIGVE------------------SHSDGEFQ-RTSRTDQTMKLAMDDHKM 375 Query: 350 ----SYFAAPRLDLNTRGDNEAAAASSCKQLDLNRFSWN 246 SYF+ RLDLN+RGDN CKQLDLNRFSWN Sbjct: 376 QPSNSYFSTSRLDLNSRGDNN----EGCKQLDLNRFSWN 410 >gb|KOM50550.1| hypothetical protein LR48_Vigan08g137700 [Vigna angularis] Length = 376 Score = 527 bits (1357), Expect = e-146 Identities = 295/428 (68%), Positives = 319/428 (74%), Gaps = 11/428 (2%) Frame = -1 Query: 1496 MPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFTEAVNQLGGADKATPKTV 1317 MPIPSERHMFL TGNGSGDS LVLSTDAKPRLKWTPDLHARF EAV QLGGADKATPKTV Sbjct: 1 MPIPSERHMFLHTGNGSGDSSLVLSTDAKPRLKWTPDLHARFIEAVQQLGGADKATPKTV 60 Query: 1316 MKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKIS----AGTGERLPETNGTHMNK 1149 MKLMGI GLTLYHLKSHLQKYRLSK+LHGQSNNVTHKIS A T ERL E NGTH+N Sbjct: 61 MKLMGISGLTLYHLKSHLQKYRLSKSLHGQSNNVTHKISINPGAATDERLRENNGTHVNN 120 Query: 1148 LSLGPQTN-KDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIEAQGKYLQSVLEKAQETL 972 L+L PQ+N KDLHISEALQMQIEVQRRLNEQLEVQR LQ+RIEAQGKYLQ+VLEKAQETL Sbjct: 121 LNLAPQSNNKDLHISEALQMQIEVQRRLNEQLEVQRLLQLRIEAQGKYLQAVLEKAQETL 180 Query: 971 GRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQTQTNQPNDCSMDSS 792 GRQNLG+VGLEAAK+QLS+LVSKVSSQCLNSAF ELKELQGF P QTQTNQPNDCSMD S Sbjct: 181 GRQNLGVVGLEAAKLQLSDLVSKVSSQCLNSAFLELKELQGFSPHQTQTNQPNDCSMD-S 239 Query: 791 CLTSCDRSQKEQEIQNGGIALRHFNSHMFMERXXXXXXXXXXXXQATDAPIHNNLRNPEL 612 CLTSC+ SQKEQEIQNGG++LR FN H FMER + + P NNL + +L Sbjct: 240 CLTSCEVSQKEQEIQNGGMSLRPFNVHTFMER-----------KELIEGPNLNNLPSTDL 288 Query: 611 MWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNLSMTIGLERERENGISMYPERVITE 432 WCD VK NTFLTPL + SPSNLSM+IGLE E ENG Sbjct: 289 KWCDPVK--NTFLTPLSR------------RSPSNLSMSIGLEGETENG----------- 323 Query: 431 NHHSDGEFQHRNTARTEAMKPVEDR------FPSYFAAPRLDLNTRGDNEAAAASSCKQL 270 +T R E++K V ++ PSYFAAP LDL T N SCK+L Sbjct: 324 -----------STIRKESVKSVAEKVSQDYGLPSYFAAPNLDLTTEDKNN----RSCKEL 368 Query: 269 DLNRFSWN 246 DLN FSWN Sbjct: 369 DLNGFSWN 376 >ref|XP_012092686.1| PREDICTED: uncharacterized protein LOC105650401 [Jatropha curcas] gi|802796130|ref|XP_012092687.1| PREDICTED: uncharacterized protein LOC105650401 [Jatropha curcas] gi|643699885|gb|KDP20269.1| hypothetical protein JCGZ_06855 [Jatropha curcas] Length = 417 Score = 524 bits (1349), Expect = e-145 Identities = 286/440 (65%), Positives = 328/440 (74%), Gaps = 6/440 (1%) Frame = -1 Query: 1547 MYHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFT 1368 MYHHHQHQ K++HSSSRMPIP ERH+FLQ GNG GDSGLVLSTDAKPRLKWTPDLH RF Sbjct: 1 MYHHHQHQGKSVHSSSRMPIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 60 Query: 1367 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKISAG-- 1194 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ+N+ + KI A Sbjct: 61 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANSGSSKIGAAAV 120 Query: 1193 TGERLPETNGTHMNKLSLGPQTNKDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIEAQG 1014 T +R+ E N THMN LS+GPQTNK LHISEALQMQIEVQRRL+EQLEVQRHLQ+RIEAQG Sbjct: 121 TSDRMSEANVTHMNNLSIGPQTNKSLHISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQG 180 Query: 1013 KYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQ 834 KYLQ+VLEKAQETLGRQNLG +GLEAAKVQLSELVSKVS+QCLNSAFSELKELQG CPQQ Sbjct: 181 KYLQAVLEKAQETLGRQNLGTMGLEAAKVQLSELVSKVSTQCLNSAFSELKELQGLCPQQ 240 Query: 833 TQTNQPNDCSMDSSCLTSCDRSQKEQEIQNGGIALRHFNSHMFMERXXXXXXXXXXXXQA 654 TQT P DCS+D SCLTSC+ SQKEQEI N G+ LR +N +E + Sbjct: 241 TQTTPPTDCSVD-SCLTSCEGSQKEQEIHNTGMGLRPYNGSALLE--------------S 285 Query: 653 TDAPIHNNLRNPELMWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNLSMTIGLERER 474 D + L EL W ++ N FL+P+G N ER ++ E S S+LSM +GL+ E Sbjct: 286 KDMAEEHMLHQTELKWGED---NKMFLSPMG--NNAERRIFSSERSSSDLSMRVGLQGES 340 Query: 473 ENGISMYPERVITENHHSDGEFQHRNTARTEAMK-PVEDRFPSY---FAAPRLDLNTRGD 306 N S + E E + D +F + +++K E+ P Y + A +LDLNT Sbjct: 341 RNPCSGFSEGRYKE-RNDDDKFPDQTKKTADSVKLQNENISPGYRLPYFATKLDLNTH-- 397 Query: 305 NEAAAASSCKQLDLNRFSWN 246 +E AAS+CKQLDLN FSWN Sbjct: 398 DEIDAASTCKQLDLNGFSWN 417 >ref|XP_002525443.1| transcription factor, putative [Ricinus communis] gi|223535256|gb|EEF36933.1| transcription factor, putative [Ricinus communis] Length = 419 Score = 520 bits (1339), Expect = e-144 Identities = 283/439 (64%), Positives = 321/439 (73%), Gaps = 5/439 (1%) Frame = -1 Query: 1547 MYHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFT 1368 MYHHHQHQ K++HSSSRM IP ERH+FLQ GNG GDSGLVLSTDAKPRLKWT DLH F Sbjct: 1 MYHHHQHQGKSVHSSSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTSDLHEHFI 60 Query: 1367 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKISAGT- 1191 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ+N+ ++KI G Sbjct: 61 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANSGSNKIGTGAV 120 Query: 1190 -GERLPETNGTHMNKLSLGPQTNKDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIEAQG 1014 G+R+ ETN TH+N LS+G QTNK LHI EALQMQIEVQRRL+EQLEVQRHLQ+RIEAQG Sbjct: 121 VGDRISETNVTHINNLSMGTQTNKGLHIGEALQMQIEVQRRLHEQLEVQRHLQLRIEAQG 180 Query: 1013 KYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQ 834 KYLQSVLEKAQETLGRQNLG +GLEAAKVQLSELVSKVS+QCLNSAFSELKELQG C QQ Sbjct: 181 KYLQSVLEKAQETLGRQNLGSIGLEAAKVQLSELVSKVSTQCLNSAFSELKELQGLCHQQ 240 Query: 833 TQTNQPNDCSMDSSCLTSCDRSQKEQEIQNGGIALRHFNSHMFMERXXXXXXXXXXXXQA 654 TQT P DCSMD SCLTSC+ SQKEQEI N G+ LR +N + +E + Sbjct: 241 TQTAPPTDCSMD-SCLTSCEGSQKEQEIHNTGMGLRPYNGNALLE--------------S 285 Query: 653 TDAPIHNNLRNPELMWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNLSMTIGLERER 474 D + L EL W +++K N FL+PLG N R ++A E S S+LSMT+GL+ E Sbjct: 286 KDITEGHVLHQTELKWSEDLKDNKMFLSPLG--NNAARRNFAAERSTSDLSMTVGLQGEN 343 Query: 473 ENGISMYPERVITENHHSDGEFQHRNTARTEAMKPVEDRFPSY---FAAPRLDLNTRGDN 303 N S + E + + D N + P D Y + A +LDLN+ Sbjct: 344 GNA-SSFSEGRYKDRNDGDSFPDQTNKSLDSVKLPKGDVSQGYRLPYFATKLDLNSH--E 400 Query: 302 EAAAASSCKQLDLNRFSWN 246 E AASSCKQLDLN FSWN Sbjct: 401 EIDAASSCKQLDLNGFSWN 419 >ref|XP_014491719.1| PREDICTED: uncharacterized protein LOC106754229 [Vigna radiata var. radiata] Length = 376 Score = 515 bits (1327), Expect = e-143 Identities = 292/423 (69%), Positives = 318/423 (75%), Gaps = 6/423 (1%) Frame = -1 Query: 1496 MPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFTEAVNQLGGADKATPKTV 1317 MPIPSERHMFL TGNGSGDS LVLSTDAKPRLKWTPDLHARF EAV QLGGADKATPKTV Sbjct: 1 MPIPSERHMFLHTGNGSGDSSLVLSTDAKPRLKWTPDLHARFIEAVQQLGGADKATPKTV 60 Query: 1316 MKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKIS----AGTGERLPETNGTHMNK 1149 MKLMGI GLTLYHLKSHLQKYRLSK+LHGQSN VTHKIS A T ERL E NGTH+N Sbjct: 61 MKLMGISGLTLYHLKSHLQKYRLSKSLHGQSN-VTHKISINPGAATDERLRENNGTHVNN 119 Query: 1148 LSLGPQTN-KDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIEAQGKYLQSVLEKAQETL 972 L+L PQ+N KDLHI EALQMQIEVQRRLNEQLEVQR LQ+RIEAQGKYLQ+VLEKAQETL Sbjct: 120 LNLAPQSNNKDLHIGEALQMQIEVQRRLNEQLEVQRLLQLRIEAQGKYLQAVLEKAQETL 179 Query: 971 GRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQTQTNQPNDCSMDSS 792 GRQNLG+VGLEAAK+QLS+LVSKVSSQCLNSAF ELKELQGF QTQTNQPNDCSMD S Sbjct: 180 GRQNLGVVGLEAAKLQLSDLVSKVSSQCLNSAFLELKELQGFSTHQTQTNQPNDCSMD-S 238 Query: 791 CLTSCDRSQKEQEIQNGGIALRHFNSHMFMERXXXXXXXXXXXXQATDAPIHNNLRNPEL 612 CLTSC+ SQKEQEIQNGG++LR FN H FMER + + P NNL + +L Sbjct: 239 CLTSCEVSQKEQEIQNGGMSLRPFNVHTFMER-----------KELIEGPNLNNLPSTDL 287 Query: 611 MWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNLSMTIGLERERENGISMYPERVITE 432 WCD VK NTFLTPL + SPSNLSM+IGLE E ENG ++ E V Sbjct: 288 KWCDPVK--NTFLTPLSR------------RSPSNLSMSIGLEGETENGSTIRKESV--- 330 Query: 431 NHHSDGEFQHRNTARTEAMKPVED-RFPSYFAAPRLDLNTRGDNEAAAASSCKQLDLNRF 255 ++ A K +D PSYFAAP+LDL T N +SCK+LDLN F Sbjct: 331 --------------KSMAEKVSQDYGLPSYFAAPKLDLTTEVKNN---RTSCKELDLNGF 373 Query: 254 SWN 246 SWN Sbjct: 374 SWN 376 >ref|XP_007030697.1| Homeodomain-like superfamily protein isoform 2 [Theobroma cacao] gi|590643063|ref|XP_007030698.1| Homeodomain-like superfamily protein isoform 2 [Theobroma cacao] gi|508719302|gb|EOY11199.1| Homeodomain-like superfamily protein isoform 2 [Theobroma cacao] gi|508719303|gb|EOY11200.1| Homeodomain-like superfamily protein isoform 2 [Theobroma cacao] Length = 414 Score = 508 bits (1307), Expect = e-141 Identities = 281/438 (64%), Positives = 326/438 (74%), Gaps = 5/438 (1%) Frame = -1 Query: 1544 YHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFTE 1365 +HHHQHQ KNIH SSRMPIP ERH+FLQ GNG GDSGLVLSTDAKPRLKWTPDLH RF E Sbjct: 3 HHHHQHQGKNIHPSSRMPIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIE 62 Query: 1364 AVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKISAGT-- 1191 AVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ+NN ++KI A Sbjct: 63 AVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANNGSNKIGAVAMA 122 Query: 1190 GERLPETNGTHMNKLSLGPQTNKDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIEAQGK 1011 G+R+ E NGTH+N LS+GPQ N L I EALQMQIEVQRRL+EQLEVQRHLQ+RIEAQGK Sbjct: 123 GDRMSEANGTHVNNLSIGPQANNGLQIGEALQMQIEVQRRLHEQLEVQRHLQLRIEAQGK 182 Query: 1010 YLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQT 831 YLQ+VLEKAQETLGRQNLG VGLEAAKVQLSELVSKVS+QCLNSAFS+LK+LQG CPQQT Sbjct: 183 YLQAVLEKAQETLGRQNLGSVGLEAAKVQLSELVSKVSNQCLNSAFSDLKDLQGLCPQQT 242 Query: 830 QTNQPNDCSMDSSCLTSCDRSQKEQEIQNGGIALRHFN-SHMFMERXXXXXXXXXXXXQA 654 Q P DCSMD SCLTSC+ SQKEQEI N G+ LR +N S +E+ + Sbjct: 243 QATPPTDCSMD-SCLTSCEGSQKEQEIHNNGMCLRPYNTSGALLEQ-----------REI 290 Query: 653 TDAPIHNNLRNPELMWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNLSMTIGLERER 474 + P+ L EL +++K+N FL+ LG K+ ER + + S S+LSM++GL+ E+ Sbjct: 291 AEDPL---LPQTELKSFEDIKENKMFLSSLG--KDAERRMFFADRSSSDLSMSVGLQGEK 345 Query: 473 ENG--ISMYPERVITENHHSDGEFQHRNTARTEAMKPVEDRFPSYFAAPRLDLNTRGDNE 300 NG S + E + + D F R R + + +R P YFA +LDLN +N+ Sbjct: 346 GNGGNSSSFSEAKF-KGRNEDDSFLDRGNKRADEV----NRLP-YFAT-KLDLNVHEEND 398 Query: 299 AAAASSCKQLDLNRFSWN 246 AASSCKQ DLN SWN Sbjct: 399 --AASSCKQFDLNGLSWN 414 >ref|XP_007030696.1| Homeodomain-like superfamily protein isoform 1 [Theobroma cacao] gi|508719301|gb|EOY11198.1| Homeodomain-like superfamily protein isoform 1 [Theobroma cacao] Length = 478 Score = 508 bits (1307), Expect = e-141 Identities = 281/438 (64%), Positives = 326/438 (74%), Gaps = 5/438 (1%) Frame = -1 Query: 1544 YHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFTE 1365 +HHHQHQ KNIH SSRMPIP ERH+FLQ GNG GDSGLVLSTDAKPRLKWTPDLH RF E Sbjct: 67 HHHHQHQGKNIHPSSRMPIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIE 126 Query: 1364 AVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKISAGT-- 1191 AVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ+NN ++KI A Sbjct: 127 AVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANNGSNKIGAVAMA 186 Query: 1190 GERLPETNGTHMNKLSLGPQTNKDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIEAQGK 1011 G+R+ E NGTH+N LS+GPQ N L I EALQMQIEVQRRL+EQLEVQRHLQ+RIEAQGK Sbjct: 187 GDRMSEANGTHVNNLSIGPQANNGLQIGEALQMQIEVQRRLHEQLEVQRHLQLRIEAQGK 246 Query: 1010 YLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQT 831 YLQ+VLEKAQETLGRQNLG VGLEAAKVQLSELVSKVS+QCLNSAFS+LK+LQG CPQQT Sbjct: 247 YLQAVLEKAQETLGRQNLGSVGLEAAKVQLSELVSKVSNQCLNSAFSDLKDLQGLCPQQT 306 Query: 830 QTNQPNDCSMDSSCLTSCDRSQKEQEIQNGGIALRHFN-SHMFMERXXXXXXXXXXXXQA 654 Q P DCSMD SCLTSC+ SQKEQEI N G+ LR +N S +E+ + Sbjct: 307 QATPPTDCSMD-SCLTSCEGSQKEQEIHNNGMCLRPYNTSGALLEQ-----------REI 354 Query: 653 TDAPIHNNLRNPELMWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNLSMTIGLERER 474 + P+ L EL +++K+N FL+ LG K+ ER + + S S+LSM++GL+ E+ Sbjct: 355 AEDPL---LPQTELKSFEDIKENKMFLSSLG--KDAERRMFFADRSSSDLSMSVGLQGEK 409 Query: 473 ENG--ISMYPERVITENHHSDGEFQHRNTARTEAMKPVEDRFPSYFAAPRLDLNTRGDNE 300 NG S + E + + D F R R + + +R P YFA +LDLN +N+ Sbjct: 410 GNGGNSSSFSEAKF-KGRNEDDSFLDRGNKRADEV----NRLP-YFAT-KLDLNVHEEND 462 Query: 299 AAAASSCKQLDLNRFSWN 246 AASSCKQ DLN SWN Sbjct: 463 --AASSCKQFDLNGLSWN 478 >ref|XP_002325408.2| myb family transcription factor family protein [Populus trichocarpa] gi|550316805|gb|EEE99789.2| myb family transcription factor family protein [Populus trichocarpa] Length = 420 Score = 507 bits (1306), Expect = e-140 Identities = 276/440 (62%), Positives = 321/440 (72%), Gaps = 6/440 (1%) Frame = -1 Query: 1547 MYHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFT 1368 MY HHQHQ KNIHSSSR IP ERH+FLQ GNG GDSGLVLSTDAKPRLKWTPDLH RF Sbjct: 1 MYQHHQHQGKNIHSSSRNSIPPERHLFLQVGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 60 Query: 1367 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKIS--AG 1194 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ+N+ ++K A Sbjct: 61 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANSGSNKSGTVAV 120 Query: 1193 TGERLPETNGTHMNKLSLGPQTNKDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIEAQG 1014 G+R+PE N TH+N LS+G QTNK LH SEALQ+QIEVQRRL+EQLEVQRHLQ+RIEAQG Sbjct: 121 VGDRMPEVNATHINNLSIGSQTNKSLHFSEALQVQIEVQRRLHEQLEVQRHLQLRIEAQG 180 Query: 1013 KYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQGFCPQQ 834 KYLQSVLEKAQETLGRQNLG VGLEAAKVQLSELVSKVSS+CLNSAFSELK+LQG CP Sbjct: 181 KYLQSVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSSKCLNSAFSELKDLQGLCPPL 240 Query: 833 TQTNQPNDCSMDSSCLTSCDRSQKEQEIQNGGIALRHFNSHMFMERXXXXXXXXXXXXQA 654 TQ PNDCSMD SCLTS + SQKEQEI N G+ LR +N + +E Sbjct: 241 TQPTHPNDCSMD-SCLTSIEGSQKEQEIHNTGMGLRPYNGNALLEPKVIAG--------- 290 Query: 653 TDAPIHNNLRNPELMWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNLSMTIGLERER 474 + L+ EL W ++ + N FL+ + + +R +++ E S SNLS+ +GL+ ER Sbjct: 291 -----EHALQQTELKWGEDQRDNKMFLSSM--RNDTDRRTFSAERSCSNLSIGVGLQGER 343 Query: 473 ENGISMYPERVITENHHSDGEFQHRNTARTEAMKPVEDRF-PSY---FAAPRLDLNTRGD 306 N S + E + D FQ + R +A+K ++ P Y + A +LDLN+ G Sbjct: 344 GNVSSSFAEARF-KGRSEDDSFQDKTNRRIDAIKLENEKLSPGYRLSYYATKLDLNSHG- 401 Query: 305 NEAAAASSCKQLDLNRFSWN 246 E AAS C+QLDLN FSWN Sbjct: 402 -EIDAASGCRQLDLNGFSWN 420 >ref|XP_003539165.1| PREDICTED: uncharacterized protein LOC100781878 [Glycine max] Length = 414 Score = 504 bits (1299), Expect = e-140 Identities = 300/458 (65%), Positives = 325/458 (70%), Gaps = 27/458 (5%) Frame = -1 Query: 1538 HHQHQQKNIHSSS---RMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFT 1368 H QHQ KNIHSSS RMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARF Sbjct: 5 HQQHQGKNIHSSSSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFI 64 Query: 1367 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKISAGTG 1188 EAV QLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSK+LHGQSNN THKI+ +G Sbjct: 65 EAVQQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKSLHGQSNNATHKITINSG 124 Query: 1187 ----ERLPETNGTH-MNKLSLGPQT-NKDLHISEALQMQIEVQRRLNEQLEVQRHLQVRI 1026 ERL E N TH MN L+L PQ+ NKDLHISEALQMQIEVQRRLNEQL+VQR LQ+RI Sbjct: 125 SATDERLRENNETHVMNNLNLAPQSINKDLHISEALQMQIEVQRRLNEQLQVQRLLQLRI 184 Query: 1025 EAQGKYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQGF 846 EAQGKYLQ+VLEKAQETLGRQNLG+VGLEAAK+QLSELVSKVSSQCLNSAFSELKE+QGF Sbjct: 185 EAQGKYLQAVLEKAQETLGRQNLGVVGLEAAKLQLSELVSKVSSQCLNSAFSELKEIQGF 244 Query: 845 CPQ-----QTQTNQP---NDCSMDSSCLTSCDRSQK--EQEIQNGGIALRHFNSHMFMER 696 P QT NQP NDCSMD SCLTSC+ S + +QEIQN G+ L FN H FME Sbjct: 245 SPHHQKQTQTNNNQPINANDCSMD-SCLTSCEGSSQKDQQEIQNRGMNLIPFNVHTFME- 302 Query: 695 XXXXXXXXXXXXQATDAPIHNNLRNPELMWCDEVKKNNTFLTPLGKSKNEERSSYAVESS 516 P NNL N +L WCD VKKNNTFLT R S E S Sbjct: 303 ----------------GPNLNNLPNTDLKWCDPVKKNNTFLT---------RLSMHAERS 337 Query: 515 PSNLSMTIGLERERENGISMYPERVITENHHSDGEFQHRNTARTEAMKPV-------EDR 357 PSNLSM+IGL E TEN + RTE++KP + Sbjct: 338 PSNLSMSIGL-----------LEGETTENRST--------IVRTESIKPAVAEKVSQDYG 378 Query: 356 FPS-YFAAPRLDLNTRGDNEAAAASSCKQLDLNRFSWN 246 PS YFAA +LD T + + +SCKQLDLN FSWN Sbjct: 379 LPSNYFAASKLDQTTEDNKD--TKTSCKQLDLNGFSWN 414 >ref|XP_011034346.1| PREDICTED: uncharacterized protein LOC105132500 isoform X1 [Populus euphratica] Length = 422 Score = 503 bits (1296), Expect = e-139 Identities = 278/442 (62%), Positives = 322/442 (72%), Gaps = 8/442 (1%) Frame = -1 Query: 1547 MYHHHQHQQKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFT 1368 MYHHHQHQ K+IHSSSRM IP ERH+FLQ GNG GDSGLVLSTDAKPRLKWTPDLH R Sbjct: 1 MYHHHQHQGKSIHSSSRMAIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERVI 60 Query: 1367 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKIS--AG 1194 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ+ + KI A Sbjct: 61 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQATIGSSKIGTVAV 120 Query: 1193 TGERLPETNGTH--MNKLSLGPQTNKDLHISEALQMQIEVQRRLNEQLEVQRHLQVRIEA 1020 G+R+PE N TH +N LS+G Q NK LH SEALQMQIEVQRRL+EQLEVQRHLQ+RIEA Sbjct: 121 VGDRMPEANATHININNLSIGSQPNKSLHFSEALQMQIEVQRRLHEQLEVQRHLQLRIEA 180 Query: 1019 QGKYLQSVLEKAQETLGRQNLGMVGLEAAKVQLSELVSKVSSQCLNSAFSELKELQGFCP 840 QGKYLQ+VLEKAQETLGRQNLG VGLEAAKVQLSELVSKVS+QCLNS FSEL +LQG CP Sbjct: 181 QGKYLQAVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLNSTFSELNDLQGLCP 240 Query: 839 QQTQTNQPNDCSMDSSCLTSCDRSQKEQEIQNGGIALRHFNSHMFMERXXXXXXXXXXXX 660 QQT QPNDCSMD SCLTSC+ SQKEQEI N G+ LR NS+ F+E Sbjct: 241 QQTPPTQPNDCSMD-SCLTSCEGSQKEQEIHNTGMGLRPCNSNAFVE------------- 286 Query: 659 QATDAPIHNNLRNPELMWCDEVKKNNTFLTPLGKSKNEERSSYAVESSPSNLSMTIGLER 480 + + L+ EL W + +++N FLT +G ER +++ E S S+LS+ +GL+ Sbjct: 287 -PKEITEEHALQQTELKWGEYLRENKMFLTSIG--HETERRTFSAERSCSDLSIGVGLQG 343 Query: 479 ERENGISMYPERVITENHHSDGEFQHRNTARTEAMKPVEDRF-PSY---FAAPRLDLNTR 312 E+ N S + E + D FQ + R +++K ++ P Y + +LDLN+ Sbjct: 344 EKGNISSSFAEGRF-KGMSEDDSFQDQTNKRDDSVKFENEKMSPGYRLSYFTTKLDLNSH 402 Query: 311 GDNEAAAASSCKQLDLNRFSWN 246 E AASSCKQLDLN FSWN Sbjct: 403 --EEIDAASSCKQLDLNGFSWN 422