BLASTX nr result
ID: Mentha27_contig00010617
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00010617 (2599 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                       Score    E
Sequences producing significant alignments:                            (bits) Value

gb|EYU21278.1| hypothetical protein MIMGU_mgv1a001736mg [Mimulus...     1125   0.0
ref|XP_004252398.1| PREDICTED: AT-rich interactive domain-contai...     1074   0.0
ref|XP_006362097.1| PREDICTED: AT-rich interactive domain-contai...     1071   0.0
emb|CBI35803.3| unnamed protein product [Vitis vinifera]                1043   0.0
gb|EXB64667.1| AT-rich interactive domain-containing protein 4 [...     1002   0.0
ref|XP_006452906.1| hypothetical protein CICLE_v10007563mg [Citr...      997   0.0
ref|XP_006474564.1| PREDICTED: AT-rich interactive domain-contai...      996   0.0
ref|XP_007217035.1| hypothetical protein PRUPE_ppa001668mg [Prun...      995   0.0
ref|XP_006381551.1| hypothetical protein POPTR_0006s13780g [Popu...      993   0.0
ref|XP_002324130.2| arid/bright DNA-binding domain-containing fa...      978   0.0
ref|XP_004303747.1| PREDICTED: AT-rich interactive domain-contai...      975   0.0
ref|XP_002277324.1| PREDICTED: AT-rich interactive domain-contai...      970   0.0
ref|XP_007012520.1| ARID/BRIGHT DNA-binding domain-containing pr...      968   0.0
ref|XP_003547888.1| PREDICTED: AT-rich interactive domain-contai...      948   0.0
ref|XP_002516200.1| DNA binding protein, putative [Ricinus commu...      947   0.0
ref|XP_007135372.1| hypothetical protein PHAVU_010G123900g [Phas...      942   0.0
ref|XP_007012522.1| ARID/BRIGHT DNA-binding domain-containing pr...      936   0.0
ref|XP_004510562.1| PREDICTED: AT-rich interactive domain-contai...      916   0.0
ref|XP_003533805.1| PREDICTED: AT-rich interactive domain-contai... 
912 0.0 ref|XP_006587067.1| PREDICTED: AT-rich interactive domain-contai... 907 0.0 >gb|EYU21278.1| hypothetical protein MIMGU_mgv1a001736mg [Mimulus guttatus] Length = 767 Score = 1125 bits (2910), Expect = 0.0 Identities = 575/773 (74%), Positives = 623/773 (80%), Gaps = 9/773 (1%) Frame = -1 Query: 2293 MFHTQGALKHTCNLLAVVCSEPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTPD 2114 MFHTQGALK+TCNLLAV+C+ E+K ++V +ER FPFPEIVSSGRLEVQTLKNPT D Sbjct: 1 MFHTQGALKNTCNLLAVLCNRAAENKHSQNVLDERPNFPFPEIVSSGRLEVQTLKNPTVD 60 Query: 2113 EFRKVLDSWQPNLVYLQGERLENEEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVPN 1934 EF KVLDS Q NLVYLQGE LEN+++GSI WGG LSSPEA++GLF+S +PTTVYLEVPN Sbjct: 61 EFSKVLDSSQANLVYLQGEHLENDKIGSIVWGGFELSSPEAITGLFNSKLPTTVYLEVPN 120 Query: 1933 GEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRLH 1754 GE LAKSLHSKG+PYVIYW NS SCY ASHF ALFS IQSSSCHTWD+F+LADASFRLH Sbjct: 121 GERLAKSLHSKGIPYVIYWNNSFSCYEASHFRHALFSSIQSSSCHTWDSFKLADASFRLH 180 Query: 1753 CLRNSSDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXS------LPAIKIYD 1592 CLR ++ VNDE GP L LPAIKIYD Sbjct: 181 CLRGNN---LVNDEVGPTLIGEAPKITVDAPEMEEDRVNDEDEDEESLSSGPLPAIKIYD 237 Query: 1591 DDVNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRG 1412 DDVN RFLVCG LEDGLNALL+IEMRGSKLHNRVSALPPPLQAG+FSRG Sbjct: 238 DDVNTRFLVCGRTTSLDASLLGSLEDGLNALLNIEMRGSKLHNRVSALPPPLQAGSFSRG 297 Query: 1411 VVTMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKPP 1232 VVTMRCDLST+SSAHISLLVSGSAQTCFDDQLLENHIKSE+IDKS LI A+ NSDENKPP Sbjct: 298 VVTMRCDLSTTSSAHISLLVSGSAQTCFDDQLLENHIKSEIIDKSRLIQAMPNSDENKPP 357 Query: 1231 TSEPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFE 1052 SEPRRSVS+ACGA VFEVCMKVPSWA+QVLRQLAPD+SYR+ VALGIA IQGLAVASFE Sbjct: 358 LSEPRRSVSIACGATVFEVCMKVPSWATQVLRQLAPDISYRSLVALGIAGIQGLAVASFE 417 Query: 1051 REDAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASIVNGGIQTL 872 +ED+ERLLFFC++Q S KR SI EI +NG ++ Sbjct: 418 KEDSERLLFFCTKQENISRSNDFKLTTPPSWLRAPPPSRKRPSIYQEIVPVTLNGLSSSV 477 Query: 871 
IKKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADLH---D 701 +N +KE + SNGV + +R+IKIAALRPIPHVRHQKMLPFS+I D DLH D Sbjct: 478 ---NENNNKEIKFSNGVNTSLSSAKRKIKIAALRPIPHVRHQKMLPFSRIADFDLHHHLD 534 Query: 700 GSQVKANLPLPPAKHSTGTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCSE 521 GS VKA+LP PAKH TPVS RKS S +YQAKQV+SLNP+PLKKHGCGRSPLHVCSE Sbjct: 535 GSYVKASLPSAPAKH-VSVTPVS-RKSGSGSYQAKQVISLNPLPLKKHGCGRSPLHVCSE 592 Query: 520 EEFLKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGIN 341 EEFLKDVMQFLILRGHNRLIPQ G+ EFPDAILNAKRLDLFNLYREVVTRGGFHVGNGIN Sbjct: 593 EEFLKDVMQFLILRGHNRLIPQNGIDEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGIN 652 Query: 340 WKGQVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWV 161 WKGQVFSKMRNHT TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWV Sbjct: 653 WKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWV 712 Query: 160 NCGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKINRSGNG 2 NCGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKI +SGNG Sbjct: 713 NCGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKIPKSGNG 765 >ref|XP_004252398.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Solanum lycopersicum] Length = 771 Score = 1074 bits (2778), Expect = 0.0 Identities = 533/774 (68%), Positives = 611/774 (78%), Gaps = 10/774 (1%) Frame = -1 Query: 2293 MFHTQGALKHTCNLLAVVCSEPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTPD 2114 MFH QGA + +C+LLAV+C E +DV + + R+ FPEIVSSGRLEVQ LKNP+ D Sbjct: 1 MFHCQGASRQSCSLLAVLCGRTSEYDQKKDVHDGKPRYCFPEIVSSGRLEVQVLKNPSTD 60 Query: 2113 EFRKVLDSWQPNLVYLQGERLENEEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVPN 1934 EF KVLDSWQPN+VYLQGE L N+EVGS+ WGG+ LSS EA+SGLFSS +PT VYLE+PN Sbjct: 61 EFHKVLDSWQPNIVYLQGEHLSNDEVGSLVWGGLDLSSAEAISGLFSSVLPTAVYLELPN 120 Query: 1933 GEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRLH 1754 GE+LA++LH+KG+PYV+YWK++ SCY ASHF A V QSS+CH WDAFQLA ASFRL+ Sbjct: 121 GEKLAEALHAKGIPYVMYWKSAFSCYAASHFRHAFLCVAQSSTCHVWDAFQLAHASFRLY 180 Query: 1753 CLRN----SSDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYDDD 1586 C+RN S ++ +D GP+L 
LPAIKIYDDD Sbjct: 181 CVRNNFALSEMSQRDSDNVGPHLLGDPPNIDVPLPEAGPEDDEESNSDA-LPAIKIYDDD 239 Query: 1585 VNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRGVV 1406 V MRFLVCG+ + DGLNALL+IEMRGSKLHNRVSALPPPLQAGTFSRGVV Sbjct: 240 VTMRFLVCGLPCSLDECLLGSIADGLNALLNIEMRGSKLHNRVSALPPPLQAGTFSRGVV 299 Query: 1405 TMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKPPTS 1226 TMRCDLSTSSSAHISLLVSGSAQTCFDD LLENHIKSE+I+ S L+H L + +EN+PP S Sbjct: 300 TMRCDLSTSSSAHISLLVSGSAQTCFDDLLLENHIKSEIIENSTLVHVLPSDEENRPPIS 359 Query: 1225 EPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFERE 1046 PRRS+SVACG+ VFEVCMKVP WASQVLRQLAPDVSYR+ VALGIASIQGLAVASFE++ Sbjct: 360 APRRSMSVACGSEVFEVCMKVPMWASQVLRQLAPDVSYRSLVALGIASIQGLAVASFEKD 419 Query: 1045 DAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASIVNG---GIQT 875 DA+RLLFFC++QGK+GF S KRS S NG G Sbjct: 420 DAQRLLFFCTKQGKDGFFGNFKMGNPPAWLRPPAPSRKRSDFYQGASYICQNGLTPGNHV 479 Query: 874 LIKKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADLHDGS 695 +K E+KE RL NGV P +T R+++K+AA+RPIPHVRHQKMLPFS+I + D DG+ Sbjct: 480 AVK----EEKESRLGNGVATPLVTARQKLKVAAMRPIPHVRHQKMLPFSRISELDSLDGN 535 Query: 694 QVKANLPLPPAK---HSTGTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCS 524 QVK NLP+ P+ + G TP +HRKS SS++QAKQ++SLNP+PLKKHGCGRSP+HVCS Sbjct: 536 QVKTNLPIIPSSTKGSNVGVTPATHRKSASSSHQAKQIISLNPLPLKKHGCGRSPIHVCS 595 Query: 523 EEEFLKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGI 344 EEEFLKDVMQFLILRGH RLIPQ G+AEFPDAILNAKRLDLFNLYREVV+RGGFHVGNGI Sbjct: 596 EEEFLKDVMQFLILRGHTRLIPQSGIAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGI 655 Query: 343 NWKGQVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDW 164 NWKGQVFSKMRNHT TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC+SSA GDW Sbjct: 656 NWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCNSSAAGDW 715 Query: 163 VNCGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKINRSGNG 2 VNCG+CGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSV+N+KKK+ R+ NG Sbjct: 716 VNCGICGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVTNFKKKVLRTANG 769 >ref|XP_006362097.1| PREDICTED: AT-rich interactive 
domain-containing protein 4-like [Solanum tuberosum] Length = 770 Score = 1071 bits (2770), Expect = 0.0 Identities = 532/773 (68%), Positives = 611/773 (79%), Gaps = 9/773 (1%) Frame = -1 Query: 2293 MFHTQGALKHTCNLLAVVCSEPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTPD 2114 MFH QG + +C+LLAV+C E +DV + + R+ FPEIVSSGRLEVQ LKNP+ D Sbjct: 1 MFHCQGTSRQSCSLLAVLCGSTSEYDQKKDVHDGKPRYCFPEIVSSGRLEVQVLKNPSTD 60 Query: 2113 EFRKVLDSWQPNLVYLQGERLENEEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVPN 1934 EF KVLDSWQPN+VYLQGE L N+EVGS+ WGG+ LSS EA+SGLFSS +PT VYLE+PN Sbjct: 61 EFHKVLDSWQPNIVYLQGEHLSNDEVGSLVWGGLDLSSAEAISGLFSSALPTAVYLELPN 120 Query: 1933 GEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRLH 1754 GE+LA++LH+KG+PYV+YWK++ SCY ASHF A V QSS+CH WDAFQLA ASFRL+ Sbjct: 121 GEKLAEALHAKGIPYVMYWKSAFSCYAASHFRHAFLCVAQSSTCHVWDAFQLAQASFRLY 180 Query: 1753 CLRNS----SDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYDDD 1586 C++N+ ++ +D GP+L LPAIKIYDDD Sbjct: 181 CVQNNFVLPEMSQRDSDNMGPHLLGDPPNIDVPPPEAGPDDDEESNSDA-LPAIKIYDDD 239 Query: 1585 VNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRGVV 1406 V MRFLVCG+ + DGLNALL+IEMRGSKLHNRVSALPPPLQAGTFSRGVV Sbjct: 240 VTMRFLVCGLPCSLDECLLGSIADGLNALLNIEMRGSKLHNRVSALPPPLQAGTFSRGVV 299 Query: 1405 TMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKPPTS 1226 TMRCDLSTSSSAHISLLVSGSAQTCFDD LLENHIKSE+I+ S L+H L + +EN+PP S Sbjct: 300 TMRCDLSTSSSAHISLLVSGSAQTCFDDLLLENHIKSEIIENSTLVHVLPSDEENRPPIS 359 Query: 1225 EPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFERE 1046 PRRS+SVACG+ VFEVCMKVP WASQVLRQLAPDVSYR+ VALGIASIQGLAVASFE++ Sbjct: 360 APRRSMSVACGSEVFEVCMKVPMWASQVLRQLAPDVSYRSLVALGIASIQGLAVASFEKD 419 Query: 1045 DAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASIVNG---GIQT 875 DA+RLLFF ++QGK+GF S KRS S NG G Sbjct: 420 DAQRLLFFYTKQGKDGFFGNFKIGDPPAWLRPPAPSRKRSDFYQGASYICQNGSTPGNHV 479 Query: 874 LIKKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADLHDGS 695 +K E+KE RL NGV P +T R+++K+AA+RPIPHVRHQKMLPFS+I + D DG+ Sbjct: 480 
AVK----EEKESRLGNGVATPLVTARQKLKVAAMRPIPHVRHQKMLPFSRISELDSLDGN 535 Query: 694 QVKANLPLPPAKHST--GTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCSE 521 QVK NLP+ P+ + G TPV+HRKS SS++QAKQ++SLNP+PLKKHGCGRSP+HVCSE Sbjct: 536 QVKTNLPIIPSTKGSNVGVTPVTHRKSASSSHQAKQIISLNPLPLKKHGCGRSPIHVCSE 595 Query: 520 EEFLKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGIN 341 EEFLKDVMQFLILRGH RLIPQGGLAEFPDAILNAKRLDLFNLYREVV+RGGFHVGNGIN Sbjct: 596 EEFLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGIN 655 Query: 340 WKGQVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWV 161 WKGQVFSKMRNHT TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC+SSA GDWV Sbjct: 656 WKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCNSSAAGDWV 715 Query: 160 NCGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKINRSGNG 2 NCG+CGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSV+N+KKK+ R+ NG Sbjct: 716 NCGICGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVTNFKKKVLRTANG 768 >emb|CBI35803.3| unnamed protein product [Vitis vinifera] Length = 746 Score = 1043 bits (2697), Expect = 0.0 Identities = 523/770 (67%), Positives = 597/770 (77%), Gaps = 6/770 (0%) Frame = -1 Query: 2293 MFHTQGALKHTCNLLAVVCSEPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTPD 2114 M HTQG HTC LLAV C + E K + S +R R+PFP+ VSSGRLEVQTL +P+PD Sbjct: 1 MLHTQGISNHTCGLLAVTCGKTSECKQEHETSNDRPRYPFPDFVSSGRLEVQTLTSPSPD 60 Query: 2113 EFRKVLDSWQPNLVYLQGERLENEEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVPN 1934 EFR+V +S QPN VY QGE+L+N+EVGS+ WGG+ LSS E + GLF S +PTTVYLE+PN Sbjct: 61 EFRRVFESVQPNFVYFQGEQLQNDEVGSLVWGGVELSSAEDICGLFGSKLPTTVYLEIPN 120 Query: 1933 GEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRLH 1754 GE+LA++LHSKG+PYVIYWKN+ SCY A HF +ALFSV+QSSS HTWDAFQLA ASFRL+ Sbjct: 121 GEKLAEALHSKGIPYVIYWKNAFSCYAACHFRNALFSVVQSSSTHTWDAFQLAYASFRLY 180 Query: 1753 CLRNS----SDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYDDD 1586 C+RN+ ++ KV+ + GP L LPAIKIYDDD Sbjct: 181 CVRNNHVLPANSHKVSGKLGPRLLGDPATIDVPPPEVDAGEDEEGSLGT-LPAIKIYDDD 239 Query: 1585 VNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRGVV 1406 V +RFLVCG 
LEDGLNALLSIE+RGSKLHNRVSA PPPLQAGTFSRGVV Sbjct: 240 VGIRFLVCGEPCMLDSCLFESLEDGLNALLSIEIRGSKLHNRVSAPPPPLQAGTFSRGVV 299 Query: 1405 TMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKPPTS 1226 TMRCDLST SSAHISLLVSGSAQTCFDDQLLEN+IK EV ++S+L+HAL S+ NKPP S Sbjct: 300 TMRCDLSTCSSAHISLLVSGSAQTCFDDQLLENNIKKEVTEQSQLVHALPYSEGNKPPLS 359 Query: 1225 EPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFERE 1046 EPRRS S+ACGAAVFEVC KVP+WASQVLRQLAPDVSYR+ VALGIASIQGLAVASFE++ Sbjct: 360 EPRRSASIACGAAVFEVCAKVPAWASQVLRQLAPDVSYRSLVALGIASIQGLAVASFEKD 419 Query: 1045 DAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASIVNGGIQTLIK 866 DA RLLFFC+RQGK + + S + ++ Sbjct: 420 DANRLLFFCTRQGKYIHP-------------------------NNFTPSRLPSWLKPPPP 454 Query: 865 KEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADLHDGSQVK 686 + + NGV +P + +R+K+AA+RPIPH+RH KMLPFS I +AD HDG QVK Sbjct: 455 SRKRVEPSQDTMNGVTMPLLPAGQRLKVAAMRPIPHIRHHKMLPFSGISEADGHDGGQVK 514 Query: 685 ANLPL-PPAKHS-TGTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCSEEEF 512 ANL + PP KHS G+T HRKS SS+YQAKQ++SLNP+PLKKHGCGRSP+ VCSEEEF Sbjct: 515 ANLSVPPPTKHSIVGSTSAMHRKSFSSSYQAKQIISLNPLPLKKHGCGRSPIRVCSEEEF 574 Query: 511 LKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGINWKG 332 LKDVMQFL LRGH RLIPQGGLAEFPDAILNAKRLDL+NLYREVV+RGGFHVGNGINWKG Sbjct: 575 LKDVMQFLNLRGHTRLIPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGFHVGNGINWKG 634 Query: 331 QVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCG 152 QVFSKMRNHT TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG Sbjct: 635 QVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCG 694 Query: 151 LCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKINRSGNG 2 +CGEWAHFGCDRR GLGAFKDYAKTDGLEYICPQCSV+N+KKK N++ NG Sbjct: 695 ICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVTNFKKKANKAPNG 744 >gb|EXB64667.1| AT-rich interactive domain-containing protein 4 [Morus notabilis] Length = 779 Score = 1002 bits (2590), Expect = 0.0 Identities = 511/773 (66%), Positives = 596/773 (77%), Gaps = 9/773 (1%) Frame = -1 Query: 2293 
MFHTQGALKHTCNLLAVVCSEPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTPD 2114 MFH+QG+ K TC+LLAV C ESK +DV E R +PFPE++SSGRLEVQTL +P+ + Sbjct: 1 MFHSQGSSKQTCSLLAVTCGNVSESKRKKDVPENRSLYPFPELISSGRLEVQTLTSPSKE 60 Query: 2113 EFRKVLDSWQPNLVYLQGERLENEEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVPN 1934 EF K+L+S++PNLVYLQGE+L N+EVG + WG + LS+PE+VS LF +T+PTTVYLE+P+ Sbjct: 61 EFSKLLESYKPNLVYLQGEQLANDEVGPLVWGDVDLSTPESVSELFGTTLPTTVYLEIPD 120 Query: 1933 GEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRLH 1754 EELA+ LHSKGVPYVIYWK+ S + A HF +AL SV++SSS H WDAFQLA ASFRL+ Sbjct: 121 CEELAEELHSKGVPYVIYWKDRFSRHAACHFRNALLSVVKSSSTHAWDAFQLAYASFRLY 180 Query: 1753 CLRNS----SDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYDDD 1586 C+RN+ S G +++DE GP L +LPAIKI+DDD Sbjct: 181 CVRNNHVLPSKGHEISDEQGPCLL-GDRLKINVDPPAADVEDDEDGSLDTLPAIKIHDDD 239 Query: 1585 VNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRGVV 1406 +++RFLVCG+ LEDGLNALL+IE+RG +LH + SA PPPLQAGTFSRGVV Sbjct: 240 LSLRFLVCGVPSTLDESVLEPLEDGLNALLNIEIRGGRLHGKFSAPPPPLQAGTFSRGVV 299 Query: 1405 TMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDE-NKPPT 1229 TMRCDLST S AHIS+L+SGSAQTCFDDQLLENHIK+E+I+ S+L+ AL + E NK P Sbjct: 300 TMRCDLSTCSCAHISILLSGSAQTCFDDQLLENHIKNEIIENSQLVRALPTASEGNKLPL 359 Query: 1228 SEPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFER 1049 SEPR+S S+ACGA VFEVCMKVP+WASQVLRQLAPDVSY + VALGIASIQG+ VASFE+ Sbjct: 360 SEPRKSASIACGATVFEVCMKVPAWASQVLRQLAPDVSYHSLVALGIASIQGIPVASFEK 419 Query: 1048 EDAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASIVNGG--IQT 875 EDAERLLFFCS QGK S KRS E S +G Sbjct: 420 EDAERLLFFCSSQGKE-ISNDLVFSNPPPWLRPPAPSRKRS---QETSPGSHDGHRVPNQ 475 Query: 874 LIKKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADLHDGS 695 ++ K + EDKER SNGV LP + R+R+K+AA+RPIPHVR KM PFS I +AD HDG Sbjct: 476 VVSKSEEEDKERGPSNGVSLPLLPARQRLKVAAMRPIPHVRRPKMTPFSGISEADGHDGG 535 Query: 694 QVKANLPL-PPAKHS-TGTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCSE 521 QVKA +P+ PP K S G TP + RKS SS+ QAKQ++SLNP+PLKKHGCGRS +H CSE Sbjct: 536 
QVKAIVPVAPPTKLSIVGLTPSAQRKSFSSSSQAKQIISLNPLPLKKHGCGRSSIHTCSE 595 Query: 520 EEFLKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGIN 341 EEFLKDVMQFLILRGH RLIPQ GLAEFPDAILN KRLDL+NLY+EVVTRGGFHVGNGIN Sbjct: 596 EEFLKDVMQFLILRGHTRLIPQSGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGIN 655 Query: 340 WKGQVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWV 161 WKGQ+FSKMRN+T TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWV Sbjct: 656 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWV 715 Query: 160 NCGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKINRSGNG 2 NCG+CGEWAHFGCDRR GLGAFKDYAKTDGLEYICP CSVSN+KKK + NG Sbjct: 716 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVSNFKKKSQKVSNG 768 >ref|XP_006452906.1| hypothetical protein CICLE_v10007563mg [Citrus clementina] gi|557556132|gb|ESR66146.1| hypothetical protein CICLE_v10007563mg [Citrus clementina] Length = 745 Score = 997 bits (2578), Expect = 0.0 Identities = 499/771 (64%), Positives = 591/771 (76%), Gaps = 7/771 (0%) Frame = -1 Query: 2293 MFHTQGALKHTCNLLAVVCSEPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTPD 2114 MFH Q + ++ C+LLAV+ + + K + ++++ ++PFPEI SSGRLEV L +P+ D Sbjct: 2 MFHAQSSSRNHCSLLAVLSRKFVDDKQKQAATDDKPKYPFPEIASSGRLEVHLLSSPSTD 61 Query: 2113 EFRKVLDSWQPNLVYLQGERL-ENEEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVP 1937 EFR++L+S +PN+VYLQGE++ ++EE+GS+ WG + LS+PEA+ GLF ST+PTTVYLE+P Sbjct: 62 EFRRLLESSEPNIVYLQGEKINDSEEIGSLVWGDVDLSTPEALCGLFGSTLPTTVYLEIP 121 Query: 1936 NGEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRL 1757 NGE A++LHS+GVPYVIYWK+S SCY A HF AL SV+QSS HTWDAFQLA ASFRL Sbjct: 122 NGENFAEALHSRGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRL 181 Query: 1756 HCLRNS----SDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYDD 1589 +C+RN+ S+ +K + + GP+L LPAIKIYDD Sbjct: 182 YCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPEN--LPAIKIYDD 239 Query: 1588 DVNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRGV 1409 DV MRFLVCG+ LEDGLNALL+IE+RGSKLHNR SA PPPLQAG FSRGV Sbjct: 240 
DVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGV 299 Query: 1408 VTMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKPPT 1229 VTMRCDLST SSAHISLLVSGSAQTCF+DQLLENHIK+E+I+ S+L+HAL NS +N+ P Sbjct: 300 VTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPP 359 Query: 1228 SEPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFER 1049 SEPR+S S+ACGA+VFEV MKV +WASQVLRQLAPDVSYR+ V LGIASIQGL+VASFE+ Sbjct: 360 SEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEK 419 Query: 1048 EDAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASIVNGGIQTLI 869 +DAERLLFFC+RQGK S KRS C E Sbjct: 420 DDAERLLFFCTRQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRE-------------- 465 Query: 868 KKEDNEDKERRLSNGVGLPSMTQ-RRRIKIAALRPIPHVRHQKMLPFSKILDADLHDGSQ 692 S GV ++ R ++ AA+RPIPH RH KMLPFS + + +DG Q Sbjct: 466 ------------SKGVESENVCNVRPKLNAAAMRPIPHTRHHKMLPFSGFSEIERYDGDQ 513 Query: 691 VKANLPLPPAKHST-GTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCSEEE 515 VKANLP+ P KHS+ G TPV+HRKS SS+YQA+Q++SLNP+PLKKHGCGR+P+ VCSEEE Sbjct: 514 VKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEE 573 Query: 514 FLKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGINWK 335 FL+DVMQFLILRGH RL+PQGGLAEFPDAILNAKRLDLFNLYREVV+RGGFHVGNGINWK Sbjct: 574 FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 633 Query: 334 GQVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNC 155 GQVFSKMRNHT TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNC Sbjct: 634 GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 693 Query: 154 GLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKINRSGNG 2 G+CGEWAHFGCDRR GLGAFKDYAKTDGLEY+CPQCSV+N+KKK ++ NG Sbjct: 694 GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNG 744 >ref|XP_006474564.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Citrus sinensis] Length = 745 Score = 996 bits (2576), Expect = 0.0 Identities = 499/771 (64%), Positives = 591/771 (76%), Gaps = 7/771 (0%) Frame = -1 Query: 2293 MFHTQGALKHTCNLLAVVCSEPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTPD 
2114 MFH Q + ++ C+LLAV+ + + K + ++++ ++PFPEI SSGRLEV L +P+ D Sbjct: 2 MFHAQSSSRNHCSLLAVLSRKFVDDKQKQAATDDKPKYPFPEIASSGRLEVHLLSSPSTD 61 Query: 2113 EFRKVLDSWQPNLVYLQGERL-ENEEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVP 1937 EFR++L+S +PN+VYLQGE++ ++EE+GS+ WG + LS+PEA+ GLF ST+PTTVYLE+P Sbjct: 62 EFRRLLESSEPNIVYLQGEKINDSEEIGSLVWGDVDLSTPEALCGLFGSTLPTTVYLEIP 121 Query: 1936 NGEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRL 1757 NGE A++LHS+GVPYVIYWK+S SCY A HF AL SV+QSS HTWDAFQLA ASFRL Sbjct: 122 NGENFAEALHSRGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFRL 181 Query: 1756 HCLRNS----SDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYDD 1589 +C+RN+ S+ +K + + GP+L LPAIKIYDD Sbjct: 182 YCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPEN--LPAIKIYDD 239 Query: 1588 DVNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRGV 1409 DV MRFLVCG+ LEDGLNALL+IE+RGSKLHNR SA PPPLQAG FSRGV Sbjct: 240 DVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGV 299 Query: 1408 VTMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKPPT 1229 VTMRCDLST SSAHISLLVSGSAQTCF+DQLLENHIK+E+I+ S+L+HAL NS +N+ P Sbjct: 300 VTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPP 359 Query: 1228 SEPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFER 1049 SEPR+S S+ACGA+VFEV MKV +WASQVLRQLAPDVSYR+ V LGIASIQGL+VASFE+ Sbjct: 360 SEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEK 419 Query: 1048 EDAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASIVNGGIQTLI 869 +DAERLLFFC+RQGK S KRS C E Sbjct: 420 DDAERLLFFCTRQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRE-------------- 465 Query: 868 KKEDNEDKERRLSNGVGLPSMTQ-RRRIKIAALRPIPHVRHQKMLPFSKILDADLHDGSQ 692 S GV ++ R ++ AA+RPIPH RH KMLPFS + + +DG Q Sbjct: 466 ------------SKGVESENVCNVRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQ 513 Query: 691 VKANLPLPPAKHST-GTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCSEEE 515 VKANLP+ P KHS+ G TPV+HRKS SS+YQA+Q++SLNP+PLKKHGCGR+P+ VCSEEE Sbjct: 514 VKANLPVAPLKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEE 573 Query: 514 
FLKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGINWK 335 FL+DVMQFLILRGH RL+PQGGLAEFPDAILNAKRLDLFNLYREVV+RGGFHVGNGINWK Sbjct: 574 FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 633 Query: 334 GQVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNC 155 GQVFSKMRNHT TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNC Sbjct: 634 GQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 693 Query: 154 GLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKINRSGNG 2 G+CGEWAHFGCDRR GLGAFKDYAKTDGLEY+CPQCSV+N+KKK ++ NG Sbjct: 694 GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNG 744 >ref|XP_007217035.1| hypothetical protein PRUPE_ppa001668mg [Prunus persica] gi|462413185|gb|EMJ18234.1| hypothetical protein PRUPE_ppa001668mg [Prunus persica] Length = 783 Score = 995 bits (2573), Expect = 0.0 Identities = 508/773 (65%), Positives = 586/773 (75%), Gaps = 9/773 (1%) Frame = -1 Query: 2293 MFHTQGALKHTCNLLAVVCSEPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTPD 2114 M H+QGA K TC+LL V C + E KP D +E+ ++PFPE+VS GRLEVQTL P+ + Sbjct: 1 MNHSQGASKQTCSLLVVTCGKISEEKPNEDTLDEKLKYPFPELVSLGRLEVQTLTKPSKE 60 Query: 2113 EFRKVLDSWQPNLVYLQGERLENEEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVPN 1934 EF K+L+S++PNLVYLQGE+LEN E+GS W + LS+ EA+S +FS+T+PTTVYLEVPN Sbjct: 61 EFCKMLESYKPNLVYLQGEQLENNEIGSPVWEDVDLSTAEAISEIFSATLPTTVYLEVPN 120 Query: 1933 GEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRLH 1754 GE LA +LHSKG+PYVIYWK+ S Y A HF AL SV+QSSS HTWDAFQLA ASFRL+ Sbjct: 121 GENLAAALHSKGIPYVIYWKHEFSSYAACHFRHALLSVVQSSSTHTWDAFQLAYASFRLY 180 Query: 1753 CLRNS-----SDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYDD 1589 C+ NS + + + E GP L LPAIKI+DD Sbjct: 181 CVENSHAIPANRHKSSSAELGPCLLGDRLKINVDPPEADVEEDEEGSLGT-LPAIKIHDD 239 Query: 1588 DVNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRGV 1409 DV +RFLVCG LEDGLNALL+IEMRGSKLH + SA PPPLQAGTFSRGV Sbjct: 240 DVILRFLVCGEPSTLDASLLEPLEDGLNALLNIEMRGSKLHGKFSAPPPPLQAGTFSRGV 299 Query: 1408 VTMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKPPT 1229 
VTMRCD+ST SSAHISLLVSGSAQTCFDDQLLENHIK+EVI++ +L+ AL N++ NK P Sbjct: 300 VTMRCDVSTCSSAHISLLVSGSAQTCFDDQLLENHIKNEVIEEIQLVRALPNNEGNKVPL 359 Query: 1228 SEPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFER 1049 +EPR+S S+ACGA VFEVCMKVP+WASQVLRQLAPDVSY + VALGIASIQGL VASFE+ Sbjct: 360 AEPRKSASIACGATVFEVCMKVPAWASQVLRQLAPDVSYHSLVALGIASIQGLPVASFEK 419 Query: 1048 EDAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEIS-ASIVNGGIQTL 872 EDAERLLFFCS GK+ S KRS C E S S + + +L Sbjct: 420 EDAERLLFFCSSLGKDNKSNDFILGSPPTWLRPPPPSRKRSQPCQETSRGSNYSQRLPSL 479 Query: 871 I-KKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADLHDGS 695 K D ++KE NGV P + R+R+KIAA+RPIPHVR KM PFS + + D HDG Sbjct: 480 AASKIDEDNKEAGAMNGVSTPLLPPRQRLKIAAMRPIPHVRRPKMTPFSGMSELDGHDGG 539 Query: 694 QVKANLP-LPPAK-HSTGTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCSE 521 Q KANLP PP K + G TP + RKS SS+ +KQ++SLNP+PLKKHGCGRSP+H C E Sbjct: 540 QFKANLPPAPPTKLNIVGLTPTTQRKSYSSSSHSKQIISLNPLPLKKHGCGRSPIHSCLE 599 Query: 520 EEFLKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGIN 341 EEFLKDVMQFLILRGH+RLIPQGGLAEFPDAILN KRLDL+NLY+EVVTRGGFHVGNGIN Sbjct: 600 EEFLKDVMQFLILRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGIN 659 Query: 340 WKGQVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWV 161 WKGQ+FSKMRN+T TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWV Sbjct: 660 WKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWV 719 Query: 160 NCGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKINRSGNG 2 NCG+CGEWAHFGCDRR GLGAFKDYAKTDGLEYICP CS+SN+KKK + NG Sbjct: 720 NCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSISNFKKKPQKIANG 772 >ref|XP_006381551.1| hypothetical protein POPTR_0006s13780g [Populus trichocarpa] gi|550336257|gb|ERP59348.1| hypothetical protein POPTR_0006s13780g [Populus trichocarpa] Length = 749 Score = 993 bits (2567), Expect = 0.0 Identities = 496/770 (64%), Positives = 586/770 (76%), Gaps = 6/770 (0%) Frame = -1 Query: 2293 MFHTQGALKHTCNLLAVVCSEPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTPD 2114 MFH QG L++ C LLAV+C + ++K + +S+++ RFPFPE+ 
S+GRLEVQ L NP+ D Sbjct: 2 MFHAQGPLRNHCTLLAVLCGKSGDNKQKQPLSDDKPRFPFPELASAGRLEVQVLTNPSTD 61 Query: 2113 EFRKVLDSWQPNLVYLQGERLEN-EEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVP 1937 EF++VL S +P++VY QGE++E+ EE+G + WG + LS+PE++ GLF ST+P TVYLE+P Sbjct: 62 EFQRVLHSLEPSIVYFQGEQIEDSEEIGPLRWGDIDLSTPESLCGLFGSTLPPTVYLEIP 121 Query: 1936 NGEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRL 1757 NGE+LA++LHSKGVPYVIYWK+ SCY SHF AL SV+QSS HT DAFQLA ASFRL Sbjct: 122 NGEKLAEALHSKGVPYVIYWKSMFSCYAVSHFRQALLSVVQSSCSHTCDAFQLAYASFRL 181 Query: 1756 HCLRNS----SDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYDD 1589 +C RN+ S+G+KV + GP L LPAIKIYDD Sbjct: 182 YCGRNNNTLASNGQKVGGKPGPQLLGDPPKFDITLPEADDQGEESSSGA--LPAIKIYDD 239 Query: 1588 DVNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRGV 1409 DV MRFLVCG++ LEDGLNALL+IE+RGSKLHNR SA PPPLQAGTFSRGV Sbjct: 240 DVTMRFLVCGLSCTLDACLLESLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGTFSRGV 299 Query: 1408 VTMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKPPT 1229 VTMRCDLST SSAHISLLVSGSAQTCF+DQLLENHIK+E+I+ S+L+HAL++ +E+K P+ Sbjct: 300 VTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALTSFEESKSPS 359 Query: 1228 SEPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFER 1049 SEPR+S S+ACGA+VFEV MKVP+WASQVLRQLAPDVSYR+ V LGIASIQGL+VASFE+ Sbjct: 360 SEPRKSASIACGASVFEVSMKVPTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEK 419 Query: 1048 EDAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASIVNGGIQTLI 869 +DA+RLLFFCS QGK KRS Sbjct: 420 DDADRLLFFCSEQGKESHPLNTFLTRPPTWLIPPAPCRKRS------------------- 460 Query: 868 KKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADLHDGSQV 689 E + + S G + + +AA+RPIPH KMLPFS DA+ +DG Q Sbjct: 461 --EPTRETKPLTSGRGGENGGNVKHKFHVAAMRPIPHTHRHKMLPFSGFFDAERYDGEQA 518 Query: 688 KANLPLPPAKHS-TGTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCSEEEF 512 K +LP PP KHS G PV+HRKS SS+YQA+Q++SLNP+PLKKHGCGRSP+ VCSEEEF Sbjct: 519 KPSLPPPPPKHSVVGPAPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRSPIQVCSEEEF 578 Query: 511 LKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGINWKG 332 
L+DVMQFLILRGH+RL+PQGGLAEFPDAILNAKRLDLFNLYREVV+RGGFHVGNGINWKG Sbjct: 579 LRDVMQFLILRGHSRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKG 638 Query: 331 QVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCG 152 QVFSKMRNHT TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG Sbjct: 639 QVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCG 698 Query: 151 LCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKINRSGNG 2 +CGEWAHFGCDRR GLGAFKDYAKTDGLEYICP CS++N+KKK ++ NG Sbjct: 699 ICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPNCSIANFKKKSQKTTNG 748 >ref|XP_002324130.2| arid/bright DNA-binding domain-containing family protein [Populus trichocarpa] gi|550318261|gb|EEF02695.2| arid/bright DNA-binding domain-containing family protein [Populus trichocarpa] Length = 746 Score = 978 bits (2528), Expect = 0.0 Identities = 490/769 (63%), Positives = 583/769 (75%), Gaps = 5/769 (0%) Frame = -1 Query: 2293 MFHTQGALKHTCNLLAVVCSEPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTPD 2114 MFH QG L++ C LLAV+C + E K +S+++ R+P PE+ S+GRLEVQ L NP+ D Sbjct: 2 MFHAQGPLRNHCTLLAVLCGKSGEQK--LPLSDDKPRYPLPELESTGRLEVQVLNNPSTD 59 Query: 2113 EFRKVLDSWQPNLVYLQGERLEN-EEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVP 1937 EFR+VL S +P++VY QGE++E+ EE+GS+ W +GLS+PE++ GLF ST+P TVYLE+P Sbjct: 60 EFRQVLQSLEPSIVYFQGEQVEDREEIGSLRWADVGLSTPESLCGLFGSTLPPTVYLEMP 119 Query: 1936 NGEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRL 1757 NGE+LA++LHSKGVPYVIYWK++ SCY ASHF AL SV+QSS HT DAFQLA ASFRL Sbjct: 120 NGEKLAEALHSKGVPYVIYWKSAFSCYAASHFRQALLSVVQSSCSHTCDAFQLAHASFRL 179 Query: 1756 HCLRNS----SDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYDD 1589 +C++N+ S+ +KV + GP L LPAIKIYDD Sbjct: 180 YCVQNNNTPASNSQKVGGKPGPRLLGDPPKFDISLPEADDQGEEGSSGA--LPAIKIYDD 237 Query: 1588 DVNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRGV 1409 DV MRFLVCG+ LEDGLNALL+IE+RGSKLHNR SA PPPLQAGTFSRGV Sbjct: 238 DVTMRFLVCGLTGTLDACALGSLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGTFSRGV 297 Query: 1408 VTMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKPPT 1229 VTMRCDLST SSAHISLLVSGSAQ 
CF+DQLLENHIKSE+I+ S+L+HA ++SDE K P+ Sbjct: 298 VTMRCDLSTCSSAHISLLVSGSAQNCFNDQLLENHIKSELIENSQLVHASTSSDEIKSPS 357 Query: 1228 SEPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFER 1049 SEPR+S S+ACGA+VFEV MKVP+WASQVLRQLAPDV+YR+ V LGIASIQGL+VASFE+ Sbjct: 358 SEPRKSASIACGASVFEVSMKVPTWASQVLRQLAPDVTYRSLVMLGIASIQGLSVASFEK 417 Query: 1048 EDAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASIVNGGIQTLI 869 +DA+RLLFFC++Q K+ KR E G + Sbjct: 418 DDADRLLFFCTKQSKDPHPRNPVLTRHPSWLIPPAPCRKRYEPSRETKPLTFGCGGE--- 474 Query: 868 KKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADLHDGSQV 689 NG ++++ +AA+RPIPH R KMLPFS L+A+ +DG Q Sbjct: 475 -------------NGGNF-----KQKLYVAAMRPIPHTRRHKMLPFSGFLEAERYDGEQT 516 Query: 688 KANLPLPPAKHSTGTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCSEEEFL 509 K +LP PP G PV+HRKS S++YQA+Q++SLNP+PLKKHGCGRSP+ CSEEEFL Sbjct: 517 KPSLPPPPKHSVVGPAPVTHRKSLSNSYQAQQIISLNPLPLKKHGCGRSPIQACSEEEFL 576 Query: 508 KDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGINWKGQ 329 +DVMQFLILRGH+RL+PQGGLAEFPDAILNAKRLDLFNLYREVV+RGGFHVGNGINWKGQ Sbjct: 577 RDVMQFLILRGHSRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQ 636 Query: 328 VFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCGL 149 VFSKMRNHT TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG+ Sbjct: 637 VFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGI 696 Query: 148 CGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKINRSGNG 2 CGEWAHFGCDRR GLGAFKDYAKTDGLEYICP CS++N+KKK ++ NG Sbjct: 697 CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSIANFKKKSQKNANG 745 >ref|XP_004303747.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Fragaria vesca subsp. 
vesca] Length = 779 Score = 975 bits (2521), Expect = 0.0 Identities = 501/774 (64%), Positives = 586/774 (75%), Gaps = 10/774 (1%) Frame = -1 Query: 2293 MFHTQGALKHTCNLLAVVCSEPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTPD 2114 MFH QG TC++L V C E E K ++ E++ R+PFPE+VSSGRLEVQTL NP+ + Sbjct: 1 MFHAQG----TCSVLVVTCGEISEDKRGKETPEDKLRYPFPELVSSGRLEVQTLTNPSEE 56 Query: 2113 EFRKVLDSWQPNLVYLQGERLENEEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVPN 1934 EF K+L+S++PNLVYLQGE+LEN+EVG + W LS+ E++S +F +T+PTTVYLEVPN Sbjct: 57 EFCKLLESYKPNLVYLQGEQLENDEVGPLVWRDAYLSTAESMSDIFDATLPTTVYLEVPN 116 Query: 1933 GEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRLH 1754 GEELA +L SKG+PYVIYWK+++S Y A HF AL SV+QSSS HTWDAFQLA ASFRL+ Sbjct: 117 GEELAVALQSKGIPYVIYWKDAISTYAACHFRHALLSVVQSSSTHTWDAFQLAHASFRLY 176 Query: 1753 CLRNSS----DGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYDDD 1586 C++N + +K + E GP + LPAIKI+DDD Sbjct: 177 CVQNDHVVRVNLDKPSAELGPCILGEHLKISVDPPEADMEEDEEGATGS-LPAIKIHDDD 235 Query: 1585 VNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRGVV 1406 V++RFLVCG LEDGLNALL+IEMRGSKLH + SA PPPLQAGTFSRGVV Sbjct: 236 VSLRFLVCGQPSTLDAGILEPLEDGLNALLNIEMRGSKLHGKFSAPPPPLQAGTFSRGVV 295 Query: 1405 TMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKPPTS 1226 TMRCD+ST SSAHISLLVSGSAQTCFDDQLLENHIK EVI+ ++L+HA+ N+D NK P Sbjct: 296 TMRCDISTCSSAHISLLVSGSAQTCFDDQLLENHIKHEVIEINQLVHAVPNNDRNKLPLV 355 Query: 1225 EPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFERE 1046 EPR+S ++ACGA VFEV MKVP WASQVLRQLAPDVSYR+ V+LGIASIQGL VASFE++ Sbjct: 356 EPRKSAAIACGATVFEVSMKVPVWASQVLRQLAPDVSYRSLVSLGIASIQGLPVASFEKD 415 Query: 1045 DAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASIVNG-GIQTLI 869 DA+RLLFFCS + K+ S KRS +C E N G+ L Sbjct: 416 DADRLLFFCSSRTKDSQLNDLFLSTPPAWLRPPAPSKKRSRLCQEAIPGFRNRQGLPNLA 475 Query: 868 --KKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADLHDGS 695 K E+NE K NG P + R+R+K AA+RPIPHVR KM PFS I + + HDGS Sbjct: 476 ASKVEENE-KALGAVNGFSTPLLPARQRLKTAAMRPIPHVRRPKMTPFSGISEVNGHDGS 534 Query: 694 
QV-KANLP-LPPAK-HSTGTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCS 524 QV KA+LP +PP K + G TP + RKS SS+ QAKQ++SLNP+PLKKHGCGR P+H C Sbjct: 535 QVVKAHLPPVPPTKLNIVGLTPTTQRKSYSSSSQAKQIISLNPLPLKKHGCGRGPIHSCL 594 Query: 523 EEEFLKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGI 344 EEEFLKDVMQFLILRGH+RLIPQGGL EFPDAILN KRLDL+NLY+EVVTRGGFHVGNGI Sbjct: 595 EEEFLKDVMQFLILRGHSRLIPQGGLTEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGI 654 Query: 343 NWKGQVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDW 164 NWKGQ+FSKMRN+T TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDW Sbjct: 655 NWKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDW 714 Query: 163 VNCGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKINRSGNG 2 VNCG+CGEWAHFGCDRR GLGAFKDYAKTDGLEYICP CS+SN+KKK + NG Sbjct: 715 VNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSISNFKKKPQKVTNG 768 >ref|XP_002277324.1| PREDICTED: AT-rich interactive domain-containing protein 4 [Vitis vinifera] gi|297738501|emb|CBI27746.3| unnamed protein product [Vitis vinifera] Length = 739 Score = 970 bits (2507), Expect = 0.0 Identities = 498/770 (64%), Positives = 587/770 (76%), Gaps = 6/770 (0%) Frame = -1 Query: 2293 MFHTQGALKHTCNLLAVVCSEPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTPD 2114 MFH Q A ++ C LLAVVC + S+ +++ +PFPE+VSSGRLEVQ LKNP+ Sbjct: 1 MFHVQAASRNHCALLAVVCGKIPVSE-----DQQQHPYPFPELVSSGRLEVQILKNPSIH 55 Query: 2113 EFRKVLDSWQPNLVYLQGERLE-NEEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVP 1937 EF++ L+S +PN +YLQGE+L +EE+GS++WGG+ LSS EA+ LF T+PTTVYLE P Sbjct: 56 EFQRSLESLEPNFLYLQGEQLPGSEEIGSLTWGGVDLSSAEALVELFGPTLPTTVYLETP 115 Query: 1936 NGEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRL 1757 NGE+LAK+LHSKGV YVIYWKN+ SCY A HF ALFSV+QSS HTWDAFQLA ASFRL Sbjct: 116 NGEKLAKALHSKGVSYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFRL 175 Query: 1756 HCLRNS---SDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYDDD 1586 +C++N+ S+ +KV+ + GP L LP IKIYD D Sbjct: 176 YCVQNNTVPSNNQKVSGKLGPCLLGDPPKINVVPPEVDEEESLPAT----LPVIKIYDAD 231 Query: 1585 
VNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRGVV 1406 V+MRFLVCG LEDGLNALL IE+RGSKLHNRVSA PPPLQAGTFSRGVV Sbjct: 232 VSMRFLVCGAPSALDACLLGSLEDGLNALLCIEIRGSKLHNRVSAPPPPLQAGTFSRGVV 291 Query: 1405 TMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKPPTS 1226 TMRCDLST SSAHISLLVSGSAQTC +DQLLE++IK+E+I+KS+L+HA+ + +E+K +S Sbjct: 292 TMRCDLSTCSSAHISLLVSGSAQTCLNDQLLESYIKNELIEKSQLVHAVPSCEESKLSSS 351 Query: 1225 EPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFERE 1046 EPRRS S+ACGA+VFEV +KVP+WASQVLRQLAPDVSYR+ V LGIASIQGL+VASFE++ Sbjct: 352 EPRRSASIACGASVFEVRIKVPTWASQVLRQLAPDVSYRSLVTLGIASIQGLSVASFEKD 411 Query: 1045 DAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISAS--IVNGGIQTL 872 DA+RLLFFC+R K S KRS CHE S V GG+ Sbjct: 412 DADRLLFFCTRHAKQLNQNNSILPRPPSWLIAPPASRKRSGPCHETKPSGYKVLGGV--- 468 Query: 871 IKKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADLHDGSQ 692 NG L +++ KIAA+RPIPH R+ KMLPFS I +A DG Q Sbjct: 469 --------------NGGVL-----QQKPKIAAMRPIPHTRNHKMLPFSGISEASRCDGDQ 509 Query: 691 VKANLPLPPAKHSTGTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCSEEEF 512 K NL + PAKH+ GTTPV+HRK SS++QA+Q++SLNP+PLKKHGCGRSP+ +CSEEEF Sbjct: 510 AKGNLSVVPAKHN-GTTPVTHRKLLSSSFQAQQIISLNPLPLKKHGCGRSPIQICSEEEF 568 Query: 511 LKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGINWKG 332 L+DVMQFLILRGH RL+PQGGLAEFPDAILNAKRLDL+NLYREVV+RGGFHVGNGINWKG Sbjct: 569 LRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGFHVGNGINWKG 628 Query: 331 QVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCG 152 QVFSKMRNHT TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG Sbjct: 629 QVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCG 688 Query: 151 LCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKINRSGNG 2 +CGEWAHFGCDRR GLGAFKDYAKTDGLEYICP CS++N++KK ++ NG Sbjct: 689 ICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFQKKSQKTANG 738 >ref|XP_007012520.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 1 [Theobroma cacao] gi|590574848|ref|XP_007012521.1| ARID/BRIGHT DNA-binding domain-containing protein 
isoform 1 [Theobroma cacao] gi|508782883|gb|EOY30139.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 1 [Theobroma cacao] gi|508782884|gb|EOY30140.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 1 [Theobroma cacao] Length = 746 Score = 968 bits (2502), Expect = 0.0 Identities = 491/770 (63%), Positives = 580/770 (75%), Gaps = 6/770 (0%) Frame = -1 Query: 2293 MFHTQGALKHTCNLLAVVCS-EPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTP 2117 MF QG+ ++ C+LLAV+ ++K + VS+++ R+PFPE+ SSGRLEVQ L +P Sbjct: 2 MFSAQGSSRNHCSLLAVLSGGNVSDNKQKQPVSDDKPRYPFPELASSGRLEVQLLNSPNI 61 Query: 2116 DEFRKVLDSWQPNLVYLQGER-LENEEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEV 1940 DE R+VL+S +PN+VYLQGE+ ++EE+G + WG + LS+PE + GLF ST+PTTVYLE Sbjct: 62 DELRRVLESTEPNVVYLQGEQNADSEEIGPLIWGDVDLSTPETLCGLFDSTLPTTVYLET 121 Query: 1939 PNGEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFR 1760 PNG++LA++LHS+GVPYVIYWKN+ S + A HF AL SVIQSS HTWDAFQLA ASFR Sbjct: 122 PNGDKLAEALHSQGVPYVIYWKNTFSRFAACHFRQALLSVIQSSCSHTWDAFQLAHASFR 181 Query: 1759 LHCLRN----SSDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYD 1592 L+C+RN SS+ +K + + GP L LPAIKIYD Sbjct: 182 LYCVRNNNVVSSNSQKQSVKPGPRLLGEAPKIDVSQPEVDMQGEESSPEN--LPAIKIYD 239 Query: 1591 DDVNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRG 1412 DDV +RFLVCG LEDGLNALLSIE+RGSKLHNR SA PPPLQAGTFSRG Sbjct: 240 DDVTVRFLVCGSPCILDAFLLGSLEDGLNALLSIEIRGSKLHNRASAPPPPLQAGTFSRG 299 Query: 1411 VVTMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKPP 1232 VVTMRCD ST SSAHISLLVSGSAQTCF+DQLLENHIK+E+I+KS+L+HA S+S+E+K P Sbjct: 300 VVTMRCDFSTCSSAHISLLVSGSAQTCFNDQLLENHIKNEIIEKSQLVHAQSSSEESKLP 359 Query: 1231 TSEPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFE 1052 +SEPRRS S+ACGA+VFEVCMKVP+WASQVLRQLAPDVSYR+ V LGIASIQGL+VASFE Sbjct: 360 SSEPRRSASIACGASVFEVCMKVPTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFE 419 Query: 1051 REDAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASIVNGGIQTL 872 ++DAERLLFFC RQ K+ S KRS C + G Sbjct: 420 
KDDAERLLFFCMRQDKDPLQDSSVIAISPSWLVPPAPSRKRSEPCKDSKPLNCTG----- 474 Query: 871 IKKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADLHDGSQ 692 NG+ P + +AA+RPIPH K++PFS +A+ +DG Q Sbjct: 475 ----------MEGENGIARP------KSNVAAMRPIPHTHRHKIIPFSGFSEAERYDGDQ 518 Query: 691 VKANLPLPPAKHSTGTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCSEEEF 512 K NLP+ P K PV+HRK+ SS+YQA+Q++SLNP+PLKKHGCGR+P+ VCSEEEF Sbjct: 519 GKVNLPVVPVKQPA---PVTHRKALSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEF 575 Query: 511 LKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGINWKG 332 L+DVMQFLILRGH RL+PQGGLAEFPDAILNAKRLDLFNLYREVV+RGGFHVGNGINWKG Sbjct: 576 LRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKG 635 Query: 331 QVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCG 152 QVFSKMRNHT TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVNCG Sbjct: 636 QVFSKMRNHTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCG 695 Query: 151 LCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKINRSGNG 2 +CGEWAHFGCDRR GLGAFKDYAKTDGLEY+CP CS+SN+KKK ++ NG Sbjct: 696 ICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSISNFKKKPQKTVNG 745 >ref|XP_003547888.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Glycine max] Length = 782 Score = 948 bits (2451), Expect = 0.0 Identities = 474/770 (61%), Positives = 571/770 (74%), Gaps = 12/770 (1%) Frame = -1 Query: 2290 FHTQGALKHTCNLLAVVCSEPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTPDE 2111 FH+QG KHTC LLAV C S +S +R +PFPE+VS+GRLEVQTL +P ++ Sbjct: 4 FHSQGTPKHTCTLLAVTC---RTSSAEHKLSHAQRTYPFPELVSAGRLEVQTLCSPEKEQ 60 Query: 2110 FRKVLDSWQPNLVYLQGERLENEEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVPNG 1931 FRKVL+S+QPN VYL+G++LEN EVGS+ W G+ LS+ E ++ LF ST+PT VYLE+PNG Sbjct: 61 FRKVLESFQPNFVYLRGDQLENGEVGSLVWQGVELSTCEDITELFGSTLPTAVYLEIPNG 120 Query: 1930 EELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRLHC 1751 E A++LH KG+PYVI+WKN+ SCY A HF A SV+QSSS HTWDAF LA ASF L+C Sbjct: 121 ESFAEALHLKGIPYVIFWKNTFSCYAACHFRQAFLSVVQSSSTHTWDAFHLARASFELYC 180 Query: 1750 LRNS----SDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYDDDV 
1583 ++N+ SD + + E GP+L LPAIKI++D+V Sbjct: 181 VQNNQVLPSDSDDASSEMGPHLLGDCLKINVDPPEIDEEDDDESSSGS-LPAIKIHEDEV 239 Query: 1582 NMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRGVVT 1403 N+RFL+CG LEDGL ALL+IE+RG KLH + SA PPPLQA FSRGVVT Sbjct: 240 NLRFLICGAPSTVDESLLRSLEDGLRALLTIEIRGCKLHGKFSAPPPPLQAAAFSRGVVT 299 Query: 1402 MRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKPPTSE 1223 MRCD+ST SSAHISLLVSGSAQTCF+DQLLENHIK+E+I+KS+L+HA N++ NK E Sbjct: 300 MRCDISTCSSAHISLLVSGSAQTCFNDQLLENHIKNEIIEKSQLVHAQLNNEGNKENICE 359 Query: 1222 PRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFERED 1043 PRRS S+ACGA+VFE+CMK+P WA Q+LRQLAP+VSYR+ VALGIASIQGL +ASFE++D Sbjct: 360 PRRSASIACGASVFEICMKLPQWALQILRQLAPEVSYRSLVALGIASIQGLPIASFEKDD 419 Query: 1042 AERLLFF---CSRQG---KNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASIVNGGI 881 AERLLFF C + KN R + + G Sbjct: 420 AERLLFFYQNCEKDSCTNKNNIIFSSPPGWLKPPPPTRKRCEPRQEASPGLHEGVFAG-- 477 Query: 880 QTLIKKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADLHD 701 Q + K + E+K+R++ NG+ +P R+R+K++A+RPIPH+R +M PF + D D Sbjct: 478 QGGVCKLNEEEKDRKIVNGISMPLTPARQRLKVSAMRPIPHIRRHRMTPFCGPSETDGFD 537 Query: 700 GSQVKANLPL--PPAKHSTGTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVC 527 G+QV+A LPL P + S G+T +HRKS SSA Q+KQV+SLNP+PLKKHGCGR P+ C Sbjct: 538 GTQVEAILPLVAPTKRTSIGSTSGTHRKSFSSAAQSKQVISLNPLPLKKHGCGRGPVQTC 597 Query: 526 SEEEFLKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNG 347 SEEEFLKDVM+FLILRGHNRLIPQGGL EFPDAILN KRLDL+NLY+EVVTRGGFHVGNG Sbjct: 598 SEEEFLKDVMEFLILRGHNRLIPQGGLTEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNG 657 Query: 346 INWKGQVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGD 167 INWKGQ+FSKMRN+T TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GD Sbjct: 658 INWKGQIFSKMRNYTTTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGD 717 Query: 166 WVNCGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKIN 17 WVNCG+CGEWAHFGCDRR GLGAFKDYAKTDGLEYICP CSV+N+KKK N Sbjct: 718 WVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVTNFKKKQN 767 >ref|XP_002516200.1| DNA binding protein, putative [Ricinus 
communis] gi|223544686|gb|EEF46202.1| DNA binding protein, putative [Ricinus communis] Length = 749 Score = 947 bits (2448), Expect = 0.0 Identities = 478/728 (65%), Positives = 554/728 (76%), Gaps = 6/728 (0%) Frame = -1 Query: 2167 IVSSGRLEVQTLKNPTPDEFRKVLDSWQPNLVYLQGERLEN-EEVGSISWGGMGLSSPEA 1991 + SSGRLEVQ L +P+ DEFR+VL S +PN+VYLQGE +E+ EE+GS+ W G LS+P+A Sbjct: 43 LXSSGRLEVQILSSPSTDEFRRVLQSSEPNIVYLQGEIIEDSEEIGSLRWAGADLSTPDA 102 Query: 1990 VSGLFSSTMPTTVYLEVPNGEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQS 1811 + LF ST+P TVYLE+PNGE+LA++LH KGVPYVIYWK++ SCY A+HF AL SV+QS Sbjct: 103 LCELFGSTLPPTVYLEIPNGEKLAEALHFKGVPYVIYWKSTFSCYAAAHFRQALLSVVQS 162 Query: 1810 SSCHTWDAFQLADASFRLHCLRN----SSDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXX 1643 S HT DAFQLA ASF L+C+RN SS+ +KV + GP L Sbjct: 163 SCSHTCDAFQLAHASFSLYCVRNNTGLSSNNQKVGGKPGPRLLGEPPKIDITLPEADVQD 222 Query: 1642 XXXXXXXXSLPAIKIYDDDVNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHN 1463 LPAIKIYDDDV MRFLVC + LEDGLNALL+IE+RGSKLHN Sbjct: 223 EESSSGT--LPAIKIYDDDVTMRFLVCELPSTLDACLLGSLEDGLNALLNIEIRGSKLHN 280 Query: 1462 RVSALPPPLQAGTFSRGVVTMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVID 1283 R SA PPPLQAGTFSRGVVTMRCDLST SSAHISLLVSGSAQ CF+DQLLENHIK+E+I+ Sbjct: 281 RTSAPPPPLQAGTFSRGVVTMRCDLSTCSSAHISLLVSGSAQACFNDQLLENHIKNELIE 340 Query: 1282 KSELIHALSNSDENKPPTSEPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNF 1103 S+L+HAL +S+E+K TSEPR+S S+ CGA+VFEVC+KVPSWASQVLRQLAPDVSYR+ Sbjct: 341 NSQLVHALPSSEESKLLTSEPRKSASIGCGASVFEVCLKVPSWASQVLRQLAPDVSYRSL 400 Query: 1102 VALGIASIQGLAVASFEREDAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSS 923 V LGIASIQGL+VASFE+ED ERLLFFC+RQGK + S KRS Sbjct: 401 VMLGIASIQGLSVASFEKEDTERLLFFCTRQGKELYPNNSIIIKPPCWLIPPAPSRKRSE 460 Query: 922 ICHEISASIVNGGIQTLIKKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQK 743 C E G ER V ++++ +AA+RPIPH RH K Sbjct: 461 PCRETKLFTSKG-------------LERENGGSV-------KQKLNVAAMRPIPHTRHHK 500 Query: 742 MLPFSKILDADLHDGSQVKANLPLPPAKHST-GTTPVSHRKSTSSAYQAKQVLSLNPIPL 566 MLPFS + + +DG Q K +LP+ PAKH G PVSHRKS SS+YQA+Q++SLNP+PL Sbjct: 501 
MLPFSGFAEGERYDGDQGKPSLPVAPAKHGVVGPAPVSHRKSLSSSYQAQQIISLNPLPL 560 Query: 565 KKHGCGRSPLHVCSEEEFLKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYR 386 KKHGCGR+P+ CSEEEFL+DVMQFLILRGH RL+PQGGLAEFPDAILNAKRLDLFNLYR Sbjct: 561 KKHGCGRAPIQACSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYR 620 Query: 385 EVVTRGGFHVGNGINWKGQVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDG 206 EVV+RGGFHVGNGINWKGQVFSKMRNHT TNRMTGVGNTLKRHYETYLLEYELAHDDVDG Sbjct: 621 EVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDG 680 Query: 205 ECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKK 26 ECCLLCHSSA GDWVNCG+CGEWAHFGCDRR GLGAFKDYAKTDGLEYICP CS++N++K Sbjct: 681 ECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSIANFRK 740 Query: 25 KINRSGNG 2 K ++ NG Sbjct: 741 KSQKTANG 748 >ref|XP_007135372.1| hypothetical protein PHAVU_010G123900g [Phaseolus vulgaris] gi|561008417|gb|ESW07366.1| hypothetical protein PHAVU_010G123900g [Phaseolus vulgaris] Length = 781 Score = 942 bits (2435), Expect = 0.0 Identities = 472/767 (61%), Positives = 566/767 (73%), Gaps = 9/767 (1%) Frame = -1 Query: 2290 FHTQGALKHTCNLLAVVCSEPEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTPDE 2111 FH GA KH C LLAV C S S+ + ++PFPE+VS+GRLEVQTL+NP ++ Sbjct: 4 FHPHGAPKHACTLLAVTCGA---SFAEHKASQNQHKYPFPELVSAGRLEVQTLRNPDKEQ 60 Query: 2110 FRKVLDSWQPNLVYLQGERLENEEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVPNG 1931 FRKVL+S+QPN VYLQGE+LEN++VGS+ W G+ LS+ E + LF ST+PT VYLE+PNG Sbjct: 61 FRKVLESYQPNFVYLQGEQLENDKVGSLVWQGLELSTSEDIIELFGSTLPTAVYLEIPNG 120 Query: 1930 EELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRLHC 1751 E A++LH KG+PYVI+WKN+ Y A HF A SV+QSSS HTWDAF LA ASF L+C Sbjct: 121 ESFAEALHLKGIPYVIFWKNAFFSYAACHFRQAFLSVVQSSSTHTWDAFHLARASFELYC 180 Query: 1750 LRN----SSDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYDDDV 1583 ++N S++ E GP+L LPAIKI++D+V Sbjct: 181 VQNNQVLSTNIHDAISEMGPHLLGDCLKINVDPPEIDEEDDDENSSGT-LPAIKIHEDEV 239 Query: 1582 NMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSRGVVT 1403 N+RFLVCG LEDGL ALL+IE+RG KLH + SA PPPLQA TFSRGVVT Sbjct: 
240 NLRFLVCGAPSTVDESLLRSLEDGLRALLTIEIRGCKLHGKFSAPPPPLQAATFSRGVVT 299 Query: 1402 MRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKPPTSE 1223 MRCD+ST SSAHISLLVSGSAQTCF+DQLLE+HIK+E+I+KS+L+HA N++ NK SE Sbjct: 300 MRCDISTCSSAHISLLVSGSAQTCFNDQLLESHIKNEIIEKSQLVHAQLNNEGNKQNISE 359 Query: 1222 PRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASFERED 1043 PRRS S+ACGA VFE+CMK+P WA Q+LRQLAP+VSYR+ VALGIASIQGL +ASFE++D Sbjct: 360 PRRSASIACGAPVFEICMKLPQWALQILRQLAPEVSYRSLVALGIASIQGLPIASFEKDD 419 Query: 1042 AERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASI-VNGGI---QT 875 AERLLFF K+ +R AS ++ G+ Sbjct: 420 AERLLFFYQSCEKDSGTSKNNIIFGSPPGWLKPPPPRRKRCESSQGASPGLHEGVFAGPA 479 Query: 874 LIKKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADLHDGS 695 + K + E+K+R+++NG+ P R+R+K++A+RPIPH+R +M PF + D DG Sbjct: 480 TVYKVNEEEKDRKMANGISTPLAPARQRLKVSAMRPIPHIRRHRMTPFCGPSETDGFDGG 539 Query: 694 QVKANLPL-PPAKHSTGTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCSEE 518 QV+ LPL P K S G+T +HRKS SSA Q+KQV+SLNP+PLKKHGCGR P+ CSEE Sbjct: 540 QVEPTLPLVAPTKRSIGSTSATHRKSFSSAAQSKQVISLNPLPLKKHGCGRGPVQTCSEE 599 Query: 517 EFLKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGINW 338 EFLKDVM+FLILRGHNRLIPQGGL EFPDAILN KRLDL+NLY+EVVTRGGFHVGNGINW Sbjct: 600 EFLKDVMEFLILRGHNRLIPQGGLTEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINW 659 Query: 337 KGQVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVN 158 KGQ+FSKMRN+T TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA GDWVN Sbjct: 660 KGQIFSKMRNYTTTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVN 719 Query: 157 CGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKIN 17 CG+CGEWAHFGCDRR GLGAFKDYAKTDGLEYICP CSV+N+KKK N Sbjct: 720 CGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVTNFKKKQN 766 >ref|XP_007012522.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 3, partial [Theobroma cacao] gi|508782885|gb|EOY30141.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 3, partial [Theobroma cacao] Length = 708 Score = 936 bits (2420), Expect = 0.0 Identities = 474/727 (65%), Positives = 
551/727 (75%), Gaps = 5/727 (0%) Frame = -1 Query: 2167 IVSSGRLEVQTLKNPTPDEFRKVLDSWQPNLVYLQGER-LENEEVGSISWGGMGLSSPEA 1991 + SSGRLEVQ L +P DE R+VL+S +PN+VYLQGE+ ++EE+G + WG + LS+PE Sbjct: 1 LASSGRLEVQLLNSPNIDELRRVLESTEPNVVYLQGEQNADSEEIGPLIWGDVDLSTPET 60 Query: 1990 VSGLFSSTMPTTVYLEVPNGEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQS 1811 + GLF ST+PTTVYLE PNG++LA++LHS+GVPYVIYWKN+ S + A HF AL SVIQS Sbjct: 61 LCGLFDSTLPTTVYLETPNGDKLAEALHSQGVPYVIYWKNTFSRFAACHFRQALLSVIQS 120 Query: 1810 SSCHTWDAFQLADASFRLHCLRN----SSDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXX 1643 S HTWDAFQLA ASFRL+C+RN SS+ +K + + GP L Sbjct: 121 SCSHTWDAFQLAHASFRLYCVRNNNVVSSNSQKQSVKPGPRLLGEAPKIDVSQPEVDMQG 180 Query: 1642 XXXXXXXXSLPAIKIYDDDVNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHN 1463 LPAIKIYDDDV +RFLVCG LEDGLNALLSIE+RGSKLHN Sbjct: 181 EESSPEN--LPAIKIYDDDVTVRFLVCGSPCILDAFLLGSLEDGLNALLSIEIRGSKLHN 238 Query: 1462 RVSALPPPLQAGTFSRGVVTMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVID 1283 R SA PPPLQAGTFSRGVVTMRCD ST SSAHISLLVSGSAQTCF+DQLLENHIK+E+I+ Sbjct: 239 RASAPPPPLQAGTFSRGVVTMRCDFSTCSSAHISLLVSGSAQTCFNDQLLENHIKNEIIE 298 Query: 1282 KSELIHALSNSDENKPPTSEPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNF 1103 KS+L+HA S+S+E+K P+SEPRRS S+ACGA+VFEVCMKVP+WASQVLRQLAPDVSYR+ Sbjct: 299 KSQLVHAQSSSEESKLPSSEPRRSASIACGASVFEVCMKVPTWASQVLRQLAPDVSYRSL 358 Query: 1102 VALGIASIQGLAVASFEREDAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSS 923 V LGIASIQGL+VASFE++DAERLLFFC RQ K+ S KRS Sbjct: 359 VMLGIASIQGLSVASFEKDDAERLLFFCMRQDKDPLQDSSVIAISPSWLVPPAPSRKRSE 418 Query: 922 ICHEISASIVNGGIQTLIKKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQK 743 C + G NG+ P + +AA+RPIPH K Sbjct: 419 PCKDSKPLNCTG---------------MEGENGIARP------KSNVAAMRPIPHTHRHK 457 Query: 742 MLPFSKILDADLHDGSQVKANLPLPPAKHSTGTTPVSHRKSTSSAYQAKQVLSLNPIPLK 563 ++PFS +A+ +DG Q K NLP+ P K PV+HRK+ SS+YQA+Q++SLNP+PLK Sbjct: 458 IIPFSGFSEAERYDGDQGKVNLPVVPVKQPA---PVTHRKALSSSYQAQQIISLNPLPLK 514 Query: 562 KHGCGRSPLHVCSEEEFLKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYRE 383 KHGCGR+P+ VCSEEEFL+DVMQFLILRGH RL+PQGGLAEFPDAILNAKRLDLFNLYRE 
Sbjct: 515 KHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYRE 574 Query: 382 VVTRGGFHVGNGINWKGQVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGE 203 VV+RGGFHVGNGINWKGQVFSKMRNHT TNRMTGVGNTLKRHYETYLLEYELAHDDVDGE Sbjct: 575 VVSRGGFHVGNGINWKGQVFSKMRNHTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE 634 Query: 202 CCLLCHSSAPGDWVNCGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKK 23 CCLLCHSSA GDWVNCG+CGEWAHFGCDRR GLGAFKDYAKTDGLEY+CP CS+SN+KKK Sbjct: 635 CCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSISNFKKK 694 Query: 22 INRSGNG 2 ++ NG Sbjct: 695 PQKTVNG 701 >ref|XP_004510562.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Cicer arietinum] Length = 783 Score = 916 bits (2367), Expect = 0.0 Identities = 458/769 (59%), Positives = 565/769 (73%), Gaps = 13/769 (1%) Frame = -1 Query: 2290 FHTQGALKHTCNLLAVVCSE--PEESKPIRDVSEERRRFPFPEIVSSGRLEVQTLKNPTP 2117 FH QG+ K TC LL V + E+ P + FPFPE+VSSGRLEVQTL NP Sbjct: 4 FHPQGSSKQTCTLLTVTSATRCAEQKHP-----QNHHNFPFPELVSSGRLEVQTLCNPEK 58 Query: 2116 DEFRKVLDSWQPNLVYLQGERLENEEVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVP 1937 ++F KVL+S+QP++VYLQGE+L NEEVGS+ W G+ LS+PE +S LF +++PT VYLE+P Sbjct: 59 EQFCKVLESYQPSIVYLQGEQLVNEEVGSVVWQGVELSTPEDISELFGTSLPTAVYLEIP 118 Query: 1936 NGEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADASFRL 1757 NGE A++LH KG+PYV++WKN+ S Y A HF A FSV+QSSS HTWDAF LA ASF L Sbjct: 119 NGESFAEALHLKGIPYVVFWKNAFSRYAACHFRQAFFSVVQSSSTHTWDAFHLAHASFEL 178 Query: 1756 HCLRNS----SDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXS---LPAIKI 1598 +C++N+ +D + + GP+L S LP+I+I Sbjct: 179 YCVQNNQVLPTDSNDADSDMGPHLLGDCLKIHIDPPEMGEEEEDDDDDESSSGSLPSIQI 238 Query: 1597 YDDDVNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFS 1418 +DD+VN+RFL+CG LEDGL ALL+IE+RG KLH + SA PPPLQA FS Sbjct: 239 HDDEVNLRFLICGEPSTVDESLLRSLEDGLRALLTIEIRGCKLHGKYSAPPPPLQAAAFS 298 Query: 1417 RGVVTMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENK 1238 RGVVTMRCD+ST SSAHISLLVSGSAQ CF+DQLLENHIK+E+I+K +++HA S+ NK Sbjct: 299 
RGVVTMRCDISTCSSAHISLLVSGSAQACFNDQLLENHIKNEIIEKGQIVHA-QLSEANK 357 Query: 1237 PPTSEPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVAS 1058 SEPRRS S+ACGA +FE+ MK+P WA Q+LRQLAPDVSYR+ VALGIASIQGL VAS Sbjct: 358 QTISEPRRSASIACGATIFEISMKLPQWALQILRQLAPDVSYRSLVALGIASIQGLPVAS 417 Query: 1057 FEREDAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASIVNGGI- 881 FE++DAERLLFF K+G ++ S + ++ ++ G+ Sbjct: 418 FEKDDAERLLFFYQSSEKDGCANHNIVFSRPPIWLKPPPPTRKRSESSQGASPDIDDGVF 477 Query: 880 --QTLIKKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADL 707 Q IKK D E+K+R++ NG+ P R+R+K++A+RPIP VR +M PF + D Sbjct: 478 SGQGAIKKVDEEEKDRKMVNGISTPLTPARQRLKVSAMRPIPQVRRHRMTPFCGPSEMDG 537 Query: 706 HDGSQVKANLPLPPAKHST-GTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHV 530 G+ V+A++PL P K S+ ++ + RKS SS+ +KQV+SLNP+PLKKHGC R P+ Sbjct: 538 FGGAHVEASVPLVPMKRSSIASSSATQRKSFSSSALSKQVISLNPLPLKKHGCSRGPVQT 597 Query: 529 CSEEEFLKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGN 350 CSEEEFLKDVM+FLILRGH+RLIPQGGL+EFPDAILN KRLDL+NLY+EVVTRGGFHVGN Sbjct: 598 CSEEEFLKDVMEFLILRGHSRLIPQGGLSEFPDAILNGKRLDLYNLYKEVVTRGGFHVGN 657 Query: 349 GINWKGQVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPG 170 GINWKGQ+FSKM N+T+TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA G Sbjct: 658 GINWKGQIFSKMGNYTSTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAG 717 Query: 169 DWVNCGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKK 23 DWVNCG+CGEWAHFGCDRR GLGAFKDYAKTDGLEYICP CS++N+KKK Sbjct: 718 DWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFKKK 766 >ref|XP_003533805.1| PREDICTED: AT-rich interactive domain-containing protein 4-like isoform X1 [Glycine max] Length = 752 Score = 912 bits (2358), Expect = 0.0 Identities = 467/772 (60%), Positives = 561/772 (72%), Gaps = 8/772 (1%) Frame = -1 Query: 2293 MFHTQGALKHTCNLLAVVCSEPEESKPIR---DVSEERRRFPFPEIVSSGRLEVQTLKNP 2123 MFH+QG +H C+LLAV+ + + K + + SE++ +PFPE+ SSGRLEV+ L P Sbjct: 2 MFHSQGVSRH-CSLLAVLSGKSRDIKQKQKQGNASEDQFPYPFPELSSSGRLEVKVLIEP 60 Query: 2122 TPDEFRKVLDSWQPNLVYLQGERLENE-EVGSISWGGMGLSSPEAVSGLFSSTMPTTVYL 
1946 T DE L+ QP+ VYLQG++LE+ E+G + W LS PEA+ GLFSS +P TVYL Sbjct: 61 TADELGLALEQLQPDFVYLQGQQLEDRGEIGPLGWEDFDLSVPEALCGLFSSKLPNTVYL 120 Query: 1945 EVPNGEELAKSLHSKGVPYVIYWKNSVSCYPASHFCSALFSVIQSSSCHTWDAFQLADAS 1766 E P GE+LA++L SKGVPY IYWKN S Y ASHF +LFSV QS+S HTWDAFQLA AS Sbjct: 121 ETPKGEKLAEALRSKGVPYTIYWKNDFSKYAASHFRHSLFSVAQSTSSHTWDAFQLALAS 180 Query: 1765 FRLHCLRNS---SDGEKVNDEFGPNLFXXXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIY 1595 FRL+C+ N+ S+ K + GP + + A+KIY Sbjct: 181 FRLYCIHNNVLPSNCHKGAGKLGPQILGVPPNIDVSPCVADMKEEEEDSPET-ISAVKIY 239 Query: 1594 DDDVNMRFLVCGMAXXXXXXXXXXLEDGLNALLSIEMRGSKLHNRVSALPPPLQAGTFSR 1415 DDDVNMRFL+CG+ LEDGLNALL E+RG KLHNR SA PPPLQAGTFSR Sbjct: 240 DDDVNMRFLICGVPCTLDACLLGSLEDGLNALLFAEIRGCKLHNRTSATPPPLQAGTFSR 299 Query: 1414 GVVTMRCDLSTSSSAHISLLVSGSAQTCFDDQLLENHIKSEVIDKSELIHALSNSDENKP 1235 GVVTMRCD+ST SSAHISLLVSGSA TCF+DQLLENHIK E+I+KS+L+ A N +++K Sbjct: 300 GVVTMRCDISTCSSAHISLLVSGSADTCFNDQLLENHIKKELIEKSQLVQAFPNHEQSKA 359 Query: 1234 PTSEPRRSVSVACGAAVFEVCMKVPSWASQVLRQLAPDVSYRNFVALGIASIQGLAVASF 1055 P+SEPRRS SVACG++VFEVCM+VP+WASQVLRQLAP++SYR+ V LGIASIQGL VASF Sbjct: 360 PSSEPRRSASVACGSSVFEVCMQVPAWASQVLRQLAPNLSYRSLVMLGIASIQGLPVASF 419 Query: 1054 EREDAERLLFFCSRQGKNGFXXXXXXXXXXXXXXXXXXSCKRSSICHEISASIVNGGIQT 875 ++DAERLLFFC+RQ K S KRS C S SI + G Sbjct: 420 NKDDAERLLFFCTRQEKENCPNDHVFSGIPSWLKPPSTSRKRSEPCSS-SKSINDSG--- 475 Query: 874 LIKKEDNEDKERRLSNGVGLPSMTQRRRIKIAALRPIPHVRHQKMLPFSKILDADLHDGS 695 R +G + R++ +A++RPIPH K+LPFS + + +DG Sbjct: 476 ------------RGVEAIG----SHRQKFNLASMRPIPHSNRHKILPFSGLSEGTRYDGD 519 Query: 694 QVKANLPLPPAKHS-TGTTPVSHRKSTSSAYQAKQVLSLNPIPLKKHGCGRSPLHVCSEE 518 K+NLPL P KH+ +G T V++RKS S+++QA Q++SLNP+P+KKHGC R+P+ CSEE Sbjct: 520 HGKSNLPLAPIKHNVSGPTSVTNRKSVSNSFQAHQIISLNPLPMKKHGCDRAPIRACSEE 579 Query: 517 EFLKDVMQFLILRGHNRLIPQGGLAEFPDAILNAKRLDLFNLYREVVTRGGFHVGNGINW 338 EFL+DVMQFLILRGHNRLIP GGLAEFPDAILNAKRLDLFNLYREVV+RGGFHVGNGINW Sbjct: 580 EFLRDVMQFLILRGHNRLIPPGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 639 Query: 337 
KGQVFSKMRNHTATNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAPGDWVN 158 KGQVFSKMRNHT TNRMTGVGNTLKRHYETYLLEYEL+HDDVDGECCL+CHSSA GDWVN Sbjct: 640 KGQVFSKMRNHTMTNRMTGVGNTLKRHYETYLLEYELSHDDVDGECCLMCHSSAAGDWVN 699 Query: 157 CGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKKINRSGNG 2 CG+CGEWAHFGCDRR GLGAFKDYAKTDGLEY+CP+CS + KK ++ NG Sbjct: 700 CGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPRCSALKFSKKSQKTANG 751 >ref|XP_006587067.1| PREDICTED: AT-rich interactive domain-containing protein 4-like isoform X2 [Glycine max] Length = 795 Score = 907 bits (2344), Expect = 0.0 Identities = 471/804 (58%), Positives = 570/804 (70%), Gaps = 8/804 (0%) Frame = -1 Query: 2389 YVFLKWRSGFLASSLLHYKFPFSVRVRHSNLDMFHTQGALKHTCNLLAVVCSEPEESKPI 2210 YVF +R + Y F +V + + H +G +H C+LLAV+ + + K Sbjct: 22 YVFKSYR--------IIYWFTLTVPIEFGTVPR-HKKGVSRH-CSLLAVLSGKSRDIKQK 71 Query: 2209 R---DVSEERRRFPFPEIVSSGRLEVQTLKNPTPDEFRKVLDSWQPNLVYLQGERLENE- 2042 + + SE++ +PFPE+ SSGRLEV+ L PT DE L+ QP+ VYLQG++LE+ Sbjct: 72 QKQGNASEDQFPYPFPELSSSGRLEVKVLIEPTADELGLALEQLQPDFVYLQGQQLEDRG 131 Query: 2041 EVGSISWGGMGLSSPEAVSGLFSSTMPTTVYLEVPNGEELAKSLHSKGVPYVIYWKNSVS 1862 E+G + W LS PEA+ GLFSS +P TVYLE P GE+LA++L SKGVPY IYWKN S Sbjct: 132 EIGPLGWEDFDLSVPEALCGLFSSKLPNTVYLETPKGEKLAEALRSKGVPYTIYWKNDFS 191 Query: 1861 CYPASHFCSALFSVIQSSSCHTWDAFQLADASFRLHCLRNS---SDGEKVNDEFGPNLFX 1691 Y ASHF +LFSV QS+S HTWDAFQLA ASFRL+C+ N+ S+ K + GP + Sbjct: 192 KYAASHFRHSLFSVAQSTSSHTWDAFQLALASFRLYCIHNNVLPSNCHKGAGKLGPQILG 251 Query: 1690 XXXXXXXXXXXXXXXXXXXXXXXXSLPAIKIYDDDVNMRFLVCGMAXXXXXXXXXXLEDG 1511 + A+KIYDDDVNMRFL+CG+ LEDG Sbjct: 252 VPPNIDVSPCVADMKEEEEDSPET-ISAVKIYDDDVNMRFLICGVPCTLDACLLGSLEDG 310 Query: 1510 LNALLSIEMRGSKLHNRVSALPPPLQAGTFSRGVVTMRCDLSTSSSAHISLLVSGSAQTC 1331 LNALL E+RG KLHNR SA PPPLQAGTFSRGVVTMRCD+ST SSAHISLLVSGSA TC Sbjct: 311 LNALLFAEIRGCKLHNRTSATPPPLQAGTFSRGVVTMRCDISTCSSAHISLLVSGSADTC 370 Query: 1330 FDDQLLENHIKSEVIDKSELIHALSNSDENKPPTSEPRRSVSVACGAAVFEVCMKVPSWA 1151 F+DQLLENHIK E+I+KS+L+ A N +++K P+SEPRRS SVACG++VFEVCM+VP+WA Sbjct: 371 
FNDQLLENHIKKELIEKSQLVQAFPNHEQSKAPSSEPRRSASVACGSSVFEVCMQVPAWA 430 Query: 1150 SQVLRQLAPDVSYRNFVALGIASIQGLAVASFEREDAERLLFFCSRQGKNGFXXXXXXXX 971 SQVLRQLAP++SYR+ V LGIASIQGL VASF ++DAERLLFFC+RQ K Sbjct: 431 SQVLRQLAPNLSYRSLVMLGIASIQGLPVASFNKDDAERLLFFCTRQEKENCPNDHVFSG 490 Query: 970 XXXXXXXXXXSCKRSSICHEISASIVNGGIQTLIKKEDNEDKERRLSNGVGLPSMTQRRR 791 S KRS C S SI + G R +G + R++ Sbjct: 491 IPSWLKPPSTSRKRSEPCSS-SKSINDSG---------------RGVEAIG----SHRQK 530 Query: 790 IKIAALRPIPHVRHQKMLPFSKILDADLHDGSQVKANLPLPPAKHS-TGTTPVSHRKSTS 614 +A++RPIPH K+LPFS + + +DG K+NLPL P KH+ +G T V++RKS S Sbjct: 531 FNLASMRPIPHSNRHKILPFSGLSEGTRYDGDHGKSNLPLAPIKHNVSGPTSVTNRKSVS 590 Query: 613 SAYQAKQVLSLNPIPLKKHGCGRSPLHVCSEEEFLKDVMQFLILRGHNRLIPQGGLAEFP 434 +++QA Q++SLNP+P+KKHGC R+P+ CSEEEFL+DVMQFLILRGHNRLIP GGLAEFP Sbjct: 591 NSFQAHQIISLNPLPMKKHGCDRAPIRACSEEEFLRDVMQFLILRGHNRLIPPGGLAEFP 650 Query: 433 DAILNAKRLDLFNLYREVVTRGGFHVGNGINWKGQVFSKMRNHTATNRMTGVGNTLKRHY 254 DAILNAKRLDLFNLYREVV+RGGFHVGNGINWKGQVFSKMRNHT TNRMTGVGNTLKRHY Sbjct: 651 DAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTMTNRMTGVGNTLKRHY 710 Query: 253 ETYLLEYELAHDDVDGECCLLCHSSAPGDWVNCGLCGEWAHFGCDRRPGLGAFKDYAKTD 74 ETYLLEYEL+HDDVDGECCL+CHSSA GDWVNCG+CGEWAHFGCDRR GLGAFKDYAKTD Sbjct: 711 ETYLLEYELSHDDVDGECCLMCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTD 770 Query: 73 GLEYICPQCSVSNYKKKINRSGNG 2 GLEY+CP+CS + KK ++ NG Sbjct: 771 GLEYVCPRCSALKFSKKSQKTANG 794