BLASTX nr result
ID: Catharanthus23_contig00004111
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00004111
         (3134 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006362097.1| PREDICTED: AT-rich interactive domain-contai...  1175   0.0
ref|XP_004252398.1| PREDICTED: AT-rich interactive domain-contai...  1174   0.0
emb|CBI35803.3| unnamed protein product [Vitis vinifera]             1145   0.0
gb|EMJ18234.1| hypothetical protein PRUPE_ppa001668mg [Prunus pe...  1120   0.0
ref|XP_006452906.1| hypothetical protein CICLE_v10007563mg [Citr...  1108   0.0
ref|XP_006474564.1| PREDICTED: AT-rich interactive domain-contai...  1107   0.0
gb|EXB64667.1| AT-rich interactive domain-containing protein 4 [...  1096   0.0
ref|XP_004303747.1| PREDICTED: AT-rich interactive domain-contai...  1086   0.0
ref|XP_006381551.1| hypothetical protein POPTR_0006s13780g [Popu...  1070   0.0
gb|EOY30139.1| ARID/BRIGHT DNA-binding domain-containing protein...  1062   0.0
ref|XP_002277324.1| PREDICTED: AT-rich interactive domain-contai...  1051   0.0
ref|XP_002324130.2| arid/bright DNA-binding domain-containing fa...  1050   0.0
ref|XP_002516200.1| DNA binding protein, putative [Ricinus commu...  1048   0.0
ref|XP_003547888.1| PREDICTED: AT-rich interactive domain-contai...  1038   0.0
gb|ESW07366.1| hypothetical protein PHAVU_010G123900g [Phaseolus...  1034   0.0
gb|EOY30141.1| ARID/BRIGHT DNA-binding domain-containing protein...  1024   0.0
ref|XP_004510562.1| PREDICTED: AT-rich interactive domain-contai...  1004   0.0
ref|XP_003533805.1| PREDICTED: AT-rich interactive domain-contai...   998   0.0
ref|XP_006587068.1| PREDICTED: AT-rich interactive domain-contai...
995 0.0 ref|XP_006587067.1| PREDICTED: AT-rich interactive domain-contai... 995 0.0 >ref|XP_006362097.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Solanum tuberosum] Length = 770 Score = 1175 bits (3040), Expect = 0.0 Identities = 575/770 (74%), Positives = 645/770 (83%), Gaps = 3/770 (0%) Frame = -2 Query: 2809 MSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSAD 2630 M H Q S+ +CSLLAV CG +E +Q K+V D K RY FP++VSSGRLEVQ LKNPS D Sbjct: 1 MFHCQGTSRQSCSLLAVLCGSTSEYDQKKDVHDGKPRYCFPEIVSSGRLEVQVLKNPSTD 60 Query: 2629 EFGKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPN 2450 EF KVLDSWQPNI+YLQGE L + GSLVWGG++LS+ EAI GLFSS LPT VYLELPN Sbjct: 61 EFHKVLDSWQPNIVYLQGEHLSNDEVGSLVWGGLDLSSAEAISGLFSSALPTAVYLELPN 120 Query: 2449 GEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLY 2270 GE+LAEALH+KGIPYV+YWK FS +AA HFRHA V QSS+CH WDAFQLA ASFRLY Sbjct: 121 GEKLAEALHAKGIPYVMYWKSAFSCYAASHFRHAFLCVAQSSTCHVWDAFQLAQASFRLY 180 Query: 2269 CVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDEDI 2090 CV+NN VLPE SQ+ + +GP L+G+PP I+ +S ALPAIKIYD+D+ Sbjct: 181 CVQNNFVLPEMSQRDSDNMGPHLLGDPPNIDVPPPEAGPDDDEESNSDALPAIKIYDDDV 240 Query: 2089 NMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVT 1910 MRFLVCG SLD +L + DGLNALL+IEMRGSKLHNR SALPPPLQAGTFSRGVVT Sbjct: 241 TMRFLVCGLPCSLDECLLGSIADGLNALLNIEMRGSKLHNRVSALPPPLQAGTFSRGVVT 300 Query: 1909 MRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSE 1730 MRCD+ST SSAHISLLVSGSAQTCFDD +LENHIKSE+I+N+ LVH LP+ EEN+P +S Sbjct: 301 MRCDLSTSSSAHISLLVSGSAQTCFDDLLLENHIKSEIIENSTLVHVLPSDEENRPPISA 360 Query: 1729 PRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDD 1550 PRRS+S+ACG+ V+EVC+K+P WASQVLRQLAPDVSYR+LVALGIASIQGLAVASFEKDD Sbjct: 361 PRRSMSVACGSEVFEVCMKVPMWASQVLRQLAPDVSYRSLVALGIASIQGLAVASFEKDD 420 Query: 1549 AERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVKREEL- 1373 A+RLLFF +QGKD N IG P WLR PAPSRKRS+ Q +S QNG+ + Sbjct: 421 AQRLLFFYTKQGKDGFFGNFKIGDPPAWLRPPAPSRKRSDFYQGASYICQNGSTPGNHVA 480 
Query: 1372 --EDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPN 1199 E+KES L NG P+V ARQK KVAA+RPIPHVRHQKMLPFS +SE+D +G QVK N Sbjct: 481 VKEEKESRLGNGVATPLVTARQKLKVAAMRPIPHVRHQKMLPFSRISELDSLDGNQVKTN 540 Query: 1198 LPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLK 1019 LP + +K ++VGVTP +HRKS S+S+QAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLK Sbjct: 541 LPIIPSTKGSNVGVTPVTHRKSASSSHQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLK 600 Query: 1018 DVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 839 DVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV Sbjct: 601 DVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 660 Query: 838 FSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVC 659 FSKMRNHTVTN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+C Sbjct: 661 FSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCNSSAAGDWVNCGIC 720 Query: 658 GEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 509 GEWAHFGCDRR GLGAFKDYAKTDGLEYICPQCSV+ FKKK+ +T+NGYS Sbjct: 721 GEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVTNFKKKVLRTANGYS 770 >ref|XP_004252398.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Solanum lycopersicum] Length = 771 Score = 1174 bits (3037), Expect = 0.0 Identities = 577/771 (74%), Positives = 647/771 (83%), Gaps = 4/771 (0%) Frame = -2 Query: 2809 MSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSAD 2630 M H Q AS+ +CSLLAV CGR +E +Q K+V D K RY FP++VSSGRLEVQ LKNPS D Sbjct: 1 MFHCQGASRQSCSLLAVLCGRTSEYDQKKDVHDGKPRYCFPEIVSSGRLEVQVLKNPSTD 60 Query: 2629 EFGKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPN 2450 EF KVLDSWQPNI+YLQGE L + GSLVWGG++LS+ EAI GLFSSVLPT VYLELPN Sbjct: 61 EFHKVLDSWQPNIVYLQGEHLSNDEVGSLVWGGLDLSSAEAISGLFSSVLPTAVYLELPN 120 Query: 2449 GEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLY 2270 GE+LAEALH+KGIPYV+YWK FS +AA HFRHA V QSS+CH WDAFQLAHASFRLY Sbjct: 121 GEKLAEALHAKGIPYVMYWKSAFSCYAASHFRHAFLCVAQSSTCHVWDAFQLAHASFRLY 180 Query: 2269 CVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDEDI 2090 CVRNN L 
E SQ+ + +GP L+G+PP I+ +S ALPAIKIYD+D+ Sbjct: 181 CVRNNFALSEMSQRDSDNVGPHLLGDPPNIDVPLPEAGPEDDEESNSDALPAIKIYDDDV 240 Query: 2089 NMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVT 1910 MRFLVCG SLD +L + DGLNALL+IEMRGSKLHNR SALPPPLQAGTFSRGVVT Sbjct: 241 TMRFLVCGLPCSLDECLLGSIADGLNALLNIEMRGSKLHNRVSALPPPLQAGTFSRGVVT 300 Query: 1909 MRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSE 1730 MRCD+ST SSAHISLLVSGSAQTCFDD +LENHIKSE+I+N+ LVH LP+ EEN+P +S Sbjct: 301 MRCDLSTSSSAHISLLVSGSAQTCFDDLLLENHIKSEIIENSTLVHVLPSDEENRPPISA 360 Query: 1729 PRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDD 1550 PRRS+S+ACG+ V+EVC+K+P WASQVLRQLAPDVSYR+LVALGIASIQGLAVASFEKDD Sbjct: 361 PRRSMSVACGSEVFEVCMKVPMWASQVLRQLAPDVSYRSLVALGIASIQGLAVASFEKDD 420 Query: 1549 AERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVKREEL- 1373 A+RLLFF +QGKD N +G+ P WLR PAPSRKRS+ Q +S QNG + Sbjct: 421 AQRLLFFCTKQGKDGFFGNFKMGNPPAWLRPPAPSRKRSDFYQGASYICQNGLTPGNHVA 480 Query: 1372 --EDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPN 1199 E+KES L NG P+V ARQK KVAA+RPIPHVRHQKMLPFS +SE+D +G QVK N Sbjct: 481 VKEEKESRLGNGVATPLVTARQKLKVAAMRPIPHVRHQKMLPFSRISELDSLDGNQVKTN 540 Query: 1198 LPPVAPS-KHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFL 1022 LP + S K ++VGVTPA+HRKS S+S+QAKQIISLNPLPLKKHGCGRSPIHVCSEEEFL Sbjct: 541 LPIIPSSTKGSNVGVTPATHRKSASSSHQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFL 600 Query: 1021 KDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQ 842 KDVMQFLILRGHTRLIPQ G+AEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQ Sbjct: 601 KDVMQFLILRGHTRLIPQSGIAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQ 660 Query: 841 VFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGV 662 VFSKMRNHTVTN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+ Sbjct: 661 VFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCNSSAAGDWVNCGI 720 Query: 661 CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 509 CGEWAHFGCDRR GLGAFKDYAKTDGLEYICPQCSV+ FKKK+ +T+NGYS Sbjct: 721 CGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVTNFKKKVLRTANGYS 771 
>emb|CBI35803.3| unnamed protein product [Vitis vinifera] Length = 746 Score = 1145 bits (2961), Expect = 0.0 Identities = 569/767 (74%), Positives = 626/767 (81%) Frame = -2 Query: 2809 MSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSAD 2630 M H Q S +TC LLAV CG+ +E +Q +++ RYPFPD VSSGRLEVQTL +PS D Sbjct: 1 MLHTQGISNHTCGLLAVTCGKTSECKQEHETSNDRPRYPFPDFVSSGRLEVQTLTSPSPD 60 Query: 2629 EFGKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPN 2450 EF +V +S QPN +Y QGEQL + GSLVWGGV LS+ E ICGLF S LPTTVYLE+PN Sbjct: 61 EFRRVFESVQPNFVYFQGEQLQNDEVGSLVWGGVELSSAEDICGLFGSKLPTTVYLEIPN 120 Query: 2449 GEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLY 2270 GE+LAEALHSKGIPYVIYWK+ FS +AACHFR+ALFSVVQSSS HTWDAFQLA+ASFRLY Sbjct: 121 GEKLAEALHSKGIPYVIYWKNAFSCYAACHFRNALFSVVQSSSTHTWDAFQLAYASFRLY 180 Query: 2269 CVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDEDI 2090 CVRNN VLP NS KV+GKLGP L+G+P I+ S G LPAIKIYD+D+ Sbjct: 181 CVRNNHVLPANSHKVSGKLGPRLLGDPATIDVPPPEVDAGEDEEGSLGTLPAIKIYDDDV 240 Query: 2089 NMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVT 1910 +RFLVCG+ LDS + E LEDGLNALLSIE+RGSKLHNR SA PPPLQAGTFSRGVVT Sbjct: 241 GIRFLVCGEPCMLDSCLFESLEDGLNALLSIEIRGSKLHNRVSAPPPPLQAGTFSRGVVT 300 Query: 1909 MRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSE 1730 MRCD+ST SSAHISLLVSGSAQTCFDDQ+LEN+IK EV + + LVHALP SE NKP LSE Sbjct: 301 MRCDLSTCSSAHISLLVSGSAQTCFDDQLLENNIKKEVTEQSQLVHALPYSEGNKPPLSE 360 Query: 1729 PRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDD 1550 PRRS SIACGA+V+EVC K+P WASQVLRQLAPDVSYR+LVALGIASIQGLAVASFEKDD Sbjct: 361 PRRSASIACGAAVFEVCAKVPAWASQVLRQLAPDVSYRSLVALGIASIQGLAVASFEKDD 420 Query: 1549 AERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVKREELE 1370 A RLLFF RQGK H NN LP+WL+ P PSRKR E Q + Sbjct: 421 ANRLLFFCTRQGKYIHPNNFTPSRLPSWLKPPPPSRKRVEPSQDT--------------- 465 Query: 1369 DKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPNLPP 1190 NG +P++PA Q+ KVAA+RPIPH+RH KMLPFSG+SE DGH+G QVK NL Sbjct: 466 
------MNGVTMPLLPAGQRLKVAAMRPIPHIRHHKMLPFSGISEADGHDGGQVKANLSV 519 Query: 1189 VAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLKDVM 1010 P+KH+ VG T A HRKS S+SYQAKQIISLNPLPLKKHGCGRSPI VCSEEEFLKDVM Sbjct: 520 PPPTKHSIVGSTSAMHRKSFSSSYQAKQIISLNPLPLKKHGCGRSPIRVCSEEEFLKDVM 579 Query: 1009 QFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSK 830 QFL LRGHTRLIPQGGLAEFPDAILNAKRLDL+NLYREVVSRGGFHVGNGINWKGQVFSK Sbjct: 580 QFLNLRGHTRLIPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGFHVGNGINWKGQVFSK 639 Query: 829 MRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVCGEW 650 MRNHTVTN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+CGEW Sbjct: 640 MRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEW 699 Query: 649 AHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 509 AHFGCDRRQGLGAFKDYAKTDGLEYICPQCSV+ FKKK K NG+S Sbjct: 700 AHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVTNFKKKANKAPNGFS 746 >gb|EMJ18234.1| hypothetical protein PRUPE_ppa001668mg [Prunus persica] Length = 783 Score = 1120 bits (2896), Expect = 0.0 Identities = 552/774 (71%), Positives = 640/774 (82%), Gaps = 7/774 (0%) Frame = -2 Query: 2809 MSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSAD 2630 M+H Q ASK TCSLL V CG+ +E + ++ LDEK +YPFP+LVS GRLEVQTL PS + Sbjct: 1 MNHSQGASKQTCSLLVVTCGKISEEKPNEDTLDEKLKYPFPELVSLGRLEVQTLTKPSKE 60 Query: 2629 EFGKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPN 2450 EF K+L+S++PN++YLQGEQL + GS VW V+LST EAI +FS+ LPTTVYLE+PN Sbjct: 61 EFCKMLESYKPNLVYLQGEQLENNEIGSPVWEDVDLSTAEAISEIFSATLPTTVYLEVPN 120 Query: 2449 GEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLY 2270 GE LA ALHSKGIPYVIYWKH+FSS+AACHFRHAL SVVQSSS HTWDAFQLA+ASFRLY Sbjct: 121 GENLAAALHSKGIPYVIYWKHEFSSYAACHFRHALLSVVQSSSTHTWDAFQLAYASFRLY 180 Query: 2269 CVRNNLVLPENSQKVNG-KLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDED 2093 CV N+ +P N K + +LGP L+G+ KIN S G LPAIKI+D+D Sbjct: 181 CVENSHAIPANRHKSSSAELGPCLLGDRLKINVDPPEADVEEDEEGSLGTLPAIKIHDDD 240 Query: 2092 INMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVV 1913 + 
+RFLVCG+ +LD+S+LEPLEDGLNALL+IEMRGSKLH + SA PPPLQAGTFSRGVV Sbjct: 241 VILRFLVCGEPSTLDASLLEPLEDGLNALLNIEMRGSKLHGKFSAPPPPLQAGTFSRGVV 300 Query: 1912 TMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLS 1733 TMRCD+ST SSAHISLLVSGSAQTCFDDQ+LENHIK+EVI+ LV ALP +E NK L+ Sbjct: 301 TMRCDVSTCSSAHISLLVSGSAQTCFDDQLLENHIKNEVIEEIQLVRALPNNEGNKVPLA 360 Query: 1732 EPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKD 1553 EPR+S SIACGA+V+EVC+K+P WASQVLRQLAPDVSY +LVALGIASIQGL VASFEK+ Sbjct: 361 EPRKSASIACGATVFEVCMKVPAWASQVLRQLAPDVSYHSLVALGIASIQGLPVASFEKE 420 Query: 1552 DAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSI---YSQ---NGA 1391 DAERLLFF + GKD N+ +GS PTWLR P PSRKRS+ CQ +S YSQ + A Sbjct: 421 DAERLLFFCSSLGKDNKSNDFILGSPPTWLRPPPPSRKRSQPCQETSRGSNYSQRLPSLA 480 Query: 1390 VKREELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQ 1211 + + ++KE+ NG P++P RQ+ K+AA+RPIPHVR KM PFSG+SE+DGH+G Q Sbjct: 481 ASKIDEDNKEAGAMNGVSTPLLPPRQRLKIAAMRPIPHVRRPKMTPFSGMSELDGHDGGQ 540 Query: 1210 VKPNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEE 1031 K NLPP P+K N VG+TP + RKS S+S +KQIISLNPLPLKKHGCGRSPIH C EE Sbjct: 541 FKANLPPAPPTKLNIVGLTPTTQRKSYSSSSHSKQIISLNPLPLKKHGCGRSPIHSCLEE 600 Query: 1030 EFLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 851 EFLKDVMQFLILRGH+RLIPQGGLAEFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINW Sbjct: 601 EFLKDVMQFLILRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINW 660 Query: 850 KGQVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVN 671 KGQ+FSKMRN+T+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVN Sbjct: 661 KGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVN 720 Query: 670 CGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 509 CG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CS+S FKKK QK +NG+S Sbjct: 721 CGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSISNFKKKPQKIANGFS 774 >ref|XP_006452906.1| hypothetical protein CICLE_v10007563mg [Citrus clementina] gi|557556132|gb|ESR66146.1| hypothetical protein CICLE_v10007563mg [Citrus clementina] Length = 745 Score = 
1108 bits (2866), Expect = 0.0 Identities = 551/769 (71%), Positives = 627/769 (81%), Gaps = 2/769 (0%) Frame = -2 Query: 2812 LMSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSA 2633 +M H Q +S+ CSLLAV + +++Q + D+K +YPFP++ SSGRLEV L +PS Sbjct: 1 MMFHAQSSSRNHCSLLAVLSRKFVDDKQKQAATDDKPKYPFPEIASSGRLEVHLLSSPST 60 Query: 2632 DEFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEAICGLFSSVLPTTVYLEL 2456 DEF ++L+S +PNI+YLQGE++ D GSLVWG V+LSTPEA+CGLF S LPTTVYLE+ Sbjct: 61 DEFRRLLESSEPNIVYLQGEKINDSEEIGSLVWGDVDLSTPEALCGLFGSTLPTTVYLEI 120 Query: 2455 PNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFR 2276 PNGE AEALHS+G+PYVIYWKH FS +AACHF AL SVVQSS HTWDAFQLAHASFR Sbjct: 121 PNGENFAEALHSRGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFR 180 Query: 2275 LYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDE 2096 LYCVRNN+V+ NSQK + KLGP L+G+PPKI+ S LPAIKIYD+ Sbjct: 181 LYCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEEN-SPENLPAIKIYDD 239 Query: 2095 DINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGV 1916 D+ MRFLVCG +LD+S+L PLEDGLNALL+IE+RGSKLHNR SA PPPLQAG FSRGV Sbjct: 240 DVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGV 299 Query: 1915 VTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRL 1736 VTMRCD+ST SSAHISLLVSGSAQTCF+DQ+LENHIK+E+I+N+ LVHALP S +N+ Sbjct: 300 VTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPP 359 Query: 1735 SEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEK 1556 SEPR+S SIACGASV+EV +K+ WASQVLRQLAPDVSYR+LV LGIASIQGL+VASFEK Sbjct: 360 SEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEK 419 Query: 1555 DDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISS-IYSQNGAVKRE 1379 DDAERLLFF RQGK +H N + P+WL SPAPSRKRSE C+ S + S+N Sbjct: 420 DDAERLLFFCTRQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESEN------ 473 Query: 1378 ELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPN 1199 V R K AA+RPIPH RH KMLPFSG SEI+ ++G QVK N Sbjct: 474 ----------------VCNVRPKLNAAAMRPIPHTRHHKMLPFSGFSEIERYDGDQVKAN 517 Query: 1198 
LPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLK 1019 LP VAP KH+S G TP +HRKS S+SYQA+QIISLNPLPLKKHGCGR+PI VCSEEEFL+ Sbjct: 518 LP-VAPLKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLR 576 Query: 1018 DVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 839 DVMQFLILRGHTRL+PQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV Sbjct: 577 DVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 636 Query: 838 FSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVC 659 FSKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+C Sbjct: 637 FSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGIC 696 Query: 658 GEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 512 GEWAHFGCDRRQGLGAFKDYAKTDGLEY+CPQCSV+ FKKK QKTSNGY Sbjct: 697 GEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNGY 745 >ref|XP_006474564.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Citrus sinensis] Length = 745 Score = 1107 bits (2863), Expect = 0.0 Identities = 551/769 (71%), Positives = 627/769 (81%), Gaps = 2/769 (0%) Frame = -2 Query: 2812 LMSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSA 2633 +M H Q +S+ CSLLAV + +++Q + D+K +YPFP++ SSGRLEV L +PS Sbjct: 1 MMFHAQSSSRNHCSLLAVLSRKFVDDKQKQAATDDKPKYPFPEIASSGRLEVHLLSSPST 60 Query: 2632 DEFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEAICGLFSSVLPTTVYLEL 2456 DEF ++L+S +PNI+YLQGE++ D GSLVWG V+LSTPEA+CGLF S LPTTVYLE+ Sbjct: 61 DEFRRLLESSEPNIVYLQGEKINDSEEIGSLVWGDVDLSTPEALCGLFGSTLPTTVYLEI 120 Query: 2455 PNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFR 2276 PNGE AEALHS+G+PYVIYWKH FS +AACHF AL SVVQSS HTWDAFQLAHASFR Sbjct: 121 PNGENFAEALHSRGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFR 180 Query: 2275 LYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDE 2096 LYCVRNN+V+ NSQK + KLGP L+G+PPKI+ S LPAIKIYD+ Sbjct: 181 LYCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEEN-SPENLPAIKIYDD 239 Query: 2095 DINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGV 1916 D+ MRFLVCG +LD+S+L PLEDGLNALL+IE+RGSKLHNR SA 
PPPLQAG FSRGV Sbjct: 240 DVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGV 299 Query: 1915 VTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRL 1736 VTMRCD+ST SSAHISLLVSGSAQTCF+DQ+LENHIK+E+I+N+ LVHALP S +N+ Sbjct: 300 VTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPP 359 Query: 1735 SEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEK 1556 SEPR+S SIACGASV+EV +K+ WASQVLRQLAPDVSYR+LV LGIASIQGL+VASFEK Sbjct: 360 SEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEK 419 Query: 1555 DDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISS-IYSQNGAVKRE 1379 DDAERLLFF RQGK +H N + P+WL SPAPSRKRSE C+ S + S+N Sbjct: 420 DDAERLLFFCTRQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESEN------ 473 Query: 1378 ELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPN 1199 V R K AA+RPIPH RH KMLPFSG SEI+ ++G QVK N Sbjct: 474 ----------------VCNVRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKAN 517 Query: 1198 LPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLK 1019 LP VAP KH+S G TP +HRKS S+SYQA+QIISLNPLPLKKHGCGR+PI VCSEEEFL+ Sbjct: 518 LP-VAPLKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLR 576 Query: 1018 DVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 839 DVMQFLILRGHTRL+PQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV Sbjct: 577 DVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 636 Query: 838 FSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVC 659 FSKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+C Sbjct: 637 FSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGIC 696 Query: 658 GEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 512 GEWAHFGCDRRQGLGAFKDYAKTDGLEY+CPQCSV+ FKKK QKTSNGY Sbjct: 697 GEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNGY 745 >gb|EXB64667.1| AT-rich interactive domain-containing protein 4 [Morus notabilis] Length = 779 Score = 1096 bits (2835), Expect = 0.0 Identities = 543/771 (70%), Positives = 633/771 (82%), Gaps = 4/771 (0%) Frame = -2 Query: 2809 
MSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSAD 2630 M H Q +SK TCSLLAV CG +E+++ K+V + +S YPFP+L+SSGRLEVQTL +PS + Sbjct: 1 MFHSQGSSKQTCSLLAVTCGNVSESKRKKDVPENRSLYPFPELISSGRLEVQTLTSPSKE 60 Query: 2629 EFGKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPN 2450 EF K+L+S++PN++YLQGEQL + G LVWG V+LSTPE++ LF + LPTTVYLE+P+ Sbjct: 61 EFSKLLESYKPNLVYLQGEQLANDEVGPLVWGDVDLSTPESVSELFGTTLPTTVYLEIPD 120 Query: 2449 GEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLY 2270 EELAE LHSKG+PYVIYWK +FS AACHFR+AL SVV+SSS H WDAFQLA+ASFRLY Sbjct: 121 CEELAEELHSKGVPYVIYWKDRFSRHAACHFRNALLSVVKSSSTHAWDAFQLAYASFRLY 180 Query: 2269 CVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDEDI 2090 CVRNN VLP +++ + GP L+G+ KIN S LPAIKI+D+D+ Sbjct: 181 CVRNNHVLPSKGHEISDEQGPCLLGDRLKINVDPPAADVEDDEDGSLDTLPAIKIHDDDL 240 Query: 2089 NMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVT 1910 ++RFLVCG +LD S+LEPLEDGLNALL+IE+RG +LH + SA PPPLQAGTFSRGVVT Sbjct: 241 SLRFLVCGVPSTLDESVLEPLEDGLNALLNIEIRGGRLHGKFSAPPPPLQAGTFSRGVVT 300 Query: 1909 MRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPT-SEENKPRLS 1733 MRCD+ST S AHIS+L+SGSAQTCFDDQ+LENHIK+E+I+N+ LV ALPT SE NK LS Sbjct: 301 MRCDLSTCSCAHISILLSGSAQTCFDDQLLENHIKNEIIENSQLVRALPTASEGNKLPLS 360 Query: 1732 EPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKD 1553 EPR+S SIACGA+V+EVC+K+P WASQVLRQLAPDVSY +LVALGIASIQG+ VASFEK+ Sbjct: 361 EPRKSASIACGATVFEVCMKVPAWASQVLRQLAPDVSYHSLVALGIASIQGIPVASFEKE 420 Query: 1552 DAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQ---NGAVKR 1382 DAERLLFF + QGK E N+L + P WLR PAPSRKRS+ S N V + Sbjct: 421 DAERLLFFCSSQGK-EISNDLVFSNPPPWLRPPAPSRKRSQETSPGSHDGHRVPNQVVSK 479 Query: 1381 EELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKP 1202 E EDKE NG LP++PARQ+ KVAA+RPIPHVR KM PFSG+SE DGH+G QVK Sbjct: 480 SEEEDKERGPSNGVSLPLLPARQRLKVAAMRPIPHVRRPKMTPFSGISEADGHDGGQVKA 539 Query: 1201 NLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFL 1022 +P P+K + VG+TP++ RKS S+S QAKQIISLNPLPLKKHGCGRS IH 
CSEEEFL Sbjct: 540 IVPVAPPTKLSIVGLTPSAQRKSFSSSSQAKQIISLNPLPLKKHGCGRSSIHTCSEEEFL 599 Query: 1021 KDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQ 842 KDVMQFLILRGHTRLIPQ GLAEFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ Sbjct: 600 KDVMQFLILRGHTRLIPQSGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQ 659 Query: 841 VFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGV 662 +FSKMRN+T+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+ Sbjct: 660 IFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGI 719 Query: 661 CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 509 CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CSVS FKKK QK SNG+S Sbjct: 720 CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVSNFKKKSQKVSNGFS 770 >ref|XP_004303747.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Fragaria vesca subsp. vesca] Length = 779 Score = 1086 bits (2809), Expect = 0.0 Identities = 534/763 (69%), Positives = 630/763 (82%), Gaps = 7/763 (0%) Frame = -2 Query: 2779 TCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSADEFGKVLDSWQ 2600 TCS+L V CG +E+++ K ++K RYPFP+LVSSGRLEVQTL NPS +EF K+L+S++ Sbjct: 7 TCSVLVVTCGEISEDKRGKETPEDKLRYPFPELVSSGRLEVQTLTNPSEEEFCKLLESYK 66 Query: 2599 PNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPNGEELAEALHS 2420 PN++YLQGEQL + G LVW LST E++ +F + LPTTVYLE+PNGEELA AL S Sbjct: 67 PNLVYLQGEQLENDEVGPLVWRDAYLSTAESMSDIFDATLPTTVYLEVPNGEELAVALQS 126 Query: 2419 KGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLYCVRNNLVLPE 2240 KGIPYVIYWK S++AACHFRHAL SVVQSSS HTWDAFQLAHASFRLYCV+N+ V+ Sbjct: 127 KGIPYVIYWKDAISTYAACHFRHALLSVVQSSSTHTWDAFQLAHASFRLYCVQNDHVVRV 186 Query: 2239 NSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDEDINMRFLVCGDT 2060 N K + +LGP ++GE KI+ ++G+LPAIKI+D+D+++RFLVCG Sbjct: 187 NLDKPSAELGPCILGEHLKISVDPPEADMEEDEEGATGSLPAIKIHDDDVSLRFLVCGQP 246 Query: 2059 RSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVTMRCDISTVSS 1880 +LD+ ILEPLEDGLNALL+IEMRGSKLH + SA PPPLQAGTFSRGVVTMRCDIST SS Sbjct: 247 STLDAGILEPLEDGLNALLNIEMRGSKLHGKFSAPPPPLQAGTFSRGVVTMRCDISTCSS 306 Query: 1879 
AHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSEPRRSVSIACG 1700 AHISLLVSGSAQTCFDDQ+LENHIK EVI+ LVHA+P ++ NK L EPR+S +IACG Sbjct: 307 AHISLLVSGSAQTCFDDQLLENHIKHEVIEINQLVHAVPNNDRNKLPLVEPRKSAAIACG 366 Query: 1699 ASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDDAERLLFFSAR 1520 A+V+EV +K+P WASQVLRQLAPDVSYR+LV+LGIASIQGL VASFEKDDA+RLLFF + Sbjct: 367 ATVFEVSMKVPVWASQVLRQLAPDVSYRSLVSLGIASIQGLPVASFEKDDADRLLFFCSS 426 Query: 1519 QGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQ--ISSIYSQNGA--VKREELEDKESAL 1352 + KD LN+L + + P WLR PAPS+KRS +CQ I ++ G + ++E+ E AL Sbjct: 427 RTKDSQLNDLFLSTPPAWLRPPAPSKKRSRLCQEAIPGFRNRQGLPNLAASKVEENEKAL 486 Query: 1351 R--NGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQ-VKPNLPPVAP 1181 NGF P++PARQ+ K AA+RPIPHVR KM PFSG+SE++GH+G+Q VK +LPPV P Sbjct: 487 GAVNGFSTPLLPARQRLKTAAMRPIPHVRRPKMTPFSGISEVNGHDGSQVVKAHLPPVPP 546 Query: 1180 SKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLKDVMQFL 1001 +K N VG+TP + RKS S+S QAKQIISLNPLPLKKHGCGR PIH C EEEFLKDVMQFL Sbjct: 547 TKLNIVGLTPTTQRKSYSSSSQAKQIISLNPLPLKKHGCGRGPIHSCLEEEFLKDVMQFL 606 Query: 1000 ILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRN 821 ILRGH+RLIPQGGL EFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKMRN Sbjct: 607 ILRGHSRLIPQGGLTEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMRN 666 Query: 820 HTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHF 641 +T+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+CGEWAHF Sbjct: 667 YTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHF 726 Query: 640 GCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 512 GCDRRQGLGAFKDYAKTDGLEYICP CS+S FKKK QK +NG+ Sbjct: 727 GCDRRQGLGAFKDYAKTDGLEYICPHCSISNFKKKPQKVTNGF 769 >ref|XP_006381551.1| hypothetical protein POPTR_0006s13780g [Populus trichocarpa] gi|550336257|gb|ERP59348.1| hypothetical protein POPTR_0006s13780g [Populus trichocarpa] Length = 749 Score = 1070 bits (2767), Expect = 0.0 Identities = 533/768 (69%), Positives = 613/768 (79%), Gaps = 1/768 (0%) Frame = -2 Query: 2812 
LMSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSA 2633 +M H Q + C+LLAV CG++ +N+Q + + D+K R+PFP+L S+GRLEVQ L NPS Sbjct: 1 MMFHAQGPLRNHCTLLAVLCGKSGDNKQKQPLSDDKPRFPFPELASAGRLEVQVLTNPST 60 Query: 2632 DEFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEAICGLFSSVLPTTVYLEL 2456 DEF +VL S +P+I+Y QGEQ+ D G L WG ++LSTPE++CGLF S LP TVYLE+ Sbjct: 61 DEFQRVLHSLEPSIVYFQGEQIEDSEEIGPLRWGDIDLSTPESLCGLFGSTLPPTVYLEI 120 Query: 2455 PNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFR 2276 PNGE+LAEALHSKG+PYVIYWK FS +A HFR AL SVVQSS HT DAFQLA+ASFR Sbjct: 121 PNGEKLAEALHSKGVPYVIYWKSMFSCYAVSHFRQALLSVVQSSCSHTCDAFQLAYASFR 180 Query: 2275 LYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDE 2096 LYC RNN L N QKV GK GP L+G+PPK + SSGALPAIKIYD+ Sbjct: 181 LYCGRNNNTLASNGQKVGGKPGPQLLGDPPKFDITLPEADDQGEES-SSGALPAIKIYDD 239 Query: 2095 DINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGV 1916 D+ MRFLVCG + +LD+ +LE LEDGLNALL+IE+RGSKLHNR SA PPPLQAGTFSRGV Sbjct: 240 DVTMRFLVCGLSCTLDACLLESLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGTFSRGV 299 Query: 1915 VTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRL 1736 VTMRCD+ST SSAHISLLVSGSAQTCF+DQ+LENHIK+E+I+N+ LVHAL + EE+K Sbjct: 300 VTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALTSFEESKSPS 359 Query: 1735 SEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEK 1556 SEPR+S SIACGASV+EV +K+P WASQVLRQLAPDVSYR+LV LGIASIQGL+VASFEK Sbjct: 360 SEPRKSASIACGASVFEVSMKVPTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEK 419 Query: 1555 DDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVKREE 1376 DDA+RLLFF + QGK+ H N + PTWL PAP RKRSE + + + Sbjct: 420 DDADRLLFFCSEQGKESHPLNTFLTRPPTWLIPPAPCRKRSEPTR-----------ETKP 468 Query: 1375 LEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPNL 1196 L G + K VAA+RPIPH KMLPFSG + + ++G Q KP+L Sbjct: 469 LTSGRGGENGG------NVKHKFHVAAMRPIPHTHRHKMLPFSGFFDAERYDGEQAKPSL 522 Query: 1195 PPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLKD 1016 PP P KH+ VG P +HRKS S+SYQA+QIISLNPLPLKKHGCGRSPI VCSEEEFL+D Sbjct: 
523 PP-PPPKHSVVGPAPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRSPIQVCSEEEFLRD 581 Query: 1015 VMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVF 836 VMQFLILRGH+RL+PQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVF Sbjct: 582 VMQFLILRGHSRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVF 641 Query: 835 SKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVCG 656 SKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+CG Sbjct: 642 SKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICG 701 Query: 655 EWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 512 EWAHFGCDRRQGLGAFKDYAKTDGLEYICP CS++ FKKK QKT+NGY Sbjct: 702 EWAHFGCDRRQGLGAFKDYAKTDGLEYICPNCSIANFKKKSQKTTNGY 749 >gb|EOY30139.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 1 [Theobroma cacao] gi|508782884|gb|EOY30140.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 1 [Theobroma cacao] Length = 746 Score = 1062 bits (2747), Expect = 0.0 Identities = 542/769 (70%), Positives = 616/769 (80%), Gaps = 2/769 (0%) Frame = -2 Query: 2812 LMSHIQVASKYTCSLLAVFCGRN-AENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPS 2636 +M Q +S+ CSLLAV G N ++N+Q + V D+K RYPFP+L SSGRLEVQ L +P+ Sbjct: 1 MMFSAQGSSRNHCSLLAVLSGGNVSDNKQKQPVSDDKPRYPFPELASSGRLEVQLLNSPN 60 Query: 2635 ADEFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEAICGLFSSVLPTTVYLE 2459 DE +VL+S +PN++YLQGEQ D G L+WG V+LSTPE +CGLF S LPTTVYLE Sbjct: 61 IDELRRVLESTEPNVVYLQGEQNADSEEIGPLIWGDVDLSTPETLCGLFDSTLPTTVYLE 120 Query: 2458 LPNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASF 2279 PNG++LAEALHS+G+PYVIYWK+ FS FAACHFR AL SV+QSS HTWDAFQLAHASF Sbjct: 121 TPNGDKLAEALHSQGVPYVIYWKNTFSRFAACHFRQALLSVIQSSCSHTWDAFQLAHASF 180 Query: 2278 RLYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYD 2099 RLYCVRNN V+ NSQK + K GP L+GE PKI+ S LPAIKIYD Sbjct: 181 RLYCVRNNNVVSSNSQKQSVKPGPRLLGEAPKIDVSQPEVDMQGEES-SPENLPAIKIYD 239 Query: 2098 EDINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRG 1919 +D+ +RFLVCG LD+ +L LEDGLNALLSIE+RGSKLHNRASA PPPLQAGTFSRG Sbjct: 240 
DDVTVRFLVCGSPCILDAFLLGSLEDGLNALLSIEIRGSKLHNRASAPPPPLQAGTFSRG 299 Query: 1918 VVTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPR 1739 VVTMRCD ST SSAHISLLVSGSAQTCF+DQ+LENHIK+E+I+ + LVHA +SEE+K Sbjct: 300 VVTMRCDFSTCSSAHISLLVSGSAQTCFNDQLLENHIKNEIIEKSQLVHAQSSSEESKLP 359 Query: 1738 LSEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFE 1559 SEPRRS SIACGASV+EVC+K+P WASQVLRQLAPDVSYR+LV LGIASIQGL+VASFE Sbjct: 360 SSEPRRSASIACGASVFEVCMKVPTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFE 419 Query: 1558 KDDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVKRE 1379 KDDAERLLFF RQ KD ++ I P+WL PAPSRKRSE C+ S + G Sbjct: 420 KDDAERLLFFCMRQDKDPLQDSSVIAISPSWLVPPAPSRKRSEPCKDSKPLNCTG----- 474 Query: 1378 ELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPN 1199 +E + NG AR KS VAA+RPIPH K++PFSG SE + ++G Q K N Sbjct: 475 -MEGE-----NGI------ARPKSNVAAMRPIPHTHRHKIIPFSGFSEAERYDGDQGKVN 522 Query: 1198 LPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLK 1019 LP V P K + P +HRK+ S+SYQA+QIISLNPLPLKKHGCGR+PI VCSEEEFL+ Sbjct: 523 LP-VVPVKQPA----PVTHRKALSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLR 577 Query: 1018 DVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 839 DVMQFLILRGHTRL+PQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV Sbjct: 578 DVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 637 Query: 838 FSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVC 659 FSKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+C Sbjct: 638 FSKMRNHTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGIC 697 Query: 658 GEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 512 GEWAHFGCDRRQGLGAFKDYAKTDGLEY+CP CS+S FKKK QKT NGY Sbjct: 698 GEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSISNFKKKPQKTVNGY 746 >ref|XP_002277324.1| PREDICTED: AT-rich interactive domain-containing protein 4 [Vitis vinifera] gi|297738501|emb|CBI27746.3| unnamed protein product [Vitis vinifera] Length = 739 Score = 1051 bits (2718), Expect = 0.0 Identities = 536/769 (69%), Positives = 612/769 (79%), Gaps = 3/769 
(0%) Frame = -2 Query: 2809 MSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSAD 2630 M H+Q AS+ C+LLAV CG+ +E ++ YPFP+LVSSGRLEVQ LKNPS Sbjct: 1 MFHVQAASRNHCALLAVVCGKIPVSED-----QQQHPYPFPELVSSGRLEVQILKNPSIH 55 Query: 2629 EFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELP 2453 EF + L+S +PN LYLQGEQLP GSL WGGV+LS+ EA+ LF LPTTVYLE P Sbjct: 56 EFQRSLESLEPNFLYLQGEQLPGSEEIGSLTWGGVDLSSAEALVELFGPTLPTTVYLETP 115 Query: 2452 NGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRL 2273 NGE+LA+ALHSKG+ YVIYWK+ FS +AACHFR ALFSVVQSS HTWDAFQLAHASFRL Sbjct: 116 NGEKLAKALHSKGVSYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFRL 175 Query: 2272 YCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDED 2093 YCV+NN V P N+QKV+GKLGP L+G+PPKIN + LP IKIYD D Sbjct: 176 YCVQNNTV-PSNNQKVSGKLGPCLLGDPPKINVVPPEVDEEESLPAT---LPVIKIYDAD 231 Query: 2092 INMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVV 1913 ++MRFLVCG +LD+ +L LEDGLNALL IE+RGSKLHNR SA PPPLQAGTFSRGVV Sbjct: 232 VSMRFLVCGAPSALDACLLGSLEDGLNALLCIEIRGSKLHNRVSAPPPPLQAGTFSRGVV 291 Query: 1912 TMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLS 1733 TMRCD+ST SSAHISLLVSGSAQTC +DQ+LE++IK+E+I+ + LVHA+P+ EE+K S Sbjct: 292 TMRCDLSTCSSAHISLLVSGSAQTCLNDQLLESYIKNELIEKSQLVHAVPSCEESKLSSS 351 Query: 1732 EPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKD 1553 EPRRS SIACGASV+EV +K+P WASQVLRQLAPDVSYR+LV LGIASIQGL+VASFEKD Sbjct: 352 EPRRSASIACGASVFEVRIKVPTWASQVLRQLAPDVSYRSLVTLGIASIQGLSVASFEKD 411 Query: 1552 DAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQIS--SIYSQNGAVKRE 1379 DA+RLLFF R K + NN + P+WL +P SRKRS C + S Y G V Sbjct: 412 DADRLLFFCTRHAKQLNQNNSILPRPPSWLIAPPASRKRSGPCHETKPSGYKVLGGV--- 468 Query: 1378 ELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPN 1199 NG L +QK K+AA+RPIPH R+ KMLPFSG+SE +G Q K N Sbjct: 469 ----------NGGVL-----QQKPKIAAMRPIPHTRNHKMLPFSGISEASRCDGDQAKGN 513 Query: 1198 LPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLK 1019 L V P+KHN G TP +HRK S+S+QA+QIISLNPLPLKKHGCGRSPI 
+CSEEEFL+ Sbjct: 514 LS-VVPAKHN--GTTPVTHRKLLSSSFQAQQIISLNPLPLKKHGCGRSPIQICSEEEFLR 570 Query: 1018 DVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 839 DVMQFLILRGHTRL+PQGGLAEFPDAILNAKRLDL+NLYREVVSRGGFHVGNGINWKGQV Sbjct: 571 DVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGFHVGNGINWKGQV 630 Query: 838 FSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVC 659 FSKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+C Sbjct: 631 FSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGIC 690 Query: 658 GEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 512 GEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CS++ F+KK QKT+NGY Sbjct: 691 GEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFQKKSQKTANGY 739 >ref|XP_002324130.2| arid/bright DNA-binding domain-containing family protein [Populus trichocarpa] gi|550318261|gb|EEF02695.2| arid/bright DNA-binding domain-containing family protein [Populus trichocarpa] Length = 746 Score = 1050 bits (2715), Expect = 0.0 Identities = 533/773 (68%), Positives = 604/773 (78%), Gaps = 6/773 (0%) Frame = -2 Query: 2812 LMSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSA 2633 +M H Q + C+LLAV CG++ E Q + D+K RYP P+L S+GRLEVQ L NPS Sbjct: 1 MMFHAQGPLRNHCTLLAVLCGKSGE--QKLPLSDDKPRYPLPELESTGRLEVQVLNNPST 58 Query: 2632 DEFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEAICGLFSSVLPTTVYLEL 2456 DEF +VL S +P+I+Y QGEQ+ D GSL W V LSTPE++CGLF S LP TVYLE+ Sbjct: 59 DEFRQVLQSLEPSIVYFQGEQVEDREEIGSLRWADVGLSTPESLCGLFGSTLPPTVYLEM 118 Query: 2455 PNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFR 2276 PNGE+LAEALHSKG+PYVIYWK FS +AA HFR AL SVVQSS HT DAFQLAHASFR Sbjct: 119 PNGEKLAEALHSKGVPYVIYWKSAFSCYAASHFRQALLSVVQSSCSHTCDAFQLAHASFR 178 Query: 2275 LYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDE 2096 LYCV+NN NSQKV GK GP L+G+PPK + SSGALPAIKIYD+ Sbjct: 179 LYCVQNNNTPASNSQKVGGKPGPRLLGDPPKFDISLPEADDQGEEG-SSGALPAIKIYDD 237 Query: 2095 DINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGV 1916 D+ MRFLVCG T +LD+ L LEDGLNALL+IE+RGSKLHNR SA PPPLQAGTFSRGV 
Sbjct: 238 DVTMRFLVCGLTGTLDACALGSLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGTFSRGV 297 Query: 1915 VTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRL 1736 VTMRCD+ST SSAHISLLVSGSAQ CF+DQ+LENHIKSE+I+N+ LVHA +S+E K Sbjct: 298 VTMRCDLSTCSSAHISLLVSGSAQNCFNDQLLENHIKSELIENSQLVHASTSSDEIKSPS 357 Query: 1735 SEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEK 1556 SEPR+S SIACGASV+EV +K+P WASQVLRQLAPDV+YR+LV LGIASIQGL+VASFEK Sbjct: 358 SEPRKSASIACGASVFEVSMKVPTWASQVLRQLAPDVTYRSLVMLGIASIQGLSVASFEK 417 Query: 1555 DDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYS-----QNGA 1391 DDA+RLLFF +Q KD H N + P+WL PAP RKR E + + + +NG Sbjct: 418 DDADRLLFFCTKQSKDPHPRNPVLTRHPSWLIPPAPCRKRYEPSRETKPLTFGCGGENGG 477 Query: 1390 VKREELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQ 1211 +QK VAA+RPIPH R KMLPFSG E + ++G Q Sbjct: 478 ----------------------NFKQKLYVAAMRPIPHTRRHKMLPFSGFLEAERYDGEQ 515 Query: 1210 VKPNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEE 1031 KP+LPP P KH+ VG P +HRKS S SYQA+QIISLNPLPLKKHGCGRSPI CSEE Sbjct: 516 TKPSLPP--PPKHSVVGPAPVTHRKSLSNSYQAQQIISLNPLPLKKHGCGRSPIQACSEE 573 Query: 1030 EFLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 851 EFL+DVMQFLILRGH+RL+PQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW Sbjct: 574 EFLRDVMQFLILRGHSRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 633 Query: 850 KGQVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVN 671 KGQVFSKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVN Sbjct: 634 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVN 693 Query: 670 CGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 512 CG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CS++ FKKK QK +NGY Sbjct: 694 CGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSIANFKKKSQKNANGY 746 >ref|XP_002516200.1| DNA binding protein, putative [Ricinus communis] gi|223544686|gb|EEF46202.1| DNA binding protein, putative [Ricinus communis] Length = 749 Score = 1048 bits (2709), Expect = 0.0 Identities = 526/725 (72%), Positives = 595/725 (82%), Gaps = 1/725 (0%) Frame 
= -2 Query: 2683 LVSSGRLEVQTLKNPSADEFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEA 2507 L SSGRLEVQ L +PS DEF +VL S +PNI+YLQGE + D GSL W G +LSTP+A Sbjct: 43 LXSSGRLEVQILSSPSTDEFRRVLQSSEPNIVYLQGEIIEDSEEIGSLRWAGADLSTPDA 102 Query: 2506 ICGLFSSVLPTTVYLELPNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQS 2327 +C LF S LP TVYLE+PNGE+LAEALH KG+PYVIYWK FS +AA HFR AL SVVQS Sbjct: 103 LCELFGSTLPPTVYLEIPNGEKLAEALHFKGVPYVIYWKSTFSCYAAAHFRQALLSVVQS 162 Query: 2326 SSCHTWDAFQLAHASFRLYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXX 2147 S HT DAFQLAHASF LYCVRNN L N+QKV GK GP L+GEPPKI+ Sbjct: 163 SCSHTCDAFQLAHASFSLYCVRNNTGLSSNNQKVGGKPGPRLLGEPPKIDITLPEADVQD 222 Query: 2146 XXXDSSGALPAIKIYDEDINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNR 1967 SSG LPAIKIYD+D+ MRFLVC +LD+ +L LEDGLNALL+IE+RGSKLHNR Sbjct: 223 EES-SSGTLPAIKIYDDDVTMRFLVCELPSTLDACLLGSLEDGLNALLNIEIRGSKLHNR 281 Query: 1966 ASALPPPLQAGTFSRGVVTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDN 1787 SA PPPLQAGTFSRGVVTMRCD+ST SSAHISLLVSGSAQ CF+DQ+LENHIK+E+I+N Sbjct: 282 TSAPPPPLQAGTFSRGVVTMRCDLSTCSSAHISLLVSGSAQACFNDQLLENHIKNELIEN 341 Query: 1786 TCLVHALPTSEENKPRLSEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLV 1607 + LVHALP+SEE+K SEPR+S SI CGASV+EVC+K+P+WASQVLRQLAPDVSYR+LV Sbjct: 342 SQLVHALPSSEESKLLTSEPRKSASIGCGASVFEVCLKVPSWASQVLRQLAPDVSYRSLV 401 Query: 1606 ALGIASIQGLAVASFEKDDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEM 1427 LGIASIQGL+VASFEK+D ERLLFF RQGK+ + NN I P WL PAPSRKRSE Sbjct: 402 MLGIASIQGLSVASFEKEDTERLLFFCTRQGKELYPNNSIIIKPPCWLIPPAPSRKRSEP 461 Query: 1426 CQISSIYSQNGAVKREELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFS 1247 C+ + +++ G ++RE NG + +QK VAA+RPIPH RH KMLPFS Sbjct: 462 CRETKLFTSKG-LERE----------NGGSV-----KQKLNVAAMRPIPHTRHHKMLPFS 505 Query: 1246 GLSEIDGHEGTQVKPNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHG 1067 G +E + ++G Q KP+LP VAP+KH VG P SHRKS S+SYQA+QIISLNPLPLKKHG Sbjct: 506 GFAEGERYDGDQGKPSLP-VAPAKHGVVGPAPVSHRKSLSSSYQAQQIISLNPLPLKKHG 564 Query: 1066 CGRSPIHVCSEEEFLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVS 887 CGR+PI 
CSEEEFL+DVMQFLILRGHTRL+PQGGLAEFPDAILNAKRLDLFNLYREVVS Sbjct: 565 CGRAPIQACSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVS 624 Query: 886 RGGFHVGNGINWKGQVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCL 707 RGGFHVGNGINWKGQVFSKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCL Sbjct: 625 RGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCL 684 Query: 706 LCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQK 527 LC SSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CS++ F+KK QK Sbjct: 685 LCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSIANFRKKSQK 744 Query: 526 TSNGY 512 T+NGY Sbjct: 745 TANGY 749 >ref|XP_003547888.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Glycine max] Length = 782 Score = 1038 bits (2683), Expect = 0.0 Identities = 509/773 (65%), Positives = 607/773 (78%), Gaps = 8/773 (1%) Frame = -2 Query: 2803 HIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSADEF 2624 H Q K+TC+LLAV C ++ ++ + + YPFP+LVS+GRLEVQTL +P ++F Sbjct: 5 HSQGTPKHTCTLLAVTCRTSSAEHKLSHA---QRTYPFPELVSAGRLEVQTLCSPEKEQF 61 Query: 2623 GKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPNGE 2444 KVL+S+QPN +YL+G+QL +G GSLVW GV LST E I LF S LPT VYLE+PNGE Sbjct: 62 RKVLESFQPNFVYLRGDQLENGEVGSLVWQGVELSTCEDITELFGSTLPTAVYLEIPNGE 121 Query: 2443 ELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLYCV 2264 AEALH KGIPYVI+WK+ FS +AACHFR A SVVQSSS HTWDAF LA ASF LYCV Sbjct: 122 SFAEALHLKGIPYVIFWKNTFSCYAACHFRQAFLSVVQSSSTHTWDAFHLARASFELYCV 181 Query: 2263 RNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDEDINM 2084 +NN VLP +S + ++GP L+G+ KIN SSG+LPAIKI+++++N+ Sbjct: 182 QNNQVLPSDSDDASSEMGPHLLGDCLKINVDPPEIDEEDDDESSSGSLPAIKIHEDEVNL 241 Query: 2083 RFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVTMR 1904 RFL+CG ++D S+L LEDGL ALL+IE+RG KLH + SA PPPLQA FSRGVVTMR Sbjct: 242 RFLICGAPSTVDESLLRSLEDGLRALLTIEIRGCKLHGKFSAPPPPLQAAAFSRGVVTMR 301 Query: 1903 CDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSEPR 1724 CDIST SSAHISLLVSGSAQTCF+DQ+LENHIK+E+I+ + LVHA +E 
NK + EPR Sbjct: 302 CDISTCSSAHISLLVSGSAQTCFNDQLLENHIKNEIIEKSQLVHAQLNNEGNKENICEPR 361 Query: 1723 RSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDDAE 1544 RS SIACGASV+E+C+KLP WA Q+LRQLAP+VSYR+LVALGIASIQGL +ASFEKDDAE Sbjct: 362 RSASIACGASVFEICMKLPQWALQILRQLAPEVSYRSLVALGIASIQGLPIASFEKDDAE 421 Query: 1543 RLLFFSARQGKDE--HLNNLNIGSLPTWLRSPAPSRKRSEMCQISS------IYSQNGAV 1388 RLLFF KD + NN+ S P WL+ P P+RKR E Q +S +++ G V Sbjct: 422 RLLFFYQNCEKDSCTNKNNIIFSSPPGWLKPPPPTRKRCEPRQEASPGLHEGVFAGQGGV 481 Query: 1387 KREELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQV 1208 + E+K+ + NG +P+ PARQ+ KV+A+RPIPH+R +M PF G SE DG +GTQV Sbjct: 482 CKLNEEEKDRKIVNGISMPLTPARQRLKVSAMRPIPHIRRHRMTPFCGPSETDGFDGTQV 541 Query: 1207 KPNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEE 1028 + LP VAP+K S+G T +HRKS S++ Q+KQ+ISLNPLPLKKHGCGR P+ CSEEE Sbjct: 542 EAILPLVAPTKRTSIGSTSGTHRKSFSSAAQSKQVISLNPLPLKKHGCGRGPVQTCSEEE 601 Query: 1027 FLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 848 FLKDVM+FLILRGH RLIPQGGL EFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINWK Sbjct: 602 FLKDVMEFLILRGHNRLIPQGGLTEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWK 661 Query: 847 GQVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNC 668 GQ+FSKMRN+T TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNC Sbjct: 662 GQIFSKMRNYTTTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 721 Query: 667 GVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 509 G+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CSV+ FKKK Q +NGYS Sbjct: 722 GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVTNFKKK-QNVANGYS 773 >gb|ESW07366.1| hypothetical protein PHAVU_010G123900g [Phaseolus vulgaris] Length = 781 Score = 1034 bits (2674), Expect = 0.0 Identities = 513/774 (66%), Positives = 607/774 (78%), Gaps = 9/774 (1%) Frame = -2 Query: 2803 HIQVASKYTCSLLAVFCGRN-AENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSADE 2627 H A K+ C+LLAV CG + AE++ +N + +YPFP+LVS+GRLEVQTL+NP ++ Sbjct: 5 HPHGAPKHACTLLAVTCGASFAEHKASQN----QHKYPFPELVSAGRLEVQTLRNPDKEQ 60 Query: 2626 
FGKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPNG 2447 F KVL+S+QPN +YLQGEQL + + GSLVW G+ LST E I LF S LPT VYLE+PNG Sbjct: 61 FRKVLESYQPNFVYLQGEQLENDKVGSLVWQGLELSTSEDIIELFGSTLPTAVYLEIPNG 120 Query: 2446 EELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLYC 2267 E AEALH KGIPYVI+WK+ F S+AACHFR A SVVQSSS HTWDAF LA ASF LYC Sbjct: 121 ESFAEALHLKGIPYVIFWKNAFFSYAACHFRQAFLSVVQSSSTHTWDAFHLARASFELYC 180 Query: 2266 VRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDEDIN 2087 V+NN VL N ++GP L+G+ KIN +SSG LPAIKI+++++N Sbjct: 181 VQNNQVLSTNIHDAISEMGPHLLGDCLKINVDPPEIDEEDDDENSSGTLPAIKIHEDEVN 240 Query: 2086 MRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVTM 1907 +RFLVCG ++D S+L LEDGL ALL+IE+RG KLH + SA PPPLQA TFSRGVVTM Sbjct: 241 LRFLVCGAPSTVDESLLRSLEDGLRALLTIEIRGCKLHGKFSAPPPPLQAATFSRGVVTM 300 Query: 1906 RCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSEP 1727 RCDIST SSAHISLLVSGSAQTCF+DQ+LE+HIK+E+I+ + LVHA +E NK +SEP Sbjct: 301 RCDISTCSSAHISLLVSGSAQTCFNDQLLESHIKNEIIEKSQLVHAQLNNEGNKQNISEP 360 Query: 1726 RRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDDA 1547 RRS SIACGA V+E+C+KLP WA Q+LRQLAP+VSYR+LVALGIASIQGL +ASFEKDDA Sbjct: 361 RRSASIACGAPVFEICMKLPQWALQILRQLAPEVSYRSLVALGIASIQGLPIASFEKDDA 420 Query: 1546 ERLLFF--SARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISS------IYSQNGA 1391 ERLLFF S + NN+ GS P WL+ P P RKR E Q +S +++ Sbjct: 421 ERLLFFYQSCEKDSGTSKNNIIFGSPPGWLKPPPPRRKRCESSQGASPGLHEGVFAGPAT 480 Query: 1390 VKREELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQ 1211 V + E+K+ + NG P+ PARQ+ KV+A+RPIPH+R +M PF G SE DG +G Q Sbjct: 481 VYKVNEEEKDRKMANGISTPLAPARQRLKVSAMRPIPHIRRHRMTPFCGPSETDGFDGGQ 540 Query: 1210 VKPNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEE 1031 V+P LP VAP+K S+G T A+HRKS S++ Q+KQ+ISLNPLPLKKHGCGR P+ CSEE Sbjct: 541 VEPTLPLVAPTK-RSIGSTSATHRKSFSSAAQSKQVISLNPLPLKKHGCGRGPVQTCSEE 599 Query: 1030 EFLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 851 EFLKDVM+FLILRGH RLIPQGGL EFPDAILN 
KRLDL+NLY+EVV+RGGFHVGNGINW Sbjct: 600 EFLKDVMEFLILRGHNRLIPQGGLTEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINW 659 Query: 850 KGQVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVN 671 KGQ+FSKMRN+T TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVN Sbjct: 660 KGQIFSKMRNYTTTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVN 719 Query: 670 CGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 509 CG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CSV+ FKKK Q +NGYS Sbjct: 720 CGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVTNFKKK-QNVTNGYS 772 >gb|EOY30141.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 3, partial [Theobroma cacao] Length = 708 Score = 1024 bits (2647), Expect = 0.0 Identities = 520/724 (71%), Positives = 585/724 (80%), Gaps = 1/724 (0%) Frame = -2 Query: 2683 LVSSGRLEVQTLKNPSADEFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEA 2507 L SSGRLEVQ L +P+ DE +VL+S +PN++YLQGEQ D G L+WG V+LSTPE Sbjct: 1 LASSGRLEVQLLNSPNIDELRRVLESTEPNVVYLQGEQNADSEEIGPLIWGDVDLSTPET 60 Query: 2506 ICGLFSSVLPTTVYLELPNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQS 2327 +CGLF S LPTTVYLE PNG++LAEALHS+G+PYVIYWK+ FS FAACHFR AL SV+QS Sbjct: 61 LCGLFDSTLPTTVYLETPNGDKLAEALHSQGVPYVIYWKNTFSRFAACHFRQALLSVIQS 120 Query: 2326 SSCHTWDAFQLAHASFRLYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXX 2147 S HTWDAFQLAHASFRLYCVRNN V+ NSQK + K GP L+GE PKI+ Sbjct: 121 SCSHTWDAFQLAHASFRLYCVRNNNVVSSNSQKQSVKPGPRLLGEAPKIDVSQPEVDMQG 180 Query: 2146 XXXDSSGALPAIKIYDEDINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNR 1967 S LPAIKIYD+D+ +RFLVCG LD+ +L LEDGLNALLSIE+RGSKLHNR Sbjct: 181 EES-SPENLPAIKIYDDDVTVRFLVCGSPCILDAFLLGSLEDGLNALLSIEIRGSKLHNR 239 Query: 1966 ASALPPPLQAGTFSRGVVTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDN 1787 ASA PPPLQAGTFSRGVVTMRCD ST SSAHISLLVSGSAQTCF+DQ+LENHIK+E+I+ Sbjct: 240 ASAPPPPLQAGTFSRGVVTMRCDFSTCSSAHISLLVSGSAQTCFNDQLLENHIKNEIIEK 299 Query: 1786 TCLVHALPTSEENKPRLSEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLV 1607 + LVHA +SEE+K SEPRRS SIACGASV+EVC+K+P WASQVLRQLAPDVSYR+LV Sbjct: 300 SQLVHAQSSSEESKLPSSEPRRSASIACGASVFEVCMKVPTWASQVLRQLAPDVSYRSLV 
359 Query: 1606 ALGIASIQGLAVASFEKDDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEM 1427 LGIASIQGL+VASFEKDDAERLLFF RQ KD ++ I P+WL PAPSRKRSE Sbjct: 360 MLGIASIQGLSVASFEKDDAERLLFFCMRQDKDPLQDSSVIAISPSWLVPPAPSRKRSEP 419 Query: 1426 CQISSIYSQNGAVKREELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFS 1247 C+ S + G +E + NG AR KS VAA+RPIPH K++PFS Sbjct: 420 CKDSKPLNCTG------MEGE-----NGI------ARPKSNVAAMRPIPHTHRHKIIPFS 462 Query: 1246 GLSEIDGHEGTQVKPNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHG 1067 G SE + ++G Q K NLP V P K + P +HRK+ S+SYQA+QIISLNPLPLKKHG Sbjct: 463 GFSEAERYDGDQGKVNLP-VVPVKQPA----PVTHRKALSSSYQAQQIISLNPLPLKKHG 517 Query: 1066 CGRSPIHVCSEEEFLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVS 887 CGR+PI VCSEEEFL+DVMQFLILRGHTRL+PQGGLAEFPDAILNAKRLDLFNLYREVVS Sbjct: 518 CGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVS 577 Query: 886 RGGFHVGNGINWKGQVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCL 707 RGGFHVGNGINWKGQVFSKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCL Sbjct: 578 RGGFHVGNGINWKGQVFSKMRNHTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCL 637 Query: 706 LCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQK 527 LC SSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEY+CP CS+S FKKK QK Sbjct: 638 LCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSISNFKKKPQK 697 Query: 526 TSNG 515 T NG Sbjct: 698 TVNG 701 >ref|XP_004510562.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Cicer arietinum] Length = 783 Score = 1004 bits (2595), Expect = 0.0 Identities = 506/780 (64%), Positives = 605/780 (77%), Gaps = 12/780 (1%) Frame = -2 Query: 2812 LMSHIQVASKYTCSLLAVFCG-RNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPS 2636 L H Q +SK TC+LL V R AE + +N +PFP+LVSSGRLEVQTL NP Sbjct: 2 LQFHPQGSSKQTCTLLTVTSATRCAEQKHPQN----HHNFPFPELVSSGRLEVQTLCNPE 57 Query: 2635 ADEFGKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLEL 2456 ++F KVL+S+QP+I+YLQGEQL + GS+VW GV LSTPE I LF + LPT VYLE+ Sbjct: 58 KEQFCKVLESYQPSIVYLQGEQLVNEEVGSVVWQGVELSTPEDISELFGTSLPTAVYLEI 117 Query: 2455 
PNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFR 2276 PNGE AEALH KGIPYV++WK+ FS +AACHFR A FSVVQSSS HTWDAF LAHASF Sbjct: 118 PNGESFAEALHLKGIPYVVFWKNAFSRYAACHFRQAFFSVVQSSSTHTWDAFHLAHASFE 177 Query: 2275 LYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXD----SSGALPAIK 2108 LYCV+NN VLP +S + +GP L+G+ KI+ D SSG+LP+I+ Sbjct: 178 LYCVQNNQVLPTDSNDADSDMGPHLLGDCLKIHIDPPEMGEEEEDDDDDESSSGSLPSIQ 237 Query: 2107 IYDEDINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTF 1928 I+D+++N+RFL+CG+ ++D S+L LEDGL ALL+IE+RG KLH + SA PPPLQA F Sbjct: 238 IHDDEVNLRFLICGEPSTVDESLLRSLEDGLRALLTIEIRGCKLHGKYSAPPPPLQAAAF 297 Query: 1927 SRGVVTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEEN 1748 SRGVVTMRCDIST SSAHISLLVSGSAQ CF+DQ+LENHIK+E+I+ +VHA SE N Sbjct: 298 SRGVVTMRCDISTCSSAHISLLVSGSAQACFNDQLLENHIKNEIIEKGQIVHA-QLSEAN 356 Query: 1747 KPRLSEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVA 1568 K +SEPRRS SIACGA+++E+ +KLP WA Q+LRQLAPDVSYR+LVALGIASIQGL VA Sbjct: 357 KQTISEPRRSASIACGATIFEISMKLPQWALQILRQLAPDVSYRSLVALGIASIQGLPVA 416 Query: 1567 SFEKDDAERLLFFSARQGKDEHLN-NLNIGSLPTWLRSPAPSRKRSEMCQISS------I 1409 SFEKDDAERLLFF KD N N+ P WL+ P P+RKRSE Q +S + Sbjct: 417 SFEKDDAERLLFFYQSSEKDGCANHNIVFSRPPIWLKPPPPTRKRSESSQGASPDIDDGV 476 Query: 1408 YSQNGAVKREELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEID 1229 +S GA+K+ + E+K+ + NG P+ PARQ+ KV+A+RPIP VR +M PF G SE+D Sbjct: 477 FSGQGAIKKVDEEEKDRKMVNGISTPLTPARQRLKVSAMRPIPQVRRHRMTPFCGPSEMD 536 Query: 1228 GHEGTQVKPNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPI 1049 G G V+ ++P V P K +S+ + A+ RKS S+S +KQ+ISLNPLPLKKHGC R P+ Sbjct: 537 GFGGAHVEASVPLV-PMKRSSIASSSATQRKSFSSSALSKQVISLNPLPLKKHGCSRGPV 595 Query: 1048 HVCSEEEFLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHV 869 CSEEEFLKDVM+FLILRGH+RLIPQGGL+EFPDAILN KRLDL+NLY+EVV+RGGFHV Sbjct: 596 QTCSEEEFLKDVMEFLILRGHSRLIPQGGLSEFPDAILNGKRLDLYNLYKEVVTRGGFHV 655 Query: 868 GNGINWKGQVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSA 689 GNGINWKGQ+FSKM N+T TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC 
SSA Sbjct: 656 GNGINWKGQIFSKMGNYTSTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA 715 Query: 688 AGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 509 AGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CS++ FKKK Q +NGYS Sbjct: 716 AGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFKKK-QSVANGYS 774 >ref|XP_003533805.1| PREDICTED: AT-rich interactive domain-containing protein 4-like isoform X1 [Glycine max] Length = 752 Score = 998 bits (2581), Expect = 0.0 Identities = 503/771 (65%), Positives = 589/771 (76%), Gaps = 4/771 (0%) Frame = -2 Query: 2812 LMSHIQVASKYTCSLLAVFCGRNAENEQIK---NVLDEKSRYPFPDLVSSGRLEVQTLKN 2642 +M H Q S++ CSLLAV G++ + +Q + N +++ YPFP+L SSGRLEV+ L Sbjct: 1 MMFHSQGVSRH-CSLLAVLSGKSRDIKQKQKQGNASEDQFPYPFPELSSSGRLEVKVLIE 59 Query: 2641 PSADEFGKVLDSWQPNILYLQGEQLPD-GRYGSLVWGGVNLSTPEAICGLFSSVLPTTVY 2465 P+ADE G L+ QP+ +YLQG+QL D G G L W +LS PEA+CGLFSS LP TVY Sbjct: 60 PTADELGLALEQLQPDFVYLQGQQLEDRGEIGPLGWEDFDLSVPEALCGLFSSKLPNTVY 119 Query: 2464 LELPNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHA 2285 LE P GE+LAEAL SKG+PY IYWK+ FS +AA HFRH+LFSV QS+S HTWDAFQLA A Sbjct: 120 LETPKGEKLAEALRSKGVPYTIYWKNDFSKYAASHFRHSLFSVAQSTSSHTWDAFQLALA 179 Query: 2284 SFRLYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKI 2105 SFRLYC+ NN VLP N K GKLGP ++G PP I+ DS + A+KI Sbjct: 180 SFRLYCIHNN-VLPSNCHKGAGKLGPQILGVPPNIDVSPCVADMKEEEEDSPETISAVKI 238 Query: 2104 YDEDINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFS 1925 YD+D+NMRFL+CG +LD+ +L LEDGLNALL E+RG KLHNR SA PPPLQAGTFS Sbjct: 239 YDDDVNMRFLICGVPCTLDACLLGSLEDGLNALLFAEIRGCKLHNRTSATPPPLQAGTFS 298 Query: 1924 RGVVTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENK 1745 RGVVTMRCDIST SSAHISLLVSGSA TCF+DQ+LENHIK E+I+ + LV A P E++K Sbjct: 299 RGVVTMRCDISTCSSAHISLLVSGSADTCFNDQLLENHIKKELIEKSQLVQAFPNHEQSK 358 Query: 1744 PRLSEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVAS 1565 SEPRRS S+ACG+SV+EVC+++P WASQVLRQLAP++SYR+LV LGIASIQGL VAS Sbjct: 359 APSSEPRRSASVACGSSVFEVCMQVPAWASQVLRQLAPNLSYRSLVMLGIASIQGLPVAS 
418 Query: 1564 FEKDDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVK 1385 F KDDAERLLFF RQ K+ N+ +P+WL+ P+ SRKRSE C S + +G Sbjct: 419 FNKDDAERLLFFCTRQEKENCPNDHVFSGIPSWLKPPSTSRKRSEPCSSSKSINDSGRGV 478 Query: 1384 REELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVK 1205 + RQK +A++RPIPH K+LPFSGLSE ++G K Sbjct: 479 EA----------------IGSHRQKFNLASMRPIPHSNRHKILPFSGLSEGTRYDGDHGK 522 Query: 1204 PNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEF 1025 NLP +AP KHN G T ++RKS S S+QA QIISLNPLP+KKHGC R+PI CSEEEF Sbjct: 523 SNLP-LAPIKHNVSGPTSVTNRKSVSNSFQAHQIISLNPLPMKKHGCDRAPIRACSEEEF 581 Query: 1024 LKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKG 845 L+DVMQFLILRGH RLIP GGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKG Sbjct: 582 LRDVMQFLILRGHNRLIPPGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKG 641 Query: 844 QVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCG 665 QVFSKMRNHT+TN+MTGVGNTLKRHYETYLLEYEL+HDDVDGECCL+C SSAAGDWVNCG Sbjct: 642 QVFSKMRNHTMTNRMTGVGNTLKRHYETYLLEYELSHDDVDGECCLMCHSSAAGDWVNCG 701 Query: 664 VCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 512 +CGEWAHFGCDRRQGLGAFKDYAKTDGLEY+CP+CS F KK QKT+NG+ Sbjct: 702 ICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPRCSALKFSKKSQKTANGF 752 >ref|XP_006587068.1| PREDICTED: AT-rich interactive domain-containing protein 4-like isoform X3 [Glycine max] Length = 772 Score = 995 bits (2572), Expect = 0.0 Identities = 499/759 (65%), Positives = 582/759 (76%), Gaps = 4/759 (0%) Frame = -2 Query: 2776 CSLLAVFCGRNAENEQIK---NVLDEKSRYPFPDLVSSGRLEVQTLKNPSADEFGKVLDS 2606 CSLLAV G++ + +Q + N +++ YPFP+L SSGRLEV+ L P+ADE G L+ Sbjct: 32 CSLLAVLSGKSRDIKQKQKQGNASEDQFPYPFPELSSSGRLEVKVLIEPTADELGLALEQ 91 Query: 2605 WQPNILYLQGEQLPD-GRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPNGEELAEA 2429 QP+ +YLQG+QL D G G L W +LS PEA+CGLFSS LP TVYLE P GE+LAEA Sbjct: 92 LQPDFVYLQGQQLEDRGEIGPLGWEDFDLSVPEALCGLFSSKLPNTVYLETPKGEKLAEA 151 Query: 2428 LHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLYCVRNNLV 2249 L SKG+PY IYWK+ FS +AA HFRH+LFSV QS+S HTWDAFQLA 
ASFRLYC+ NN V Sbjct: 152 LRSKGVPYTIYWKNDFSKYAASHFRHSLFSVAQSTSSHTWDAFQLALASFRLYCIHNN-V 210 Query: 2248 LPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDEDINMRFLVC 2069 LP N K GKLGP ++G PP I+ DS + A+KIYD+D+NMRFL+C Sbjct: 211 LPSNCHKGAGKLGPQILGVPPNIDVSPCVADMKEEEEDSPETISAVKIYDDDVNMRFLIC 270 Query: 2068 GDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVTMRCDIST 1889 G +LD+ +L LEDGLNALL E+RG KLHNR SA PPPLQAGTFSRGVVTMRCDIST Sbjct: 271 GVPCTLDACLLGSLEDGLNALLFAEIRGCKLHNRTSATPPPLQAGTFSRGVVTMRCDIST 330 Query: 1888 VSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSEPRRSVSI 1709 SSAHISLLVSGSA TCF+DQ+LENHIK E+I+ + LV A P E++K SEPRRS S+ Sbjct: 331 CSSAHISLLVSGSADTCFNDQLLENHIKKELIEKSQLVQAFPNHEQSKAPSSEPRRSASV 390 Query: 1708 ACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDDAERLLFF 1529 ACG+SV+EVC+++P WASQVLRQLAP++SYR+LV LGIASIQGL VASF KDDAERLLFF Sbjct: 391 ACGSSVFEVCMQVPAWASQVLRQLAPNLSYRSLVMLGIASIQGLPVASFNKDDAERLLFF 450 Query: 1528 SARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVKREELEDKESALR 1349 RQ K+ N+ +P+WL+ P+ SRKRSE C S + +G Sbjct: 451 CTRQEKENCPNDHVFSGIPSWLKPPSTSRKRSEPCSSSKSINDSGRGVEA---------- 500 Query: 1348 NGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPNLPPVAPSKHN 1169 + RQK +A++RPIPH K+LPFSGLSE ++G K NLP +AP KHN Sbjct: 501 ------IGSHRQKFNLASMRPIPHSNRHKILPFSGLSEGTRYDGDHGKSNLP-LAPIKHN 553 Query: 1168 SVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLKDVMQFLILRG 989 G T ++RKS S S+QA QIISLNPLP+KKHGC R+PI CSEEEFL+DVMQFLILRG Sbjct: 554 VSGPTSVTNRKSVSNSFQAHQIISLNPLPMKKHGCDRAPIRACSEEEFLRDVMQFLILRG 613 Query: 988 HTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTVT 809 H RLIP GGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHT+T Sbjct: 614 HNRLIPPGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTMT 673 Query: 808 NKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDR 629 N+MTGVGNTLKRHYETYLLEYEL+HDDVDGECCL+C SSAAGDWVNCG+CGEWAHFGCDR Sbjct: 674 NRMTGVGNTLKRHYETYLLEYELSHDDVDGECCLMCHSSAAGDWVNCGICGEWAHFGCDR 733 Query: 628 
RQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 512 RQGLGAFKDYAKTDGLEY+CP+CS F KK QKT+NG+ Sbjct: 734 RQGLGAFKDYAKTDGLEYVCPRCSALKFSKKSQKTANGF 772 >ref|XP_006587067.1| PREDICTED: AT-rich interactive domain-containing protein 4-like isoform X2 [Glycine max] Length = 795 Score = 995 bits (2572), Expect = 0.0 Identities = 499/759 (65%), Positives = 582/759 (76%), Gaps = 4/759 (0%) Frame = -2 Query: 2776 CSLLAVFCGRNAENEQIK---NVLDEKSRYPFPDLVSSGRLEVQTLKNPSADEFGKVLDS 2606 CSLLAV G++ + +Q + N +++ YPFP+L SSGRLEV+ L P+ADE G L+ Sbjct: 55 CSLLAVLSGKSRDIKQKQKQGNASEDQFPYPFPELSSSGRLEVKVLIEPTADELGLALEQ 114 Query: 2605 WQPNILYLQGEQLPD-GRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPNGEELAEA 2429 QP+ +YLQG+QL D G G L W +LS PEA+CGLFSS LP TVYLE P GE+LAEA Sbjct: 115 LQPDFVYLQGQQLEDRGEIGPLGWEDFDLSVPEALCGLFSSKLPNTVYLETPKGEKLAEA 174 Query: 2428 LHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLYCVRNNLV 2249 L SKG+PY IYWK+ FS +AA HFRH+LFSV QS+S HTWDAFQLA ASFRLYC+ NN V Sbjct: 175 LRSKGVPYTIYWKNDFSKYAASHFRHSLFSVAQSTSSHTWDAFQLALASFRLYCIHNN-V 233 Query: 2248 LPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXDSSGALPAIKIYDEDINMRFLVC 2069 LP N K GKLGP ++G PP I+ DS + A+KIYD+D+NMRFL+C Sbjct: 234 LPSNCHKGAGKLGPQILGVPPNIDVSPCVADMKEEEEDSPETISAVKIYDDDVNMRFLIC 293 Query: 2068 GDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVTMRCDIST 1889 G +LD+ +L LEDGLNALL E+RG KLHNR SA PPPLQAGTFSRGVVTMRCDIST Sbjct: 294 GVPCTLDACLLGSLEDGLNALLFAEIRGCKLHNRTSATPPPLQAGTFSRGVVTMRCDIST 353 Query: 1888 VSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSEPRRSVSI 1709 SSAHISLLVSGSA TCF+DQ+LENHIK E+I+ + LV A P E++K SEPRRS S+ Sbjct: 354 CSSAHISLLVSGSADTCFNDQLLENHIKKELIEKSQLVQAFPNHEQSKAPSSEPRRSASV 413 Query: 1708 ACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDDAERLLFF 1529 ACG+SV+EVC+++P WASQVLRQLAP++SYR+LV LGIASIQGL VASF KDDAERLLFF Sbjct: 414 ACGSSVFEVCMQVPAWASQVLRQLAPNLSYRSLVMLGIASIQGLPVASFNKDDAERLLFF 473 Query: 1528 SARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVKREELEDKESALR 1349 RQ K+ N+ +P+WL+ P+ SRKRSE C S + +G Sbjct: 474 
CTRQEKENCPNDHVFSGIPSWLKPPSTSRKRSEPCSSSKSINDSGRGVEA---------- 523 Query: 1348 NGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPNLPPVAPSKHN 1169 + RQK +A++RPIPH K+LPFSGLSE ++G K NLP +AP KHN Sbjct: 524 ------IGSHRQKFNLASMRPIPHSNRHKILPFSGLSEGTRYDGDHGKSNLP-LAPIKHN 576 Query: 1168 SVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLKDVMQFLILRG 989 G T ++RKS S S+QA QIISLNPLP+KKHGC R+PI CSEEEFL+DVMQFLILRG Sbjct: 577 VSGPTSVTNRKSVSNSFQAHQIISLNPLPMKKHGCDRAPIRACSEEEFLRDVMQFLILRG 636 Query: 988 HTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTVT 809 H RLIP GGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHT+T Sbjct: 637 HNRLIPPGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTMT 696 Query: 808 NKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDR 629 N+MTGVGNTLKRHYETYLLEYEL+HDDVDGECCL+C SSAAGDWVNCG+CGEWAHFGCDR Sbjct: 697 NRMTGVGNTLKRHYETYLLEYELSHDDVDGECCLMCHSSAAGDWVNCGICGEWAHFGCDR 756 Query: 628 RQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 512 RQGLGAFKDYAKTDGLEY+CP+CS F KK QKT+NG+ Sbjct: 757 RQGLGAFKDYAKTDGLEYVCPRCSALKFSKKSQKTANGF 795