BLASTX nr result
ID: Catharanthus22_contig00009992
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00009992 (3426 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_006362097.1| PREDICTED: AT-rich interactive domain-contai... 1175 0.0 ref|XP_004252398.1| PREDICTED: AT-rich interactive domain-contai... 1174 0.0 emb|CBI35803.3| unnamed protein product [Vitis vinifera] 1145 0.0 gb|EMJ18234.1| hypothetical protein PRUPE_ppa001668mg [Prunus pe... 1120 0.0 ref|XP_006452906.1| hypothetical protein CICLE_v10007563mg [Citr... 1108 0.0 ref|XP_006474564.1| PREDICTED: AT-rich interactive domain-contai... 1107 0.0 gb|EXB64667.1| AT-rich interactive domain-containing protein 4 [... 1096 0.0 ref|XP_004303747.1| PREDICTED: AT-rich interactive domain-contai... 1086 0.0 ref|XP_006381551.1| hypothetical protein POPTR_0006s13780g [Popu... 1070 0.0 gb|EOY30139.1| ARID/BRIGHT DNA-binding domain-containing protein... 1062 0.0 ref|XP_002277324.1| PREDICTED: AT-rich interactive domain-contai... 1051 0.0 ref|XP_002324130.2| arid/bright DNA-binding domain-containing fa... 1050 0.0 ref|XP_002516200.1| DNA binding protein, putative [Ricinus commu... 1048 0.0 ref|XP_003547888.1| PREDICTED: AT-rich interactive domain-contai... 1038 0.0 gb|ESW07366.1| hypothetical protein PHAVU_010G123900g [Phaseolus... 1034 0.0 gb|EOY30141.1| ARID/BRIGHT DNA-binding domain-containing protein... 1024 0.0 ref|XP_004510562.1| PREDICTED: AT-rich interactive domain-contai... 1004 0.0 ref|XP_003533805.1| PREDICTED: AT-rich interactive domain-contai... 998 0.0 ref|XP_006587068.1| PREDICTED: AT-rich interactive domain-contai... 995 0.0 ref|XP_006587067.1| PREDICTED: AT-rich interactive domain-contai... 995 0.0 >ref|XP_006362097.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Solanum tuberosum] Length = 770 Score = 1175 bits (3040), Expect = 0.0 Identities = 575/770 (74%), Positives = 645/770 (83%), Gaps = 3/770 (0%) Frame = +1 Query: 523 MSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSAD 702 M H Q S+ +CSLLAV CG +E +Q K+V D K RY FP++VSSGRLEVQ LKNPS D Sbjct: 1 MFHCQGTSRQSCSLLAVLCGSTSEYDQKKDVHDGKPRYCFPEIVSSGRLEVQVLKNPSTD 60 Query: 703 EFGKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPN 882 EF KVLDSWQPNI+YLQGE L + GSLVWGG++LS+ EAI GLFSS LPT VYLELPN Sbjct: 61 EFHKVLDSWQPNIVYLQGEHLSNDEVGSLVWGGLDLSSAEAISGLFSSALPTAVYLELPN 120 Query: 883 GEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLY 1062 GE+LAEALH+KGIPYV+YWK FS +AA HFRHA V QSS+CH WDAFQLA ASFRLY Sbjct: 121 GEKLAEALHAKGIPYVMYWKSAFSCYAASHFRHAFLCVAQSSTCHVWDAFQLAQASFRLY 180 Query: 1063 CVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDEDI 1242 CV+NN VLPE SQ+ + +GP L+G+PP I+ +S ALPAIKIYD+D+ Sbjct: 181 CVQNNFVLPEMSQRDSDNMGPHLLGDPPNIDVPPPEAGPDDDEESNSDALPAIKIYDDDV 240 Query: 1243 NMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVT 1422 MRFLVCG SLD +L + DGLNALL+IEMRGSKLHNR SALPPPLQAGTFSRGVVT Sbjct: 241 TMRFLVCGLPCSLDECLLGSIADGLNALLNIEMRGSKLHNRVSALPPPLQAGTFSRGVVT 300 Query: 1423 MRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSE 1602 MRCD+ST SSAHISLLVSGSAQTCFDD +LENHIKSE+I+N+ LVH LP+ EEN+P +S Sbjct: 301 MRCDLSTSSSAHISLLVSGSAQTCFDDLLLENHIKSEIIENSTLVHVLPSDEENRPPISA 360 Query: 1603 PRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDD 1782 PRRS+S+ACG+ V+EVC+K+P WASQVLRQLAPDVSYR+LVALGIASIQGLAVASFEKDD Sbjct: 361 PRRSMSVACGSEVFEVCMKVPMWASQVLRQLAPDVSYRSLVALGIASIQGLAVASFEKDD 420 Query: 1783 AERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVKREEL- 1959 A+RLLFF +QGKD N IG P WLR PAPSRKRS+ Q +S QNG+ + Sbjct: 421 AQRLLFFYTKQGKDGFFGNFKIGDPPAWLRPPAPSRKRSDFYQGASYICQNGSTPGNHVA 480 Query: 1960 --EDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPN 2133 E+KES L NG P+V ARQK KVAA+RPIPHVRHQKMLPFS +SE+D +G QVK N Sbjct: 481 VKEEKESRLGNGVATPLVTARQKLKVAAMRPIPHVRHQKMLPFSRISELDSLDGNQVKTN 540 Query: 2134 LPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLK 2313 LP + +K ++VGVTP +HRKS S+S+QAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLK Sbjct: 541 LPIIPSTKGSNVGVTPVTHRKSASSSHQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLK 600 Query: 2314 DVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 2493 DVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV Sbjct: 601 DVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 660 Query: 2494 FSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVC 2673 FSKMRNHTVTN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+C Sbjct: 661 FSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCNSSAAGDWVNCGIC 720 Query: 2674 GEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 2823 GEWAHFGCDRR GLGAFKDYAKTDGLEYICPQCSV+ FKKK+ +T+NGYS Sbjct: 721 GEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVTNFKKKVLRTANGYS 770 >ref|XP_004252398.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Solanum lycopersicum] Length = 771 Score = 1174 bits (3037), Expect = 0.0 Identities = 577/771 (74%), Positives = 647/771 (83%), Gaps = 4/771 (0%) Frame = +1 Query: 523 MSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSAD 702 M H Q AS+ +CSLLAV CGR +E +Q K+V D K RY FP++VSSGRLEVQ LKNPS D Sbjct: 1 MFHCQGASRQSCSLLAVLCGRTSEYDQKKDVHDGKPRYCFPEIVSSGRLEVQVLKNPSTD 60 Query: 703 EFGKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPN 882 EF KVLDSWQPNI+YLQGE L + GSLVWGG++LS+ EAI GLFSSVLPT VYLELPN Sbjct: 61 EFHKVLDSWQPNIVYLQGEHLSNDEVGSLVWGGLDLSSAEAISGLFSSVLPTAVYLELPN 120 Query: 883 GEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLY 1062 GE+LAEALH+KGIPYV+YWK FS +AA HFRHA V QSS+CH WDAFQLAHASFRLY Sbjct: 121 GEKLAEALHAKGIPYVMYWKSAFSCYAASHFRHAFLCVAQSSTCHVWDAFQLAHASFRLY 180 Query: 1063 CVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDEDI 1242 CVRNN L E SQ+ + +GP L+G+PP I+ +S ALPAIKIYD+D+ Sbjct: 181 CVRNNFALSEMSQRDSDNVGPHLLGDPPNIDVPLPEAGPEDDEESNSDALPAIKIYDDDV 240 Query: 1243 NMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVT 1422 MRFLVCG SLD +L + DGLNALL+IEMRGSKLHNR SALPPPLQAGTFSRGVVT Sbjct: 241 TMRFLVCGLPCSLDECLLGSIADGLNALLNIEMRGSKLHNRVSALPPPLQAGTFSRGVVT 300 Query: 1423 MRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSE 1602 MRCD+ST SSAHISLLVSGSAQTCFDD +LENHIKSE+I+N+ LVH LP+ EEN+P +S Sbjct: 301 MRCDLSTSSSAHISLLVSGSAQTCFDDLLLENHIKSEIIENSTLVHVLPSDEENRPPISA 360 Query: 1603 PRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDD 1782 PRRS+S+ACG+ V+EVC+K+P WASQVLRQLAPDVSYR+LVALGIASIQGLAVASFEKDD Sbjct: 361 PRRSMSVACGSEVFEVCMKVPMWASQVLRQLAPDVSYRSLVALGIASIQGLAVASFEKDD 420 Query: 1783 AERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVKREEL- 1959 A+RLLFF +QGKD N +G+ P WLR PAPSRKRS+ Q +S QNG + Sbjct: 421 AQRLLFFCTKQGKDGFFGNFKMGNPPAWLRPPAPSRKRSDFYQGASYICQNGLTPGNHVA 480 Query: 1960 --EDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPN 2133 E+KES L NG P+V ARQK KVAA+RPIPHVRHQKMLPFS +SE+D +G QVK N Sbjct: 481 VKEEKESRLGNGVATPLVTARQKLKVAAMRPIPHVRHQKMLPFSRISELDSLDGNQVKTN 540 Query: 2134 LPPVAPS-KHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFL 2310 LP + S K ++VGVTPA+HRKS S+S+QAKQIISLNPLPLKKHGCGRSPIHVCSEEEFL Sbjct: 541 LPIIPSSTKGSNVGVTPATHRKSASSSHQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFL 600 Query: 2311 KDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQ 2490 KDVMQFLILRGHTRLIPQ G+AEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQ Sbjct: 601 KDVMQFLILRGHTRLIPQSGIAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQ 660 Query: 2491 VFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGV 2670 VFSKMRNHTVTN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+ Sbjct: 661 VFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCNSSAAGDWVNCGI 720 Query: 2671 CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 2823 CGEWAHFGCDRR GLGAFKDYAKTDGLEYICPQCSV+ FKKK+ +T+NGYS Sbjct: 721 CGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVTNFKKKVLRTANGYS 771 >emb|CBI35803.3| unnamed protein product [Vitis vinifera] Length = 746 Score = 1145 bits (2961), Expect = 0.0 Identities = 569/767 (74%), Positives = 626/767 (81%) Frame = +1 Query: 523 MSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSAD 702 M H Q S +TC LLAV CG+ +E +Q +++ RYPFPD VSSGRLEVQTL +PS D Sbjct: 1 MLHTQGISNHTCGLLAVTCGKTSECKQEHETSNDRPRYPFPDFVSSGRLEVQTLTSPSPD 60 Query: 703 EFGKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPN 882 EF +V +S QPN +Y QGEQL + GSLVWGGV LS+ E ICGLF S LPTTVYLE+PN Sbjct: 61 EFRRVFESVQPNFVYFQGEQLQNDEVGSLVWGGVELSSAEDICGLFGSKLPTTVYLEIPN 120 Query: 883 GEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLY 1062 GE+LAEALHSKGIPYVIYWK+ FS +AACHFR+ALFSVVQSSS HTWDAFQLA+ASFRLY Sbjct: 121 GEKLAEALHSKGIPYVIYWKNAFSCYAACHFRNALFSVVQSSSTHTWDAFQLAYASFRLY 180 Query: 1063 CVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDEDI 1242 CVRNN VLP NS KV+GKLGP L+G+P I+ S G LPAIKIYD+D+ Sbjct: 181 CVRNNHVLPANSHKVSGKLGPRLLGDPATIDVPPPEVDAGEDEEGSLGTLPAIKIYDDDV 240 Query: 1243 NMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVT 1422 +RFLVCG+ LDS + E LEDGLNALLSIE+RGSKLHNR SA PPPLQAGTFSRGVVT Sbjct: 241 GIRFLVCGEPCMLDSCLFESLEDGLNALLSIEIRGSKLHNRVSAPPPPLQAGTFSRGVVT 300 Query: 1423 MRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSE 1602 MRCD+ST SSAHISLLVSGSAQTCFDDQ+LEN+IK EV + + LVHALP SE NKP LSE Sbjct: 301 MRCDLSTCSSAHISLLVSGSAQTCFDDQLLENNIKKEVTEQSQLVHALPYSEGNKPPLSE 360 Query: 1603 PRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDD 1782 PRRS SIACGA+V+EVC K+P WASQVLRQLAPDVSYR+LVALGIASIQGLAVASFEKDD Sbjct: 361 PRRSASIACGAAVFEVCAKVPAWASQVLRQLAPDVSYRSLVALGIASIQGLAVASFEKDD 420 Query: 1783 AERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVKREELE 1962 A RLLFF RQGK H NN LP+WL+ P PSRKR E Q + Sbjct: 421 ANRLLFFCTRQGKYIHPNNFTPSRLPSWLKPPPPSRKRVEPSQDT--------------- 465 Query: 1963 DKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPNLPP 2142 NG +P++PA Q+ KVAA+RPIPH+RH KMLPFSG+SE DGH+G QVK NL Sbjct: 466 ------MNGVTMPLLPAGQRLKVAAMRPIPHIRHHKMLPFSGISEADGHDGGQVKANLSV 519 Query: 2143 VAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLKDVM 2322 P+KH+ VG T A HRKS S+SYQAKQIISLNPLPLKKHGCGRSPI VCSEEEFLKDVM Sbjct: 520 PPPTKHSIVGSTSAMHRKSFSSSYQAKQIISLNPLPLKKHGCGRSPIRVCSEEEFLKDVM 579 Query: 2323 QFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSK 2502 QFL LRGHTRLIPQGGLAEFPDAILNAKRLDL+NLYREVVSRGGFHVGNGINWKGQVFSK Sbjct: 580 QFLNLRGHTRLIPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGFHVGNGINWKGQVFSK 639 Query: 2503 MRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVCGEW 2682 MRNHTVTN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+CGEW Sbjct: 640 MRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEW 699 Query: 2683 AHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 2823 AHFGCDRRQGLGAFKDYAKTDGLEYICPQCSV+ FKKK K NG+S Sbjct: 700 AHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVTNFKKKANKAPNGFS 746 >gb|EMJ18234.1| hypothetical protein PRUPE_ppa001668mg [Prunus persica] Length = 783 Score = 1120 bits (2896), Expect = 0.0 Identities = 552/774 (71%), Positives = 640/774 (82%), Gaps = 7/774 (0%) Frame = +1 Query: 523 MSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSAD 702 M+H Q ASK TCSLL V CG+ +E + ++ LDEK +YPFP+LVS GRLEVQTL PS + Sbjct: 1 MNHSQGASKQTCSLLVVTCGKISEEKPNEDTLDEKLKYPFPELVSLGRLEVQTLTKPSKE 60 Query: 703 EFGKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPN 882 EF K+L+S++PN++YLQGEQL + GS VW V+LST EAI +FS+ LPTTVYLE+PN Sbjct: 61 EFCKMLESYKPNLVYLQGEQLENNEIGSPVWEDVDLSTAEAISEIFSATLPTTVYLEVPN 120 Query: 883 GEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLY 1062 GE LA ALHSKGIPYVIYWKH+FSS+AACHFRHAL SVVQSSS HTWDAFQLA+ASFRLY Sbjct: 121 GENLAAALHSKGIPYVIYWKHEFSSYAACHFRHALLSVVQSSSTHTWDAFQLAYASFRLY 180 Query: 1063 CVRNNLVLPENSQKVNG-KLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDED 1239 CV N+ +P N K + +LGP L+G+ KIN S G LPAIKI+D+D Sbjct: 181 CVENSHAIPANRHKSSSAELGPCLLGDRLKINVDPPEADVEEDEEGSLGTLPAIKIHDDD 240 Query: 1240 INMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVV 1419 + +RFLVCG+ +LD+S+LEPLEDGLNALL+IEMRGSKLH + SA PPPLQAGTFSRGVV Sbjct: 241 VILRFLVCGEPSTLDASLLEPLEDGLNALLNIEMRGSKLHGKFSAPPPPLQAGTFSRGVV 300 Query: 1420 TMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLS 1599 TMRCD+ST SSAHISLLVSGSAQTCFDDQ+LENHIK+EVI+ LV ALP +E NK L+ Sbjct: 301 TMRCDVSTCSSAHISLLVSGSAQTCFDDQLLENHIKNEVIEEIQLVRALPNNEGNKVPLA 360 Query: 1600 EPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKD 1779 EPR+S SIACGA+V+EVC+K+P WASQVLRQLAPDVSY +LVALGIASIQGL VASFEK+ Sbjct: 361 EPRKSASIACGATVFEVCMKVPAWASQVLRQLAPDVSYHSLVALGIASIQGLPVASFEKE 420 Query: 1780 DAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSI---YSQ---NGA 1941 DAERLLFF + GKD N+ +GS PTWLR P PSRKRS+ CQ +S YSQ + A Sbjct: 421 DAERLLFFCSSLGKDNKSNDFILGSPPTWLRPPPPSRKRSQPCQETSRGSNYSQRLPSLA 480 Query: 1942 VKREELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQ 2121 + + ++KE+ NG P++P RQ+ K+AA+RPIPHVR KM PFSG+SE+DGH+G Q Sbjct: 481 ASKIDEDNKEAGAMNGVSTPLLPPRQRLKIAAMRPIPHVRRPKMTPFSGMSELDGHDGGQ 540 Query: 2122 VKPNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEE 2301 K NLPP P+K N VG+TP + RKS S+S +KQIISLNPLPLKKHGCGRSPIH C EE Sbjct: 541 FKANLPPAPPTKLNIVGLTPTTQRKSYSSSSHSKQIISLNPLPLKKHGCGRSPIHSCLEE 600 Query: 2302 EFLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 2481 EFLKDVMQFLILRGH+RLIPQGGLAEFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINW Sbjct: 601 EFLKDVMQFLILRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINW 660 Query: 2482 KGQVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVN 2661 KGQ+FSKMRN+T+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVN Sbjct: 661 KGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVN 720 Query: 2662 CGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 2823 CG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CS+S FKKK QK +NG+S Sbjct: 721 CGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSISNFKKKPQKIANGFS 774 >ref|XP_006452906.1| hypothetical protein CICLE_v10007563mg [Citrus clementina] gi|557556132|gb|ESR66146.1| hypothetical protein CICLE_v10007563mg [Citrus clementina] Length = 745 Score = 1108 bits (2866), Expect = 0.0 Identities = 551/769 (71%), Positives = 627/769 (81%), Gaps = 2/769 (0%) Frame = +1 Query: 520 LMSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSA 699 +M H Q +S+ CSLLAV + +++Q + D+K +YPFP++ SSGRLEV L +PS Sbjct: 1 MMFHAQSSSRNHCSLLAVLSRKFVDDKQKQAATDDKPKYPFPEIASSGRLEVHLLSSPST 60 Query: 700 DEFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEAICGLFSSVLPTTVYLEL 876 DEF ++L+S +PNI+YLQGE++ D GSLVWG V+LSTPEA+CGLF S LPTTVYLE+ Sbjct: 61 DEFRRLLESSEPNIVYLQGEKINDSEEIGSLVWGDVDLSTPEALCGLFGSTLPTTVYLEI 120 Query: 877 PNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFR 1056 PNGE AEALHS+G+PYVIYWKH FS +AACHF AL SVVQSS HTWDAFQLAHASFR Sbjct: 121 PNGENFAEALHSRGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFR 180 Query: 1057 LYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDE 1236 LYCVRNN+V+ NSQK + KLGP L+G+PPKI+ S LPAIKIYD+ Sbjct: 181 LYCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEEN-SPENLPAIKIYDD 239 Query: 1237 DINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGV 1416 D+ MRFLVCG +LD+S+L PLEDGLNALL+IE+RGSKLHNR SA PPPLQAG FSRGV Sbjct: 240 DVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGV 299 Query: 1417 VTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRL 1596 VTMRCD+ST SSAHISLLVSGSAQTCF+DQ+LENHIK+E+I+N+ LVHALP S +N+ Sbjct: 300 VTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPP 359 Query: 1597 SEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEK 1776 SEPR+S SIACGASV+EV +K+ WASQVLRQLAPDVSYR+LV LGIASIQGL+VASFEK Sbjct: 360 SEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEK 419 Query: 1777 DDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISS-IYSQNGAVKRE 1953 DDAERLLFF RQGK +H N + P+WL SPAPSRKRSE C+ S + S+N Sbjct: 420 DDAERLLFFCTRQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESEN------ 473 Query: 1954 ELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPN 2133 V R K AA+RPIPH RH KMLPFSG SEI+ ++G QVK N Sbjct: 474 ----------------VCNVRPKLNAAAMRPIPHTRHHKMLPFSGFSEIERYDGDQVKAN 517 Query: 2134 LPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLK 2313 LP VAP KH+S G TP +HRKS S+SYQA+QIISLNPLPLKKHGCGR+PI VCSEEEFL+ Sbjct: 518 LP-VAPLKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLR 576 Query: 2314 DVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 2493 DVMQFLILRGHTRL+PQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV Sbjct: 577 DVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 636 Query: 2494 FSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVC 2673 FSKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+C Sbjct: 637 FSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGIC 696 Query: 2674 GEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 2820 GEWAHFGCDRRQGLGAFKDYAKTDGLEY+CPQCSV+ FKKK QKTSNGY Sbjct: 697 GEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNGY 745 >ref|XP_006474564.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Citrus sinensis] Length = 745 Score = 1107 bits (2863), Expect = 0.0 Identities = 551/769 (71%), Positives = 627/769 (81%), Gaps = 2/769 (0%) Frame = +1 Query: 520 LMSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSA 699 +M H Q +S+ CSLLAV + +++Q + D+K +YPFP++ SSGRLEV L +PS Sbjct: 1 MMFHAQSSSRNHCSLLAVLSRKFVDDKQKQAATDDKPKYPFPEIASSGRLEVHLLSSPST 60 Query: 700 DEFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEAICGLFSSVLPTTVYLEL 876 DEF ++L+S +PNI+YLQGE++ D GSLVWG V+LSTPEA+CGLF S LPTTVYLE+ Sbjct: 61 DEFRRLLESSEPNIVYLQGEKINDSEEIGSLVWGDVDLSTPEALCGLFGSTLPTTVYLEI 120 Query: 877 PNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFR 1056 PNGE AEALHS+G+PYVIYWKH FS +AACHF AL SVVQSS HTWDAFQLAHASFR Sbjct: 121 PNGENFAEALHSRGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASFR 180 Query: 1057 LYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDE 1236 LYCVRNN+V+ NSQK + KLGP L+G+PPKI+ S LPAIKIYD+ Sbjct: 181 LYCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEEN-SPENLPAIKIYDD 239 Query: 1237 DINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGV 1416 D+ MRFLVCG +LD+S+L PLEDGLNALL+IE+RGSKLHNR SA PPPLQAG FSRGV Sbjct: 240 DVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGAFSRGV 299 Query: 1417 VTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRL 1596 VTMRCD+ST SSAHISLLVSGSAQTCF+DQ+LENHIK+E+I+N+ LVHALP S +N+ Sbjct: 300 VTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGDNRLPP 359 Query: 1597 SEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEK 1776 SEPR+S SIACGASV+EV +K+ WASQVLRQLAPDVSYR+LV LGIASIQGL+VASFEK Sbjct: 360 SEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEK 419 Query: 1777 DDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISS-IYSQNGAVKRE 1953 DDAERLLFF RQGK +H N + P+WL SPAPSRKRSE C+ S + S+N Sbjct: 420 DDAERLLFFCTRQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGVESEN------ 473 Query: 1954 ELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPN 2133 V R K AA+RPIPH RH KMLPFSG SEI+ ++G QVK N Sbjct: 474 ----------------VCNVRPKLNSAAMRPIPHTRHYKMLPFSGFSEIERYDGDQVKAN 517 Query: 2134 LPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLK 2313 LP VAP KH+S G TP +HRKS S+SYQA+QIISLNPLPLKKHGCGR+PI VCSEEEFL+ Sbjct: 518 LP-VAPLKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLR 576 Query: 2314 DVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 2493 DVMQFLILRGHTRL+PQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV Sbjct: 577 DVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 636 Query: 2494 FSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVC 2673 FSKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+C Sbjct: 637 FSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGIC 696 Query: 2674 GEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 2820 GEWAHFGCDRRQGLGAFKDYAKTDGLEY+CPQCSV+ FKKK QKTSNGY Sbjct: 697 GEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSNGY 745 >gb|EXB64667.1| AT-rich interactive domain-containing protein 4 [Morus notabilis] Length = 779 Score = 1096 bits (2835), Expect = 0.0 Identities = 543/771 (70%), Positives = 633/771 (82%), Gaps = 4/771 (0%) Frame = +1 Query: 523 MSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSAD 702 M H Q +SK TCSLLAV CG +E+++ K+V + +S YPFP+L+SSGRLEVQTL +PS + Sbjct: 1 MFHSQGSSKQTCSLLAVTCGNVSESKRKKDVPENRSLYPFPELISSGRLEVQTLTSPSKE 60 Query: 703 EFGKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPN 882 EF K+L+S++PN++YLQGEQL + G LVWG V+LSTPE++ LF + LPTTVYLE+P+ Sbjct: 61 EFSKLLESYKPNLVYLQGEQLANDEVGPLVWGDVDLSTPESVSELFGTTLPTTVYLEIPD 120 Query: 883 GEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLY 1062 EELAE LHSKG+PYVIYWK +FS AACHFR+AL SVV+SSS H WDAFQLA+ASFRLY Sbjct: 121 CEELAEELHSKGVPYVIYWKDRFSRHAACHFRNALLSVVKSSSTHAWDAFQLAYASFRLY 180 Query: 1063 CVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDEDI 1242 CVRNN VLP +++ + GP L+G+ KIN S LPAIKI+D+D+ Sbjct: 181 CVRNNHVLPSKGHEISDEQGPCLLGDRLKINVDPPAADVEDDEDGSLDTLPAIKIHDDDL 240 Query: 1243 NMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVT 1422 ++RFLVCG +LD S+LEPLEDGLNALL+IE+RG +LH + SA PPPLQAGTFSRGVVT Sbjct: 241 SLRFLVCGVPSTLDESVLEPLEDGLNALLNIEIRGGRLHGKFSAPPPPLQAGTFSRGVVT 300 Query: 1423 MRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPT-SEENKPRLS 1599 MRCD+ST S AHIS+L+SGSAQTCFDDQ+LENHIK+E+I+N+ LV ALPT SE NK LS Sbjct: 301 MRCDLSTCSCAHISILLSGSAQTCFDDQLLENHIKNEIIENSQLVRALPTASEGNKLPLS 360 Query: 1600 EPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKD 1779 EPR+S SIACGA+V+EVC+K+P WASQVLRQLAPDVSY +LVALGIASIQG+ VASFEK+ Sbjct: 361 EPRKSASIACGATVFEVCMKVPAWASQVLRQLAPDVSYHSLVALGIASIQGIPVASFEKE 420 Query: 1780 DAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQ---NGAVKR 1950 DAERLLFF + QGK E N+L + P WLR PAPSRKRS+ S N V + Sbjct: 421 DAERLLFFCSSQGK-EISNDLVFSNPPPWLRPPAPSRKRSQETSPGSHDGHRVPNQVVSK 479 Query: 1951 EELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKP 2130 E EDKE NG LP++PARQ+ KVAA+RPIPHVR KM PFSG+SE DGH+G QVK Sbjct: 480 SEEEDKERGPSNGVSLPLLPARQRLKVAAMRPIPHVRRPKMTPFSGISEADGHDGGQVKA 539 Query: 2131 NLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFL 2310 +P P+K + VG+TP++ RKS S+S QAKQIISLNPLPLKKHGCGRS IH CSEEEFL Sbjct: 540 IVPVAPPTKLSIVGLTPSAQRKSFSSSSQAKQIISLNPLPLKKHGCGRSSIHTCSEEEFL 599 Query: 2311 KDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQ 2490 KDVMQFLILRGHTRLIPQ GLAEFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ Sbjct: 600 KDVMQFLILRGHTRLIPQSGLAEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQ 659 Query: 2491 VFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGV 2670 +FSKMRN+T+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+ Sbjct: 660 IFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGI 719 Query: 2671 CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 2823 CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CSVS FKKK QK SNG+S Sbjct: 720 CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVSNFKKKSQKVSNGFS 770 >ref|XP_004303747.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Fragaria vesca subsp. vesca] Length = 779 Score = 1086 bits (2809), Expect = 0.0 Identities = 534/763 (69%), Positives = 630/763 (82%), Gaps = 7/763 (0%) Frame = +1 Query: 553 TCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSADEFGKVLDSWQ 732 TCS+L V CG +E+++ K ++K RYPFP+LVSSGRLEVQTL NPS +EF K+L+S++ Sbjct: 7 TCSVLVVTCGEISEDKRGKETPEDKLRYPFPELVSSGRLEVQTLTNPSEEEFCKLLESYK 66 Query: 733 PNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPNGEELAEALHS 912 PN++YLQGEQL + G LVW LST E++ +F + LPTTVYLE+PNGEELA AL S Sbjct: 67 PNLVYLQGEQLENDEVGPLVWRDAYLSTAESMSDIFDATLPTTVYLEVPNGEELAVALQS 126 Query: 913 KGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLYCVRNNLVLPE 1092 KGIPYVIYWK S++AACHFRHAL SVVQSSS HTWDAFQLAHASFRLYCV+N+ V+ Sbjct: 127 KGIPYVIYWKDAISTYAACHFRHALLSVVQSSSTHTWDAFQLAHASFRLYCVQNDHVVRV 186 Query: 1093 NSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDEDINMRFLVCGDT 1272 N K + +LGP ++GE KI+ ++G+LPAIKI+D+D+++RFLVCG Sbjct: 187 NLDKPSAELGPCILGEHLKISVDPPEADMEEDEEGATGSLPAIKIHDDDVSLRFLVCGQP 246 Query: 1273 RSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVTMRCDISTVSS 1452 +LD+ ILEPLEDGLNALL+IEMRGSKLH + SA PPPLQAGTFSRGVVTMRCDIST SS Sbjct: 247 STLDAGILEPLEDGLNALLNIEMRGSKLHGKFSAPPPPLQAGTFSRGVVTMRCDISTCSS 306 Query: 1453 AHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSEPRRSVSIACG 1632 AHISLLVSGSAQTCFDDQ+LENHIK EVI+ LVHA+P ++ NK L EPR+S +IACG Sbjct: 307 AHISLLVSGSAQTCFDDQLLENHIKHEVIEINQLVHAVPNNDRNKLPLVEPRKSAAIACG 366 Query: 1633 ASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDDAERLLFFSAR 1812 A+V+EV +K+P WASQVLRQLAPDVSYR+LV+LGIASIQGL VASFEKDDA+RLLFF + Sbjct: 367 ATVFEVSMKVPVWASQVLRQLAPDVSYRSLVSLGIASIQGLPVASFEKDDADRLLFFCSS 426 Query: 1813 QGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQ--ISSIYSQNGA--VKREELEDKESAL 1980 + KD LN+L + + P WLR PAPS+KRS +CQ I ++ G + ++E+ E AL Sbjct: 427 RTKDSQLNDLFLSTPPAWLRPPAPSKKRSRLCQEAIPGFRNRQGLPNLAASKVEENEKAL 486 Query: 1981 R--NGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQ-VKPNLPPVAP 2151 NGF P++PARQ+ K AA+RPIPHVR KM PFSG+SE++GH+G+Q VK +LPPV P Sbjct: 487 GAVNGFSTPLLPARQRLKTAAMRPIPHVRRPKMTPFSGISEVNGHDGSQVVKAHLPPVPP 546 Query: 2152 SKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLKDVMQFL 2331 +K N VG+TP + RKS S+S QAKQIISLNPLPLKKHGCGR PIH C EEEFLKDVMQFL Sbjct: 547 TKLNIVGLTPTTQRKSYSSSSQAKQIISLNPLPLKKHGCGRGPIHSCLEEEFLKDVMQFL 606 Query: 2332 ILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRN 2511 ILRGH+RLIPQGGL EFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINWKGQ+FSKMRN Sbjct: 607 ILRGHSRLIPQGGLTEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWKGQIFSKMRN 666 Query: 2512 HTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHF 2691 +T+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+CGEWAHF Sbjct: 667 YTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHF 726 Query: 2692 GCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 2820 GCDRRQGLGAFKDYAKTDGLEYICP CS+S FKKK QK +NG+ Sbjct: 727 GCDRRQGLGAFKDYAKTDGLEYICPHCSISNFKKKPQKVTNGF 769 >ref|XP_006381551.1| hypothetical protein POPTR_0006s13780g [Populus trichocarpa] gi|550336257|gb|ERP59348.1| hypothetical protein POPTR_0006s13780g [Populus trichocarpa] Length = 749 Score = 1070 bits (2767), Expect = 0.0 Identities = 533/768 (69%), Positives = 613/768 (79%), Gaps = 1/768 (0%) Frame = +1 Query: 520 LMSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSA 699 +M H Q + C+LLAV CG++ +N+Q + + D+K R+PFP+L S+GRLEVQ L NPS Sbjct: 1 MMFHAQGPLRNHCTLLAVLCGKSGDNKQKQPLSDDKPRFPFPELASAGRLEVQVLTNPST 60 Query: 700 DEFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEAICGLFSSVLPTTVYLEL 876 DEF +VL S +P+I+Y QGEQ+ D G L WG ++LSTPE++CGLF S LP TVYLE+ Sbjct: 61 DEFQRVLHSLEPSIVYFQGEQIEDSEEIGPLRWGDIDLSTPESLCGLFGSTLPPTVYLEI 120 Query: 877 PNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFR 1056 PNGE+LAEALHSKG+PYVIYWK FS +A HFR AL SVVQSS HT DAFQLA+ASFR Sbjct: 121 PNGEKLAEALHSKGVPYVIYWKSMFSCYAVSHFRQALLSVVQSSCSHTCDAFQLAYASFR 180 Query: 1057 LYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDE 1236 LYC RNN L N QKV GK GP L+G+PPK + SSGALPAIKIYD+ Sbjct: 181 LYCGRNNNTLASNGQKVGGKPGPQLLGDPPKFDITLPEADDQGEES-SSGALPAIKIYDD 239 Query: 1237 DINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGV 1416 D+ MRFLVCG + +LD+ +LE LEDGLNALL+IE+RGSKLHNR SA PPPLQAGTFSRGV Sbjct: 240 DVTMRFLVCGLSCTLDACLLESLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGTFSRGV 299 Query: 1417 VTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRL 1596 VTMRCD+ST SSAHISLLVSGSAQTCF+DQ+LENHIK+E+I+N+ LVHAL + EE+K Sbjct: 300 VTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALTSFEESKSPS 359 Query: 1597 SEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEK 1776 SEPR+S SIACGASV+EV +K+P WASQVLRQLAPDVSYR+LV LGIASIQGL+VASFEK Sbjct: 360 SEPRKSASIACGASVFEVSMKVPTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFEK 419 Query: 1777 DDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVKREE 1956 DDA+RLLFF + QGK+ H N + PTWL PAP RKRSE + + + Sbjct: 420 DDADRLLFFCSEQGKESHPLNTFLTRPPTWLIPPAPCRKRSEPTR-----------ETKP 468 Query: 1957 LEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPNL 2136 L G + K VAA+RPIPH KMLPFSG + + ++G Q KP+L Sbjct: 469 LTSGRGGENGG------NVKHKFHVAAMRPIPHTHRHKMLPFSGFFDAERYDGEQAKPSL 522 Query: 2137 PPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLKD 2316 PP P KH+ VG P +HRKS S+SYQA+QIISLNPLPLKKHGCGRSPI VCSEEEFL+D Sbjct: 523 PP-PPPKHSVVGPAPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGRSPIQVCSEEEFLRD 581 Query: 2317 VMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVF 2496 VMQFLILRGH+RL+PQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVF Sbjct: 582 VMQFLILRGHSRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVF 641 Query: 2497 SKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVCG 2676 SKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+CG Sbjct: 642 SKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGICG 701 Query: 2677 EWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 2820 EWAHFGCDRRQGLGAFKDYAKTDGLEYICP CS++ FKKK QKT+NGY Sbjct: 702 EWAHFGCDRRQGLGAFKDYAKTDGLEYICPNCSIANFKKKSQKTTNGY 749 >gb|EOY30139.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 1 [Theobroma cacao] gi|508782884|gb|EOY30140.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 1 [Theobroma cacao] Length = 746 Score = 1062 bits (2747), Expect = 0.0 Identities = 542/769 (70%), Positives = 616/769 (80%), Gaps = 2/769 (0%) Frame = +1 Query: 520 LMSHIQVASKYTCSLLAVFCGRN-AENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPS 696 +M Q +S+ CSLLAV G N ++N+Q + V D+K RYPFP+L SSGRLEVQ L +P+ Sbjct: 1 MMFSAQGSSRNHCSLLAVLSGGNVSDNKQKQPVSDDKPRYPFPELASSGRLEVQLLNSPN 60 Query: 697 ADEFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEAICGLFSSVLPTTVYLE 873 DE +VL+S +PN++YLQGEQ D G L+WG V+LSTPE +CGLF S LPTTVYLE Sbjct: 61 IDELRRVLESTEPNVVYLQGEQNADSEEIGPLIWGDVDLSTPETLCGLFDSTLPTTVYLE 120 Query: 874 LPNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASF 1053 PNG++LAEALHS+G+PYVIYWK+ FS FAACHFR AL SV+QSS HTWDAFQLAHASF Sbjct: 121 TPNGDKLAEALHSQGVPYVIYWKNTFSRFAACHFRQALLSVIQSSCSHTWDAFQLAHASF 180 Query: 1054 RLYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYD 1233 RLYCVRNN V+ NSQK + K GP L+GE PKI+ S LPAIKIYD Sbjct: 181 RLYCVRNNNVVSSNSQKQSVKPGPRLLGEAPKIDVSQPEVDMQGEES-SPENLPAIKIYD 239 Query: 1234 EDINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRG 1413 +D+ +RFLVCG LD+ +L LEDGLNALLSIE+RGSKLHNRASA PPPLQAGTFSRG Sbjct: 240 DDVTVRFLVCGSPCILDAFLLGSLEDGLNALLSIEIRGSKLHNRASAPPPPLQAGTFSRG 299 Query: 1414 VVTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPR 1593 VVTMRCD ST SSAHISLLVSGSAQTCF+DQ+LENHIK+E+I+ + LVHA +SEE+K Sbjct: 300 VVTMRCDFSTCSSAHISLLVSGSAQTCFNDQLLENHIKNEIIEKSQLVHAQSSSEESKLP 359 Query: 1594 LSEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFE 1773 SEPRRS SIACGASV+EVC+K+P WASQVLRQLAPDVSYR+LV LGIASIQGL+VASFE Sbjct: 360 SSEPRRSASIACGASVFEVCMKVPTWASQVLRQLAPDVSYRSLVMLGIASIQGLSVASFE 419 Query: 1774 KDDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVKRE 1953 KDDAERLLFF RQ KD ++ I P+WL PAPSRKRSE C+ S + G Sbjct: 420 KDDAERLLFFCMRQDKDPLQDSSVIAISPSWLVPPAPSRKRSEPCKDSKPLNCTG----- 474 Query: 1954 ELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPN 2133 +E + NG AR KS VAA+RPIPH K++PFSG SE + ++G Q K N Sbjct: 475 -MEGE-----NGI------ARPKSNVAAMRPIPHTHRHKIIPFSGFSEAERYDGDQGKVN 522 Query: 2134 LPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLK 2313 LP V P K + P +HRK+ S+SYQA+QIISLNPLPLKKHGCGR+PI VCSEEEFL+ Sbjct: 523 LP-VVPVKQPA----PVTHRKALSSSYQAQQIISLNPLPLKKHGCGRAPIQVCSEEEFLR 577 Query: 2314 DVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 2493 DVMQFLILRGHTRL+PQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV Sbjct: 578 DVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 637 Query: 2494 FSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVC 2673 FSKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+C Sbjct: 638 FSKMRNHTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGIC 697 Query: 2674 GEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 2820 GEWAHFGCDRRQGLGAFKDYAKTDGLEY+CP CS+S FKKK QKT NGY Sbjct: 698 GEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSISNFKKKPQKTVNGY 746 >ref|XP_002277324.1| PREDICTED: AT-rich interactive domain-containing protein 4 [Vitis vinifera] gi|297738501|emb|CBI27746.3| unnamed protein product [Vitis vinifera] Length = 739 Score = 1051 bits (2718), Expect = 0.0 Identities = 536/769 (69%), Positives = 612/769 (79%), Gaps = 3/769 (0%) Frame = +1 Query: 523 MSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSAD 702 M H+Q AS+ C+LLAV CG+ +E ++ YPFP+LVSSGRLEVQ LKNPS Sbjct: 1 MFHVQAASRNHCALLAVVCGKIPVSED-----QQQHPYPFPELVSSGRLEVQILKNPSIH 55 Query: 703 EFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELP 879 EF + L+S +PN LYLQGEQLP GSL WGGV+LS+ EA+ LF LPTTVYLE P Sbjct: 56 EFQRSLESLEPNFLYLQGEQLPGSEEIGSLTWGGVDLSSAEALVELFGPTLPTTVYLETP 115 Query: 880 NGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRL 1059 NGE+LA+ALHSKG+ YVIYWK+ FS +AACHFR ALFSVVQSS HTWDAFQLAHASFRL Sbjct: 116 NGEKLAKALHSKGVSYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFRL 175 Query: 1060 YCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDED 1239 YCV+NN V P N+QKV+GKLGP L+G+PPKIN + LP IKIYD D Sbjct: 176 YCVQNNTV-PSNNQKVSGKLGPCLLGDPPKINVVPPEVDEEESLPAT---LPVIKIYDAD 231 Query: 1240 INMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVV 1419 ++MRFLVCG +LD+ +L LEDGLNALL IE+RGSKLHNR SA PPPLQAGTFSRGVV Sbjct: 232 VSMRFLVCGAPSALDACLLGSLEDGLNALLCIEIRGSKLHNRVSAPPPPLQAGTFSRGVV 291 Query: 1420 TMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLS 1599 TMRCD+ST SSAHISLLVSGSAQTC +DQ+LE++IK+E+I+ + LVHA+P+ EE+K S Sbjct: 292 TMRCDLSTCSSAHISLLVSGSAQTCLNDQLLESYIKNELIEKSQLVHAVPSCEESKLSSS 351 Query: 1600 EPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKD 1779 EPRRS SIACGASV+EV +K+P WASQVLRQLAPDVSYR+LV LGIASIQGL+VASFEKD Sbjct: 352 EPRRSASIACGASVFEVRIKVPTWASQVLRQLAPDVSYRSLVTLGIASIQGLSVASFEKD 411 Query: 1780 DAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQIS--SIYSQNGAVKRE 1953 DA+RLLFF R K + NN + P+WL +P SRKRS C + S Y G V Sbjct: 412 DADRLLFFCTRHAKQLNQNNSILPRPPSWLIAPPASRKRSGPCHETKPSGYKVLGGV--- 468 Query: 1954 ELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPN 2133 NG L +QK K+AA+RPIPH R+ KMLPFSG+SE +G Q K N Sbjct: 469 ----------NGGVL-----QQKPKIAAMRPIPHTRNHKMLPFSGISEASRCDGDQAKGN 513 Query: 2134 LPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLK 2313 L V P+KHN G TP +HRK S+S+QA+QIISLNPLPLKKHGCGRSPI +CSEEEFL+ Sbjct: 514 LS-VVPAKHN--GTTPVTHRKLLSSSFQAQQIISLNPLPLKKHGCGRSPIQICSEEEFLR 570 Query: 2314 DVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQV 2493 DVMQFLILRGHTRL+PQGGLAEFPDAILNAKRLDL+NLYREVVSRGGFHVGNGINWKGQV Sbjct: 571 DVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGFHVGNGINWKGQV 630 Query: 2494 FSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVC 2673 FSKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNCG+C Sbjct: 631 FSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNCGIC 690 Query: 2674 GEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 2820 GEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CS++ F+KK QKT+NGY Sbjct: 691 GEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFQKKSQKTANGY 739 >ref|XP_002324130.2| arid/bright DNA-binding domain-containing family protein [Populus trichocarpa] gi|550318261|gb|EEF02695.2| arid/bright DNA-binding domain-containing family protein [Populus trichocarpa] Length = 746 Score = 1050 bits (2715), Expect = 0.0 Identities = 533/773 (68%), Positives = 604/773 (78%), Gaps = 6/773 (0%) Frame = +1 Query: 520 LMSHIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSA 699 +M H Q + C+LLAV CG++ E Q + D+K RYP P+L S+GRLEVQ L NPS Sbjct: 1 MMFHAQGPLRNHCTLLAVLCGKSGE--QKLPLSDDKPRYPLPELESTGRLEVQVLNNPST 58 Query: 700 DEFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEAICGLFSSVLPTTVYLEL 876 DEF +VL S +P+I+Y QGEQ+ D GSL W V LSTPE++CGLF S LP TVYLE+ Sbjct: 59 DEFRQVLQSLEPSIVYFQGEQVEDREEIGSLRWADVGLSTPESLCGLFGSTLPPTVYLEM 118 Query: 877 PNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFR 1056 PNGE+LAEALHSKG+PYVIYWK FS +AA HFR AL SVVQSS HT DAFQLAHASFR Sbjct: 119 PNGEKLAEALHSKGVPYVIYWKSAFSCYAASHFRQALLSVVQSSCSHTCDAFQLAHASFR 178 Query: 1057 LYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDE 1236 LYCV+NN NSQKV GK GP L+G+PPK + SSGALPAIKIYD+ Sbjct: 179 LYCVQNNNTPASNSQKVGGKPGPRLLGDPPKFDISLPEADDQGEEG-SSGALPAIKIYDD 237 Query: 1237 DINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGV 1416 D+ MRFLVCG T +LD+ L LEDGLNALL+IE+RGSKLHNR SA PPPLQAGTFSRGV Sbjct: 238 DVTMRFLVCGLTGTLDACALGSLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGTFSRGV 297 Query: 1417 VTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRL 1596 VTMRCD+ST SSAHISLLVSGSAQ CF+DQ+LENHIKSE+I+N+ LVHA +S+E K Sbjct: 298 VTMRCDLSTCSSAHISLLVSGSAQNCFNDQLLENHIKSELIENSQLVHASTSSDEIKSPS 357 Query: 1597 SEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEK 1776 SEPR+S SIACGASV+EV +K+P WASQVLRQLAPDV+YR+LV LGIASIQGL+VASFEK Sbjct: 358 SEPRKSASIACGASVFEVSMKVPTWASQVLRQLAPDVTYRSLVMLGIASIQGLSVASFEK 417 Query: 1777 DDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYS-----QNGA 1941 DDA+RLLFF +Q KD H N + P+WL PAP RKR E + + + +NG Sbjct: 418 DDADRLLFFCTKQSKDPHPRNPVLTRHPSWLIPPAPCRKRYEPSRETKPLTFGCGGENGG 477 Query: 1942 VKREELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQ 2121 +QK VAA+RPIPH R KMLPFSG E + ++G Q Sbjct: 478 ----------------------NFKQKLYVAAMRPIPHTRRHKMLPFSGFLEAERYDGEQ 515 Query: 2122 VKPNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEE 2301 KP+LPP P KH+ VG P +HRKS S SYQA+QIISLNPLPLKKHGCGRSPI CSEE Sbjct: 516 TKPSLPP--PPKHSVVGPAPVTHRKSLSNSYQAQQIISLNPLPLKKHGCGRSPIQACSEE 573 Query: 2302 EFLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 2481 EFL+DVMQFLILRGH+RL+PQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW Sbjct: 574 EFLRDVMQFLILRGHSRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 633 Query: 2482 KGQVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVN 2661 KGQVFSKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVN Sbjct: 634 KGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVN 693 Query: 2662 CGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 2820 CG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CS++ FKKK QK +NGY Sbjct: 694 CGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSIANFKKKSQKNANGY 746 >ref|XP_002516200.1| DNA binding protein, putative [Ricinus communis] gi|223544686|gb|EEF46202.1| DNA binding protein, putative [Ricinus communis] Length = 749 Score = 1048 bits (2709), Expect = 0.0 Identities = 526/725 (72%), Positives = 595/725 (82%), Gaps = 1/725 (0%) Frame = +1 Query: 649 LVSSGRLEVQTLKNPSADEFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEA 825 L SSGRLEVQ L +PS DEF +VL S +PNI+YLQGE + D GSL W G +LSTP+A Sbjct: 43 LXSSGRLEVQILSSPSTDEFRRVLQSSEPNIVYLQGEIIEDSEEIGSLRWAGADLSTPDA 102 Query: 826 ICGLFSSVLPTTVYLELPNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQS 1005 +C LF S LP TVYLE+PNGE+LAEALH KG+PYVIYWK FS +AA HFR AL SVVQS Sbjct: 103 LCELFGSTLPPTVYLEIPNGEKLAEALHFKGVPYVIYWKSTFSCYAAAHFRQALLSVVQS 162 Query: 1006 SSCHTWDAFQLAHASFRLYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXX 1185 S HT DAFQLAHASF LYCVRNN L N+QKV GK GP L+GEPPKI+ Sbjct: 163 SCSHTCDAFQLAHASFSLYCVRNNTGLSSNNQKVGGKPGPRLLGEPPKIDITLPEADVQD 222 Query: 1186 XXXXSSGALPAIKIYDEDINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNR 1365 SSG LPAIKIYD+D+ MRFLVC +LD+ +L LEDGLNALL+IE+RGSKLHNR Sbjct: 223 EES-SSGTLPAIKIYDDDVTMRFLVCELPSTLDACLLGSLEDGLNALLNIEIRGSKLHNR 281 Query: 1366 ASALPPPLQAGTFSRGVVTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDN 1545 SA PPPLQAGTFSRGVVTMRCD+ST SSAHISLLVSGSAQ CF+DQ+LENHIK+E+I+N Sbjct: 282 TSAPPPPLQAGTFSRGVVTMRCDLSTCSSAHISLLVSGSAQACFNDQLLENHIKNELIEN 341 Query: 1546 TCLVHALPTSEENKPRLSEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLV 1725 + LVHALP+SEE+K SEPR+S SI CGASV+EVC+K+P+WASQVLRQLAPDVSYR+LV Sbjct: 342 SQLVHALPSSEESKLLTSEPRKSASIGCGASVFEVCLKVPSWASQVLRQLAPDVSYRSLV 401 Query: 1726 ALGIASIQGLAVASFEKDDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEM 1905 LGIASIQGL+VASFEK+D ERLLFF RQGK+ + NN I P WL PAPSRKRSE Sbjct: 402 MLGIASIQGLSVASFEKEDTERLLFFCTRQGKELYPNNSIIIKPPCWLIPPAPSRKRSEP 461 Query: 1906 CQISSIYSQNGAVKREELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFS 2085 C+ + +++ G ++RE NG + +QK VAA+RPIPH RH KMLPFS Sbjct: 462 CRETKLFTSKG-LERE----------NGGSV-----KQKLNVAAMRPIPHTRHHKMLPFS 505 Query: 2086 GLSEIDGHEGTQVKPNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHG 2265 G +E + ++G Q KP+LP VAP+KH VG P SHRKS S+SYQA+QIISLNPLPLKKHG Sbjct: 506 GFAEGERYDGDQGKPSLP-VAPAKHGVVGPAPVSHRKSLSSSYQAQQIISLNPLPLKKHG 564 Query: 2266 CGRSPIHVCSEEEFLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVS 2445 CGR+PI CSEEEFL+DVMQFLILRGHTRL+PQGGLAEFPDAILNAKRLDLFNLYREVVS Sbjct: 565 CGRAPIQACSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVS 624 Query: 2446 RGGFHVGNGINWKGQVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCL 2625 RGGFHVGNGINWKGQVFSKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCL Sbjct: 625 RGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCL 684 Query: 2626 LCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQK 2805 LC SSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CS++ F+KK QK Sbjct: 685 LCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSIANFRKKSQK 744 Query: 2806 TSNGY 2820 T+NGY Sbjct: 745 TANGY 749 >ref|XP_003547888.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Glycine max] Length = 782 Score = 1038 bits (2683), Expect = 0.0 Identities = 509/773 (65%), Positives = 607/773 (78%), Gaps = 8/773 (1%) Frame = +1 Query: 529 HIQVASKYTCSLLAVFCGRNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSADEF 708 H Q K+TC+LLAV C ++ ++ + + YPFP+LVS+GRLEVQTL +P ++F Sbjct: 5 HSQGTPKHTCTLLAVTCRTSSAEHKLSHA---QRTYPFPELVSAGRLEVQTLCSPEKEQF 61 Query: 709 GKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPNGE 888 KVL+S+QPN +YL+G+QL +G GSLVW GV LST E I LF S LPT VYLE+PNGE Sbjct: 62 RKVLESFQPNFVYLRGDQLENGEVGSLVWQGVELSTCEDITELFGSTLPTAVYLEIPNGE 121 Query: 889 ELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLYCV 1068 AEALH KGIPYVI+WK+ FS +AACHFR A SVVQSSS HTWDAF LA ASF LYCV Sbjct: 122 SFAEALHLKGIPYVIFWKNTFSCYAACHFRQAFLSVVQSSSTHTWDAFHLARASFELYCV 181 Query: 1069 RNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDEDINM 1248 +NN VLP +S + ++GP L+G+ KIN SSG+LPAIKI+++++N+ Sbjct: 182 QNNQVLPSDSDDASSEMGPHLLGDCLKINVDPPEIDEEDDDESSSGSLPAIKIHEDEVNL 241 Query: 1249 RFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVTMR 1428 RFL+CG ++D S+L LEDGL ALL+IE+RG KLH + SA PPPLQA FSRGVVTMR Sbjct: 242 RFLICGAPSTVDESLLRSLEDGLRALLTIEIRGCKLHGKFSAPPPPLQAAAFSRGVVTMR 301 Query: 1429 CDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSEPR 1608 CDIST SSAHISLLVSGSAQTCF+DQ+LENHIK+E+I+ + LVHA +E NK + EPR Sbjct: 302 CDISTCSSAHISLLVSGSAQTCFNDQLLENHIKNEIIEKSQLVHAQLNNEGNKENICEPR 361 Query: 1609 RSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDDAE 1788 RS SIACGASV+E+C+KLP WA Q+LRQLAP+VSYR+LVALGIASIQGL +ASFEKDDAE Sbjct: 362 RSASIACGASVFEICMKLPQWALQILRQLAPEVSYRSLVALGIASIQGLPIASFEKDDAE 421 Query: 1789 RLLFFSARQGKDE--HLNNLNIGSLPTWLRSPAPSRKRSEMCQISS------IYSQNGAV 1944 RLLFF KD + NN+ S P WL+ P P+RKR E Q +S +++ G V Sbjct: 422 RLLFFYQNCEKDSCTNKNNIIFSSPPGWLKPPPPTRKRCEPRQEASPGLHEGVFAGQGGV 481 Query: 1945 KREELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQV 2124 + E+K+ + NG +P+ PARQ+ KV+A+RPIPH+R +M PF G SE DG +GTQV Sbjct: 482 CKLNEEEKDRKIVNGISMPLTPARQRLKVSAMRPIPHIRRHRMTPFCGPSETDGFDGTQV 541 Query: 2125 KPNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEE 2304 + LP VAP+K S+G T +HRKS S++ Q+KQ+ISLNPLPLKKHGCGR P+ CSEEE Sbjct: 542 EAILPLVAPTKRTSIGSTSGTHRKSFSSAAQSKQVISLNPLPLKKHGCGRGPVQTCSEEE 601 Query: 2305 FLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 2484 FLKDVM+FLILRGH RLIPQGGL EFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINWK Sbjct: 602 FLKDVMEFLILRGHNRLIPQGGLTEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINWK 661 Query: 2485 GQVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNC 2664 GQ+FSKMRN+T TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVNC Sbjct: 662 GQIFSKMRNYTTTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 721 Query: 2665 GVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 2823 G+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CSV+ FKKK Q +NGYS Sbjct: 722 GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVTNFKKK-QNVANGYS 773 >gb|ESW07366.1| hypothetical protein PHAVU_010G123900g [Phaseolus vulgaris] Length = 781 Score = 1034 bits (2674), Expect = 0.0 Identities = 513/774 (66%), Positives = 606/774 (78%), Gaps = 9/774 (1%) Frame = +1 Query: 529 HIQVASKYTCSLLAVFCGRN-AENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPSADE 705 H A K+ C+LLAV CG + AE++ +N + +YPFP+LVS+GRLEVQTL+NP ++ Sbjct: 5 HPHGAPKHACTLLAVTCGASFAEHKASQN----QHKYPFPELVSAGRLEVQTLRNPDKEQ 60 Query: 706 FGKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPNG 885 F KVL+S+QPN +YLQGEQL + + GSLVW G+ LST E I LF S LPT VYLE+PNG Sbjct: 61 FRKVLESYQPNFVYLQGEQLENDKVGSLVWQGLELSTSEDIIELFGSTLPTAVYLEIPNG 120 Query: 886 EELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLYC 1065 E AEALH KGIPYVI+WK+ F S+AACHFR A SVVQSSS HTWDAF LA ASF LYC Sbjct: 121 ESFAEALHLKGIPYVIFWKNAFFSYAACHFRQAFLSVVQSSSTHTWDAFHLARASFELYC 180 Query: 1066 VRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDEDIN 1245 V+NN VL N ++GP L+G+ KIN SSG LPAIKI+++++N Sbjct: 181 VQNNQVLSTNIHDAISEMGPHLLGDCLKINVDPPEIDEEDDDENSSGTLPAIKIHEDEVN 240 Query: 1246 MRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVTM 1425 +RFLVCG ++D S+L LEDGL ALL+IE+RG KLH + SA PPPLQA TFSRGVVTM Sbjct: 241 LRFLVCGAPSTVDESLLRSLEDGLRALLTIEIRGCKLHGKFSAPPPPLQAATFSRGVVTM 300 Query: 1426 RCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSEP 1605 RCDIST SSAHISLLVSGSAQTCF+DQ+LE+HIK+E+I+ + LVHA +E NK +SEP Sbjct: 301 RCDISTCSSAHISLLVSGSAQTCFNDQLLESHIKNEIIEKSQLVHAQLNNEGNKQNISEP 360 Query: 1606 RRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDDA 1785 RRS SIACGA V+E+C+KLP WA Q+LRQLAP+VSYR+LVALGIASIQGL +ASFEKDDA Sbjct: 361 RRSASIACGAPVFEICMKLPQWALQILRQLAPEVSYRSLVALGIASIQGLPIASFEKDDA 420 Query: 1786 ERLLFF--SARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISS------IYSQNGA 1941 ERLLFF S + NN+ GS P WL+ P P RKR E Q +S +++ Sbjct: 421 ERLLFFYQSCEKDSGTSKNNIIFGSPPGWLKPPPPRRKRCESSQGASPGLHEGVFAGPAT 480 Query: 1942 VKREELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQ 2121 V + E+K+ + NG P+ PARQ+ KV+A+RPIPH+R +M PF G SE DG +G Q Sbjct: 481 VYKVNEEEKDRKMANGISTPLAPARQRLKVSAMRPIPHIRRHRMTPFCGPSETDGFDGGQ 540 Query: 2122 VKPNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEE 2301 V+P LP VAP+K S+G T A+HRKS S++ Q+KQ+ISLNPLPLKKHGCGR P+ CSEE Sbjct: 541 VEPTLPLVAPTK-RSIGSTSATHRKSFSSAAQSKQVISLNPLPLKKHGCGRGPVQTCSEE 599 Query: 2302 EFLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINW 2481 EFLKDVM+FLILRGH RLIPQGGL EFPDAILN KRLDL+NLY+EVV+RGGFHVGNGINW Sbjct: 600 EFLKDVMEFLILRGHNRLIPQGGLTEFPDAILNGKRLDLYNLYKEVVTRGGFHVGNGINW 659 Query: 2482 KGQVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVN 2661 KGQ+FSKMRN+T TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSAAGDWVN Sbjct: 660 KGQIFSKMRNYTTTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVN 719 Query: 2662 CGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 2823 CG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CSV+ FKKK Q +NGYS Sbjct: 720 CGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVTNFKKK-QNVTNGYS 772 >gb|EOY30141.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 3, partial [Theobroma cacao] Length = 708 Score = 1024 bits (2647), Expect = 0.0 Identities = 520/724 (71%), Positives = 585/724 (80%), Gaps = 1/724 (0%) Frame = +1 Query: 649 LVSSGRLEVQTLKNPSADEFGKVLDSWQPNILYLQGEQLPDGR-YGSLVWGGVNLSTPEA 825 L SSGRLEVQ L +P+ DE +VL+S +PN++YLQGEQ D G L+WG V+LSTPE Sbjct: 1 LASSGRLEVQLLNSPNIDELRRVLESTEPNVVYLQGEQNADSEEIGPLIWGDVDLSTPET 60 Query: 826 ICGLFSSVLPTTVYLELPNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQS 1005 +CGLF S LPTTVYLE PNG++LAEALHS+G+PYVIYWK+ FS FAACHFR AL SV+QS Sbjct: 61 LCGLFDSTLPTTVYLETPNGDKLAEALHSQGVPYVIYWKNTFSRFAACHFRQALLSVIQS 120 Query: 1006 SSCHTWDAFQLAHASFRLYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXX 1185 S HTWDAFQLAHASFRLYCVRNN V+ NSQK + K GP L+GE PKI+ Sbjct: 121 SCSHTWDAFQLAHASFRLYCVRNNNVVSSNSQKQSVKPGPRLLGEAPKIDVSQPEVDMQG 180 Query: 1186 XXXXSSGALPAIKIYDEDINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNR 1365 S LPAIKIYD+D+ +RFLVCG LD+ +L LEDGLNALLSIE+RGSKLHNR Sbjct: 181 EES-SPENLPAIKIYDDDVTVRFLVCGSPCILDAFLLGSLEDGLNALLSIEIRGSKLHNR 239 Query: 1366 ASALPPPLQAGTFSRGVVTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDN 1545 ASA PPPLQAGTFSRGVVTMRCD ST SSAHISLLVSGSAQTCF+DQ+LENHIK+E+I+ Sbjct: 240 ASAPPPPLQAGTFSRGVVTMRCDFSTCSSAHISLLVSGSAQTCFNDQLLENHIKNEIIEK 299 Query: 1546 TCLVHALPTSEENKPRLSEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLV 1725 + LVHA +SEE+K SEPRRS SIACGASV+EVC+K+P WASQVLRQLAPDVSYR+LV Sbjct: 300 SQLVHAQSSSEESKLPSSEPRRSASIACGASVFEVCMKVPTWASQVLRQLAPDVSYRSLV 359 Query: 1726 ALGIASIQGLAVASFEKDDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEM 1905 LGIASIQGL+VASFEKDDAERLLFF RQ KD ++ I P+WL PAPSRKRSE Sbjct: 360 MLGIASIQGLSVASFEKDDAERLLFFCMRQDKDPLQDSSVIAISPSWLVPPAPSRKRSEP 419 Query: 1906 CQISSIYSQNGAVKREELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFS 2085 C+ S + G +E + NG AR KS VAA+RPIPH K++PFS Sbjct: 420 CKDSKPLNCTG------MEGE-----NGI------ARPKSNVAAMRPIPHTHRHKIIPFS 462 Query: 2086 GLSEIDGHEGTQVKPNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHG 2265 G SE + ++G Q K NLP V P K + P +HRK+ S+SYQA+QIISLNPLPLKKHG Sbjct: 463 GFSEAERYDGDQGKVNLP-VVPVKQPA----PVTHRKALSSSYQAQQIISLNPLPLKKHG 517 Query: 2266 CGRSPIHVCSEEEFLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVS 2445 CGR+PI VCSEEEFL+DVMQFLILRGHTRL+PQGGLAEFPDAILNAKRLDLFNLYREVVS Sbjct: 518 CGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVS 577 Query: 2446 RGGFHVGNGINWKGQVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCL 2625 RGGFHVGNGINWKGQVFSKMRNHT+TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCL Sbjct: 578 RGGFHVGNGINWKGQVFSKMRNHTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCL 637 Query: 2626 LCRSSAAGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQK 2805 LC SSAAGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEY+CP CS+S FKKK QK Sbjct: 638 LCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSISNFKKKPQK 697 Query: 2806 TSNG 2817 T NG Sbjct: 698 TVNG 701 >ref|XP_004510562.1| PREDICTED: AT-rich interactive domain-containing protein 4-like [Cicer arietinum] Length = 783 Score = 1004 bits (2595), Expect = 0.0 Identities = 505/780 (64%), Positives = 604/780 (77%), Gaps = 12/780 (1%) Frame = +1 Query: 520 LMSHIQVASKYTCSLLAVFCG-RNAENEQIKNVLDEKSRYPFPDLVSSGRLEVQTLKNPS 696 L H Q +SK TC+LL V R AE + +N +PFP+LVSSGRLEVQTL NP Sbjct: 2 LQFHPQGSSKQTCTLLTVTSATRCAEQKHPQN----HHNFPFPELVSSGRLEVQTLCNPE 57 Query: 697 ADEFGKVLDSWQPNILYLQGEQLPDGRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLEL 876 ++F KVL+S+QP+I+YLQGEQL + GS+VW GV LSTPE I LF + LPT VYLE+ Sbjct: 58 KEQFCKVLESYQPSIVYLQGEQLVNEEVGSVVWQGVELSTPEDISELFGTSLPTAVYLEI 117 Query: 877 PNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFR 1056 PNGE AEALH KGIPYV++WK+ FS +AACHFR A FSVVQSSS HTWDAF LAHASF Sbjct: 118 PNGESFAEALHLKGIPYVVFWKNAFSRYAACHFRQAFFSVVQSSSTHTWDAFHLAHASFE 177 Query: 1057 LYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXX----SSGALPAIK 1224 LYCV+NN VLP +S + +GP L+G+ KI+ SSG+LP+I+ Sbjct: 178 LYCVQNNQVLPTDSNDADSDMGPHLLGDCLKIHIDPPEMGEEEEDDDDDESSSGSLPSIQ 237 Query: 1225 IYDEDINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTF 1404 I+D+++N+RFL+CG+ ++D S+L LEDGL ALL+IE+RG KLH + SA PPPLQA F Sbjct: 238 IHDDEVNLRFLICGEPSTVDESLLRSLEDGLRALLTIEIRGCKLHGKYSAPPPPLQAAAF 297 Query: 1405 SRGVVTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEEN 1584 SRGVVTMRCDIST SSAHISLLVSGSAQ CF+DQ+LENHIK+E+I+ +VHA SE N Sbjct: 298 SRGVVTMRCDISTCSSAHISLLVSGSAQACFNDQLLENHIKNEIIEKGQIVHA-QLSEAN 356 Query: 1585 KPRLSEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVA 1764 K +SEPRRS SIACGA+++E+ +KLP WA Q+LRQLAPDVSYR+LVALGIASIQGL VA Sbjct: 357 KQTISEPRRSASIACGATIFEISMKLPQWALQILRQLAPDVSYRSLVALGIASIQGLPVA 416 Query: 1765 SFEKDDAERLLFFSARQGKDEHLN-NLNIGSLPTWLRSPAPSRKRSEMCQISS------I 1923 SFEKDDAERLLFF KD N N+ P WL+ P P+RKRSE Q +S + Sbjct: 417 SFEKDDAERLLFFYQSSEKDGCANHNIVFSRPPIWLKPPPPTRKRSESSQGASPDIDDGV 476 Query: 1924 YSQNGAVKREELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEID 2103 +S GA+K+ + E+K+ + NG P+ PARQ+ KV+A+RPIP VR +M PF G SE+D Sbjct: 477 FSGQGAIKKVDEEEKDRKMVNGISTPLTPARQRLKVSAMRPIPQVRRHRMTPFCGPSEMD 536 Query: 2104 GHEGTQVKPNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPI 2283 G G V+ ++P V P K +S+ + A+ RKS S+S +KQ+ISLNPLPLKKHGC R P+ Sbjct: 537 GFGGAHVEASVPLV-PMKRSSIASSSATQRKSFSSSALSKQVISLNPLPLKKHGCSRGPV 595 Query: 2284 HVCSEEEFLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHV 2463 CSEEEFLKDVM+FLILRGH+RLIPQGGL+EFPDAILN KRLDL+NLY+EVV+RGGFHV Sbjct: 596 QTCSEEEFLKDVMEFLILRGHSRLIPQGGLSEFPDAILNGKRLDLYNLYKEVVTRGGFHV 655 Query: 2464 GNGINWKGQVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSA 2643 GNGINWKGQ+FSKM N+T TN+MTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC SSA Sbjct: 656 GNGINWKGQIFSKMGNYTSTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSA 715 Query: 2644 AGDWVNCGVCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGYS 2823 AGDWVNCG+CGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CS++ FKKK Q +NGYS Sbjct: 716 AGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFKKK-QSVANGYS 774 >ref|XP_003533805.1| PREDICTED: AT-rich interactive domain-containing protein 4-like isoform X1 [Glycine max] Length = 752 Score = 998 bits (2581), Expect = 0.0 Identities = 502/771 (65%), Positives = 588/771 (76%), Gaps = 4/771 (0%) Frame = +1 Query: 520 LMSHIQVASKYTCSLLAVFCGRNAENEQIK---NVLDEKSRYPFPDLVSSGRLEVQTLKN 690 +M H Q S++ CSLLAV G++ + +Q + N +++ YPFP+L SSGRLEV+ L Sbjct: 1 MMFHSQGVSRH-CSLLAVLSGKSRDIKQKQKQGNASEDQFPYPFPELSSSGRLEVKVLIE 59 Query: 691 PSADEFGKVLDSWQPNILYLQGEQLPD-GRYGSLVWGGVNLSTPEAICGLFSSVLPTTVY 867 P+ADE G L+ QP+ +YLQG+QL D G G L W +LS PEA+CGLFSS LP TVY Sbjct: 60 PTADELGLALEQLQPDFVYLQGQQLEDRGEIGPLGWEDFDLSVPEALCGLFSSKLPNTVY 119 Query: 868 LELPNGEELAEALHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHA 1047 LE P GE+LAEAL SKG+PY IYWK+ FS +AA HFRH+LFSV QS+S HTWDAFQLA A Sbjct: 120 LETPKGEKLAEALRSKGVPYTIYWKNDFSKYAASHFRHSLFSVAQSTSSHTWDAFQLALA 179 Query: 1048 SFRLYCVRNNLVLPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKI 1227 SFRLYC+ NN VLP N K GKLGP ++G PP I+ S + A+KI Sbjct: 180 SFRLYCIHNN-VLPSNCHKGAGKLGPQILGVPPNIDVSPCVADMKEEEEDSPETISAVKI 238 Query: 1228 YDEDINMRFLVCGDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFS 1407 YD+D+NMRFL+CG +LD+ +L LEDGLNALL E+RG KLHNR SA PPPLQAGTFS Sbjct: 239 YDDDVNMRFLICGVPCTLDACLLGSLEDGLNALLFAEIRGCKLHNRTSATPPPLQAGTFS 298 Query: 1408 RGVVTMRCDISTVSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENK 1587 RGVVTMRCDIST SSAHISLLVSGSA TCF+DQ+LENHIK E+I+ + LV A P E++K Sbjct: 299 RGVVTMRCDISTCSSAHISLLVSGSADTCFNDQLLENHIKKELIEKSQLVQAFPNHEQSK 358 Query: 1588 PRLSEPRRSVSIACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVAS 1767 SEPRRS S+ACG+SV+EVC+++P WASQVLRQLAP++SYR+LV LGIASIQGL VAS Sbjct: 359 APSSEPRRSASVACGSSVFEVCMQVPAWASQVLRQLAPNLSYRSLVMLGIASIQGLPVAS 418 Query: 1768 FEKDDAERLLFFSARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVK 1947 F KDDAERLLFF RQ K+ N+ +P+WL+ P+ SRKRSE C S + +G Sbjct: 419 FNKDDAERLLFFCTRQEKENCPNDHVFSGIPSWLKPPSTSRKRSEPCSSSKSINDSGRGV 478 Query: 1948 REELEDKESALRNGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVK 2127 + RQK +A++RPIPH K+LPFSGLSE ++G K Sbjct: 479 EA----------------IGSHRQKFNLASMRPIPHSNRHKILPFSGLSEGTRYDGDHGK 522 Query: 2128 PNLPPVAPSKHNSVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEF 2307 NLP +AP KHN G T ++RKS S S+QA QIISLNPLP+KKHGC R+PI CSEEEF Sbjct: 523 SNLP-LAPIKHNVSGPTSVTNRKSVSNSFQAHQIISLNPLPMKKHGCDRAPIRACSEEEF 581 Query: 2308 LKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKG 2487 L+DVMQFLILRGH RLIP GGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKG Sbjct: 582 LRDVMQFLILRGHNRLIPPGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKG 641 Query: 2488 QVFSKMRNHTVTNKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCG 2667 QVFSKMRNHT+TN+MTGVGNTLKRHYETYLLEYEL+HDDVDGECCL+C SSAAGDWVNCG Sbjct: 642 QVFSKMRNHTMTNRMTGVGNTLKRHYETYLLEYELSHDDVDGECCLMCHSSAAGDWVNCG 701 Query: 2668 VCGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 2820 +CGEWAHFGCDRRQGLGAFKDYAKTDGLEY+CP+CS F KK QKT+NG+ Sbjct: 702 ICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPRCSALKFSKKSQKTANGF 752 >ref|XP_006587068.1| PREDICTED: AT-rich interactive domain-containing protein 4-like isoform X3 [Glycine max] Length = 772 Score = 995 bits (2572), Expect = 0.0 Identities = 498/759 (65%), Positives = 581/759 (76%), Gaps = 4/759 (0%) Frame = +1 Query: 556 CSLLAVFCGRNAENEQIK---NVLDEKSRYPFPDLVSSGRLEVQTLKNPSADEFGKVLDS 726 CSLLAV G++ + +Q + N +++ YPFP+L SSGRLEV+ L P+ADE G L+ Sbjct: 32 CSLLAVLSGKSRDIKQKQKQGNASEDQFPYPFPELSSSGRLEVKVLIEPTADELGLALEQ 91 Query: 727 WQPNILYLQGEQLPD-GRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPNGEELAEA 903 QP+ +YLQG+QL D G G L W +LS PEA+CGLFSS LP TVYLE P GE+LAEA Sbjct: 92 LQPDFVYLQGQQLEDRGEIGPLGWEDFDLSVPEALCGLFSSKLPNTVYLETPKGEKLAEA 151 Query: 904 LHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLYCVRNNLV 1083 L SKG+PY IYWK+ FS +AA HFRH+LFSV QS+S HTWDAFQLA ASFRLYC+ NN V Sbjct: 152 LRSKGVPYTIYWKNDFSKYAASHFRHSLFSVAQSTSSHTWDAFQLALASFRLYCIHNN-V 210 Query: 1084 LPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDEDINMRFLVC 1263 LP N K GKLGP ++G PP I+ S + A+KIYD+D+NMRFL+C Sbjct: 211 LPSNCHKGAGKLGPQILGVPPNIDVSPCVADMKEEEEDSPETISAVKIYDDDVNMRFLIC 270 Query: 1264 GDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVTMRCDIST 1443 G +LD+ +L LEDGLNALL E+RG KLHNR SA PPPLQAGTFSRGVVTMRCDIST Sbjct: 271 GVPCTLDACLLGSLEDGLNALLFAEIRGCKLHNRTSATPPPLQAGTFSRGVVTMRCDIST 330 Query: 1444 VSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSEPRRSVSI 1623 SSAHISLLVSGSA TCF+DQ+LENHIK E+I+ + LV A P E++K SEPRRS S+ Sbjct: 331 CSSAHISLLVSGSADTCFNDQLLENHIKKELIEKSQLVQAFPNHEQSKAPSSEPRRSASV 390 Query: 1624 ACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDDAERLLFF 1803 ACG+SV+EVC+++P WASQVLRQLAP++SYR+LV LGIASIQGL VASF KDDAERLLFF Sbjct: 391 ACGSSVFEVCMQVPAWASQVLRQLAPNLSYRSLVMLGIASIQGLPVASFNKDDAERLLFF 450 Query: 1804 SARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVKREELEDKESALR 1983 RQ K+ N+ +P+WL+ P+ SRKRSE C S + +G Sbjct: 451 CTRQEKENCPNDHVFSGIPSWLKPPSTSRKRSEPCSSSKSINDSGRGVEA---------- 500 Query: 1984 NGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPNLPPVAPSKHN 2163 + RQK +A++RPIPH K+LPFSGLSE ++G K NLP +AP KHN Sbjct: 501 ------IGSHRQKFNLASMRPIPHSNRHKILPFSGLSEGTRYDGDHGKSNLP-LAPIKHN 553 Query: 2164 SVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLKDVMQFLILRG 2343 G T ++RKS S S+QA QIISLNPLP+KKHGC R+PI CSEEEFL+DVMQFLILRG Sbjct: 554 VSGPTSVTNRKSVSNSFQAHQIISLNPLPMKKHGCDRAPIRACSEEEFLRDVMQFLILRG 613 Query: 2344 HTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTVT 2523 H RLIP GGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHT+T Sbjct: 614 HNRLIPPGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTMT 673 Query: 2524 NKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDR 2703 N+MTGVGNTLKRHYETYLLEYEL+HDDVDGECCL+C SSAAGDWVNCG+CGEWAHFGCDR Sbjct: 674 NRMTGVGNTLKRHYETYLLEYELSHDDVDGECCLMCHSSAAGDWVNCGICGEWAHFGCDR 733 Query: 2704 RQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 2820 RQGLGAFKDYAKTDGLEY+CP+CS F KK QKT+NG+ Sbjct: 734 RQGLGAFKDYAKTDGLEYVCPRCSALKFSKKSQKTANGF 772 >ref|XP_006587067.1| PREDICTED: AT-rich interactive domain-containing protein 4-like isoform X2 [Glycine max] Length = 795 Score = 995 bits (2572), Expect = 0.0 Identities = 498/759 (65%), Positives = 581/759 (76%), Gaps = 4/759 (0%) Frame = +1 Query: 556 CSLLAVFCGRNAENEQIK---NVLDEKSRYPFPDLVSSGRLEVQTLKNPSADEFGKVLDS 726 CSLLAV G++ + +Q + N +++ YPFP+L SSGRLEV+ L P+ADE G L+ Sbjct: 55 CSLLAVLSGKSRDIKQKQKQGNASEDQFPYPFPELSSSGRLEVKVLIEPTADELGLALEQ 114 Query: 727 WQPNILYLQGEQLPD-GRYGSLVWGGVNLSTPEAICGLFSSVLPTTVYLELPNGEELAEA 903 QP+ +YLQG+QL D G G L W +LS PEA+CGLFSS LP TVYLE P GE+LAEA Sbjct: 115 LQPDFVYLQGQQLEDRGEIGPLGWEDFDLSVPEALCGLFSSKLPNTVYLETPKGEKLAEA 174 Query: 904 LHSKGIPYVIYWKHKFSSFAACHFRHALFSVVQSSSCHTWDAFQLAHASFRLYCVRNNLV 1083 L SKG+PY IYWK+ FS +AA HFRH+LFSV QS+S HTWDAFQLA ASFRLYC+ NN V Sbjct: 175 LRSKGVPYTIYWKNDFSKYAASHFRHSLFSVAQSTSSHTWDAFQLALASFRLYCIHNN-V 233 Query: 1084 LPENSQKVNGKLGPDLIGEPPKINXXXXXXXXXXXXXXSSGALPAIKIYDEDINMRFLVC 1263 LP N K GKLGP ++G PP I+ S + A+KIYD+D+NMRFL+C Sbjct: 234 LPSNCHKGAGKLGPQILGVPPNIDVSPCVADMKEEEEDSPETISAVKIYDDDVNMRFLIC 293 Query: 1264 GDTRSLDSSILEPLEDGLNALLSIEMRGSKLHNRASALPPPLQAGTFSRGVVTMRCDIST 1443 G +LD+ +L LEDGLNALL E+RG KLHNR SA PPPLQAGTFSRGVVTMRCDIST Sbjct: 294 GVPCTLDACLLGSLEDGLNALLFAEIRGCKLHNRTSATPPPLQAGTFSRGVVTMRCDIST 353 Query: 1444 VSSAHISLLVSGSAQTCFDDQMLENHIKSEVIDNTCLVHALPTSEENKPRLSEPRRSVSI 1623 SSAHISLLVSGSA TCF+DQ+LENHIK E+I+ + LV A P E++K SEPRRS S+ Sbjct: 354 CSSAHISLLVSGSADTCFNDQLLENHIKKELIEKSQLVQAFPNHEQSKAPSSEPRRSASV 413 Query: 1624 ACGASVYEVCVKLPNWASQVLRQLAPDVSYRNLVALGIASIQGLAVASFEKDDAERLLFF 1803 ACG+SV+EVC+++P WASQVLRQLAP++SYR+LV LGIASIQGL VASF KDDAERLLFF Sbjct: 414 ACGSSVFEVCMQVPAWASQVLRQLAPNLSYRSLVMLGIASIQGLPVASFNKDDAERLLFF 473 Query: 1804 SARQGKDEHLNNLNIGSLPTWLRSPAPSRKRSEMCQISSIYSQNGAVKREELEDKESALR 1983 RQ K+ N+ +P+WL+ P+ SRKRSE C S + +G Sbjct: 474 CTRQEKENCPNDHVFSGIPSWLKPPSTSRKRSEPCSSSKSINDSGRGVEA---------- 523 Query: 1984 NGFPLPVVPARQKSKVAALRPIPHVRHQKMLPFSGLSEIDGHEGTQVKPNLPPVAPSKHN 2163 + RQK +A++RPIPH K+LPFSGLSE ++G K NLP +AP KHN Sbjct: 524 ------IGSHRQKFNLASMRPIPHSNRHKILPFSGLSEGTRYDGDHGKSNLP-LAPIKHN 576 Query: 2164 SVGVTPASHRKSTSTSYQAKQIISLNPLPLKKHGCGRSPIHVCSEEEFLKDVMQFLILRG 2343 G T ++RKS S S+QA QIISLNPLP+KKHGC R+PI CSEEEFL+DVMQFLILRG Sbjct: 577 VSGPTSVTNRKSVSNSFQAHQIISLNPLPMKKHGCDRAPIRACSEEEFLRDVMQFLILRG 636 Query: 2344 HTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTVT 2523 H RLIP GGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHT+T Sbjct: 637 HNRLIPPGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTMT 696 Query: 2524 NKMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCRSSAAGDWVNCGVCGEWAHFGCDR 2703 N+MTGVGNTLKRHYETYLLEYEL+HDDVDGECCL+C SSAAGDWVNCG+CGEWAHFGCDR Sbjct: 697 NRMTGVGNTLKRHYETYLLEYELSHDDVDGECCLMCHSSAAGDWVNCGICGEWAHFGCDR 756 Query: 2704 RQGLGAFKDYAKTDGLEYICPQCSVSTFKKKIQKTSNGY 2820 RQGLGAFKDYAKTDGLEY+CP+CS F KK QKT+NG+ Sbjct: 757 RQGLGAFKDYAKTDGLEYVCPRCSALKFSKKSQKTANGF 795