BLASTX nr result
ID: Catharanthus22_contig00023912
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00023912
         (1380 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006345586.1| PREDICTED: AT-rich interactive domain-contai...  285   3e-74
ref|XP_004240098.1| PREDICTED: AT-rich interactive domain-contai...  282   3e-73
ref|XP_006345587.1| PREDICTED: AT-rich interactive domain-contai...  277   9e-72
ref|XP_002276148.2| PREDICTED: AT-rich interactive domain-contai...  270   8e-70
emb|CAN80959.1| hypothetical protein VITISV_037562 [Vitis vinifera]  270   8e-70
emb|CBI40441.3| unnamed protein product [Vitis vinifera]             263   2e-67
gb|EOX95919.1| ARID/BRIGHT DNA-binding domain,ELM2 domain protei...  262   3e-67
ref|XP_004231189.1| PREDICTED: AT-rich interactive domain-contai...  258   3e-66
gb|EOX95917.1| ARID/BRIGHT DNA-binding domain,ELM2 domain protei...  256   1e-65
ref|XP_004308360.1| PREDICTED: AT-rich interactive domain-contai...  242   3e-61
ref|XP_006339600.1| PREDICTED: AT-rich interactive domain-contai...  240   9e-61
ref|XP_002302593.2| hypothetical protein POPTR_0002s16250g [Popu...  231   7e-58
ref|XP_006293913.1| hypothetical protein CARUB_v10022904mg [Caps...  199   3e-48
ref|NP_182128.2| ARID/BRIGHT AND ELM2 DNA-binding domain-contain...  198   4e-48
gb|AAC62899.1| hypothetical protein [Arabidopsis thaliana]           198   4e-48
ref|XP_006293914.1| hypothetical protein CARUB_v10022904mg, part...  194   7e-47
ref|XP_006476534.1| PREDICTED: AT-rich interactive domain-contai...  192   2e-46
ref|XP_006476535.1| PREDICTED: AT-rich interactive domain-contai...  190   1e-45
gb|EOY24787.1| ARID/BRIGHT DNA-binding domain,ELM2 domain protei...
189 3e-45 ref|XP_004492764.1| PREDICTED: AT-rich interactive domain-contai... 181 8e-43 >ref|XP_006345586.1| PREDICTED: AT-rich interactive domain-containing protein 1-like isoform X1 [Solanum tuberosum] Length = 621 Score = 285 bits (729), Expect = 3e-74 Identities = 183/467 (39%), Positives = 247/467 (52%), Gaps = 8/467 (1%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKK-ETRLDFQPVCVKAGEQQQDRRDELKSSFDEFLSVFL 178 GW KRVDGS L F+ P+ + + +D+ D K F E ++ L Sbjct: 3 GWSKRVDGSNLTSFRKPENPKTLVKNEVDY---------------DSAKLLFIEVFTICL 47 Query: 179 QETFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDSSVG 358 +E G+ C RPLPPMLGDGK+VDLFKLYL VRK GGY+RVSEN LW V +CGFDSS G Sbjct: 48 KEVAGSLCYRPLPPMLGDGKAVDLFKLYLVVRKNGGYQRVSENGLWGLVGMDCGFDSSYG 107 Query: 359 LALKLVYSKYLHALSCSLQKA--VANKDSKSGVTDSSPDSFG-GRLMHLESNLEGFLSGN 529 +ALKLVY KYL L + + + KS V+ S G G M +E +L+ L Sbjct: 108 MALKLVYVKYLGVLEKWMLRVREIDGSSVKSKVSKCDLGSGGDGVPMDVELDLKKVLMKI 167 Query: 530 SEKKKKGGVYTNIKMRQTQIEFFYDGKLSNTDEHMGIGKKSAGIDHVGENGVESINNRGA 709 S++K K G K+R+ ++ + + D G+ + + D I + Sbjct: 168 SDEKAKDGENGKCKLRKKGVDTTDGDEARDFD---GLNESNEKFDDQSNLDSNVITVKDI 224 Query: 710 ECLSEIRNDNDDC--IAAASDLGEGRGXXXXXXXXXXXXXXXXXXIRKRRREFVPKMLKW 883 + + D + + + G RKR+RE M+KW Sbjct: 225 DETRSVVYDKEHLKSVKGWESVSVLHGNGTDESLTKSTMVEEDCISRKRKRESYLDMVKW 284 Query: 884 IHMVAKDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVR-EACLRKDIDSSDQQGTGQKKQ 1060 + VAK+PCD AIG LPERSKWK Y ++ VWK+VL +R E L+KD+D QQ Q KQ Sbjct: 285 VSEVAKNPCDLAIGTLPERSKWKDYGTEVVWKKVLLLRDEMLLKKDVDPCAQQSVWQIKQ 344 Query: 1061 KMHPFMYDDHVRTSPEKLRFSQRLISAKDTLKKTGG-QIXXXXXXXGTDEDSVDPQSDSI 1237 KMHP MYD++ + +LR SQR++SAKD LKK QI DED D + S Sbjct: 345 KMHPSMYDEN--SGSGRLRCSQRVLSAKDHLKKRRALQISASPSSSHNDEDQADVPTGSS 402 Query: 1238 VDPLPDYLVNRRKKRMSLGPQHQADVPEWTGEAYESDNKWLGTKMWP 1378 + + +R+KR+ +GPQ QAD+PE E YESD+KWLGT +WP Sbjct: 403 AESAIGFWWKQRRKRIPVGPQFQADIPESIEEIYESDSKWLGTGIWP 449 >ref|XP_004240098.1| PREDICTED: AT-rich interactive domain-containing protein 1-like [Solanum lycopersicum] Length = 617 Score = 
282 bits (721), Expect = 3e-73 Identities = 184/466 (39%), Positives = 246/466 (52%), Gaps = 7/466 (1%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKK-ETRLDFQPVCVKAGEQQQDRRDELKSSFDEFLSVFL 178 GW KRVDGS L F+ P+ + + ++D+ D K F E ++ L Sbjct: 3 GWSKRVDGSSLTSFRKPENPKNVVKNKVDY---------------DNAKLLFIEVFTICL 47 Query: 179 QETFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDSSVG 358 +E G+ C RPLPPMLGDGKSVDLFKLYL VRK GGY+RVSEN LW V +CGFDSS G Sbjct: 48 KEASGSLCYRPLPPMLGDGKSVDLFKLYLVVRKNGGYQRVSENGLWGLVGMDCGFDSSYG 107 Query: 359 LALKLVYSKYLHALSCSLQKA----VANKDSKSGVTDSSPDSFGGRLMHLESNLEGFLSG 526 +ALKLVY KYL L + + V++ SK D D G M +E +L+ L Sbjct: 108 MALKLVYVKYLGVLEKWVLRVRETDVSSVKSKVSKCDLGSDG-DGVPMDVELDLKKVLMK 166 Query: 527 NSEKKKKGGVYTNIKMRQTQIEFFYDGKLSNTDEHMGIGKKSAGIDHVGENGVESINNRG 706 S++K K G N ++ + + D G +K G ++ +N V ++ + Sbjct: 167 ISDEKAKDGDNGN------GVDTTDGDEARDFDGLNGSNEKFDGQSNL-DNNVITVKDID 219 Query: 707 AECLSEIRNDNDDCIAAASDLGEGRGXXXXXXXXXXXXXXXXXXIRKRRREFVPKMLKWI 886 ++ + + G RKR+RE M+KW+ Sbjct: 220 ETRSVVYDKEHLKSVKGRESVSVLHGNGTDESLTKYTMVEEDCVSRKRKRESYLDMVKWV 279 Query: 887 HMVAKDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVR-EACLRKDIDSSDQQGTGQKKQK 1063 VAK PCD AIG LP+RSKWK Y ++ VWKQVL VR + L+KD+D QQ Q KQK Sbjct: 280 SEVAKKPCDLAIGTLPDRSKWKDYGAEVVWKQVLLVRDDMLLKKDVDPCAQQSIWQIKQK 339 Query: 1064 MHPFMYDDHVRTSPEKLRFSQRLISAKDTLKKTGG-QIXXXXXXXGTDEDSVDPQSDSIV 1240 MHP MYDD+ + +LR SQR++SAKD LKK Q DED D + S Sbjct: 340 MHPSMYDDN--SGSGRLRCSQRVLSAKDHLKKRRAMQFLASPSSSHNDEDQADVPTGSSA 397 Query: 1241 DPLPDYLVNRRKKRMSLGPQHQADVPEWTGEAYESDNKWLGTKMWP 1378 + + +R+KR+ +GPQ QAD+PE E YESD+KWLGT +WP Sbjct: 398 ESAIGFWWKQRRKRIPVGPQFQADIPESIEEIYESDSKWLGTGIWP 443 >ref|XP_006345587.1| PREDICTED: AT-rich interactive domain-containing protein 1-like isoform X2 [Solanum tuberosum] Length = 618 Score = 277 bits (708), Expect = 9e-72 Identities = 181/467 (38%), Positives = 245/467 (52%), Gaps = 8/467 (1%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKK-ETRLDFQPVCVKAGEQQQDRRDELKSSFDEFLSVFL 178 GW KRVDGS L F+ P+ + + +D+ D K F E 
++ L Sbjct: 3 GWSKRVDGSNLTSFRKPENPKTLVKNEVDY---------------DSAKLLFIEVFTICL 47 Query: 179 QETFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDSSVG 358 +E G+ C RPLPPMLGDGK+VDLFKLYL VRK GGY+RVSEN LW V +CGFDSS G Sbjct: 48 KEVAGSLCYRPLPPMLGDGKAVDLFKLYLVVRKNGGYQRVSENGLWGLVGMDCGFDSSYG 107 Query: 359 LALKLVYSKYLHALSCSLQKA--VANKDSKSGVTDSSPDSFG-GRLMHLESNLEGFLSGN 529 +ALKLVY KYL L + + + KS V+ S G G M +E +L+ L Sbjct: 108 MALKLVYVKYLGVLEKWMLRVREIDGSSVKSKVSKCDLGSGGDGVPMDVELDLKKVLMKI 167 Query: 530 SEKKKKGGVYTNIKMRQTQIEFFYDGKLSNTDEHMGIGKKSAGIDHVGENGVESINNRGA 709 S++K K G K+R+ ++ + + D G+ + + D I + Sbjct: 168 SDEKAKDGENGKCKLRKKGVDTTDGDEARDFD---GLNESNEKFDDQSNLDSNVITVKDI 224 Query: 710 ECLSEIRNDNDDC--IAAASDLGEGRGXXXXXXXXXXXXXXXXXXIRKRRREFVPKMLKW 883 + + D + + + G RKR+RE M+KW Sbjct: 225 DETRSVVYDKEHLKSVKGWESVSVLHGNGTDESLTKSTMVEEDCISRKRKRESYLDMVKW 284 Query: 884 IHMVAKDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVR-EACLRKDIDSSDQQGTGQKKQ 1060 + VAK+PCD AIG LPERSKWK Y ++ VWK+VL +R E L+KD+D QQ Q Sbjct: 285 VSEVAKNPCDLAIGTLPERSKWKDYGTEVVWKKVLLLRDEMLLKKDVDPCAQQSVW---Q 341 Query: 1061 KMHPFMYDDHVRTSPEKLRFSQRLISAKDTLKKTGG-QIXXXXXXXGTDEDSVDPQSDSI 1237 KMHP MYD++ + +LR SQR++SAKD LKK QI DED D + S Sbjct: 342 KMHPSMYDEN--SGSGRLRCSQRVLSAKDHLKKRRALQISASPSSSHNDEDQADVPTGSS 399 Query: 1238 VDPLPDYLVNRRKKRMSLGPQHQADVPEWTGEAYESDNKWLGTKMWP 1378 + + +R+KR+ +GPQ QAD+PE E YESD+KWLGT +WP Sbjct: 400 AESAIGFWWKQRRKRIPVGPQFQADIPESIEEIYESDSKWLGTGIWP 446 >ref|XP_002276148.2| PREDICTED: AT-rich interactive domain-containing protein 1-like [Vitis vinifera] Length = 628 Score = 270 bits (691), Expect = 8e-70 Identities = 181/481 (37%), Positives = 250/481 (51%), Gaps = 22/481 (4%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKKETRLDF------QPVCVKAGEQQQDRRDELKSSFDEF 163 GW DGS L+ K + +E L F + + V G EL+ SFD+F Sbjct: 3 GWSMIADGSALDCVKI---LKPQENGLWFVLEPGSKGIVVGGGGIS-----ELRCSFDQF 54 Query: 164 LSVFLQETFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGF 343 L FL++ G + RPLPPMLG+G+ VDLFKL+L V+++GGY VSEN LW+ VA+E G Sbjct: 55 
LGPFLKQIRGHNSYRPLPPMLGNGQCVDLFKLFLLVKEKGGYRTVSENVLWNLVAEESGL 114 Query: 344 DSSVGLALKLVYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRLMHLESNLEGFLS 523 DS VG ALKLVY KYL L L + +K S G D+ G LM LE+ +GFL Sbjct: 115 DSGVGSALKLVYIKYLDLLDRWLDRIFKDKKS-HGSLSVCGDTSGRLLMELETEFKGFLP 173 Query: 524 GNSEKKKKGGVYTNIKMRQTQIEFFYDGKLSNTDEHMGIGKKSAGIDHVGENGVESINNR 703 ++K K Y + + +++ F L DE + + E GV+ +++ Sbjct: 174 EILDQKMKDEEYPHFDLAKSESSFSGVENLYCNDE------VKSDVKVESEGGVKCVDDN 227 Query: 704 GAECLSEIRND---NDDCIAAASDLGEGRGXXXXXXXXXXXXXXXXXXIRKRRREFVPKM 874 E S ++ + N C+ D+ RKR+RE++ M Sbjct: 228 D-EVKSSVKLELDLNRKCVDDDEDV----------MILDLNEVNEEVFTRKRKREYMLGM 276 Query: 875 LKWIHMVAKDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVREAC-LRKDIDSSDQQGTGQ 1051 L W+ +AK+PCDP+IG LPERSKWK E+WKQVL VREA L++D+D S +Q Q Sbjct: 277 LNWVTTIAKNPCDPSIGKLPERSKWKLTGPGELWKQVLLVREALFLQRDVDLSAEQSIWQ 336 Query: 1052 KKQKMHPFMYDDHVRTSPEKLRFSQRLISAKDTLKKTGGQIXXXXXXXGTD-------ED 1210 KKQKMHP MY+DH E+LR+SQRL+S K + + + D ED Sbjct: 337 KKQKMHPSMYEDH--AGSERLRYSQRLLSGKRSRSRACSESSSSATQSDLDKSPSPCMED 394 Query: 1211 SVDPQSDSIVDPLPDYLV-----NRRKKRMSLGPQHQADVPEWTGEAYESDNKWLGTKMW 1375 D Q I DP + V + ++ +GP QA +PEWTG E D+KWLGT++W Sbjct: 395 HHDKQLLGICDPSIGHSVAGLCGDSHVRKRPVGPAFQATIPEWTGVVSEIDSKWLGTRVW 454 Query: 1376 P 1378 P Sbjct: 455 P 455 >emb|CAN80959.1| hypothetical protein VITISV_037562 [Vitis vinifera] Length = 724 Score = 270 bits (691), Expect = 8e-70 Identities = 181/481 (37%), Positives = 250/481 (51%), Gaps = 22/481 (4%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKKETRLDF------QPVCVKAGEQQQDRRDELKSSFDEF 163 GW DGS L+ K + +E L F + + V G EL+ SFD+F Sbjct: 3 GWSMIADGSALDCVKI---LKPQENGLWFVLEPGSKGIVVGGGGIS-----ELRCSFDQF 54 Query: 164 LSVFLQETFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGF 343 L FL++ G + RPLPPMLG+G+ VDLFKL+L V+++GGY VSEN LW+ VA+E G Sbjct: 55 LGPFLKQIRGHNSYRPLPPMLGNGQCVDLFKLFLLVKEKGGYRTVSENVLWNLVAEESGL 114 Query: 344 DSSVGLALKLVYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRLMHLESNLEGFLS 523 DS VG ALKLVY KYL L L + +K S G D+ G LM LE+ +GFL Sbjct: 
115 DSGVGSALKLVYIKYLDLLDRWLDRIFKDKKS-HGSLSVCGDTSGRLLMELETEFKGFLP 173 Query: 524 GNSEKKKKGGVYTNIKMRQTQIEFFYDGKLSNTDEHMGIGKKSAGIDHVGENGVESINNR 703 ++K K Y + + +++ F L DE + + E GV+ +++ Sbjct: 174 EILDQKMKDEEYPHFDLAKSESSFSGVENLYCNDE------VKSDVKVESEGGVKCVDDN 227 Query: 704 GAECLSEIRND---NDDCIAAASDLGEGRGXXXXXXXXXXXXXXXXXXIRKRRREFVPKM 874 E S ++ + N C+ D+ RKR+RE++ M Sbjct: 228 D-EVKSSVKLELDLNRKCVDDDEDV----------MILDLNEVNEEVFTRKRKREYMLGM 276 Query: 875 LKWIHMVAKDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVREAC-LRKDIDSSDQQGTGQ 1051 L W+ +AK+PCDP+IG LPERSKWK E+WKQVL VREA L++D+D S +Q Q Sbjct: 277 LNWVTTIAKNPCDPSIGKLPERSKWKLTGPGELWKQVLLVREALFLQRDVDLSAEQSIWQ 336 Query: 1052 KKQKMHPFMYDDHVRTSPEKLRFSQRLISAKDTLKKTGGQIXXXXXXXGTD-------ED 1210 KKQKMHP MY+DH E+LR+SQRL+S K + + + D ED Sbjct: 337 KKQKMHPSMYEDH--AGSERLRYSQRLLSGKRSRSRACSESSSSATQSDLDKSPSPCMED 394 Query: 1211 SVDPQSDSIVDPLPDYLV-----NRRKKRMSLGPQHQADVPEWTGEAYESDNKWLGTKMW 1375 D Q I DP + V + ++ +GP QA +PEWTG E D+KWLGT++W Sbjct: 395 HHDKQLLGICDPSIGHSVAGLCGDSHVRKRPVGPAFQATIPEWTGVVSEIDSKWLGTRVW 454 Query: 1376 P 1378 P Sbjct: 455 P 455 >emb|CBI40441.3| unnamed protein product [Vitis vinifera] Length = 594 Score = 263 bits (671), Expect = 2e-67 Identities = 176/469 (37%), Positives = 242/469 (51%), Gaps = 10/469 (2%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKKETRLDF------QPVCVKAGEQQQDRRDELKSSFDEF 163 GW DGS L+ K + +E L F + + V G EL+ SFD+F Sbjct: 10 GWSMIADGSALDCVKI---LKPQENGLWFVLEPGSKGIVVGGGGIS-----ELRCSFDQF 61 Query: 164 LSVFLQETFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGF 343 L FL++ G + RPLPPMLG+G+ VDLFKL+L V+++GGY VSEN LW+ VA+E G Sbjct: 62 LGPFLKQIRGHNSYRPLPPMLGNGQCVDLFKLFLLVKEKGGYRTVSENVLWNLVAEESGL 121 Query: 344 DSSVGLALKLVYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRLMHLESNLEGFLS 523 DS VG ALKLVY KYL L L + +K S G D+ G LM LE+ +GFL Sbjct: 122 DSGVGSALKLVYIKYLDLLDRWLDRIFKDKKS-HGSLSVCGDTSGRLLMELETEFKGFLP 180 Query: 524 GNSEKKKKGGVYTNIKMRQTQIEFFYDGKLSNTDEHMGIGKKSAGIDHVGENGVESINNR 703 ++K K Y + + +++ F L DE + + E GV+ +++ Sbjct: 181 
EILDQKMKDEEYPHFDLAKSESSFSGVENLYCNDE------VKSDVKVESEGGVKCVDDN 234 Query: 704 GAECLSEIRND---NDDCIAAASDLGEGRGXXXXXXXXXXXXXXXXXXIRKRRREFVPKM 874 E S ++ + N C+ D+ RKR+RE++ M Sbjct: 235 D-EVKSSVKLELDLNRKCVDDDEDV----------MILDLNEVNEEVFTRKRKREYMLGM 283 Query: 875 LKWIHMVAKDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVREAC-LRKDIDSSDQQGTGQ 1051 L W+ +AK+PCDP+IG LPERSKWK E+WKQVL VREA L++D+D S +Q Q Sbjct: 284 LNWVTTIAKNPCDPSIGKLPERSKWKLTGPGELWKQVLLVREALFLQRDVDLSAEQSIWQ 343 Query: 1052 KKQKMHPFMYDDHVRTSPEKLRFSQRLISAKDTLKKTGGQIXXXXXXXGTDEDSVDPQSD 1231 KKQKMHP MY+DH E+LR+SQRL+S K + + G D Sbjct: 344 KKQKMHPSMYEDH--AGSERLRYSQRLLSGKRSRSRASGLC-----------------GD 384 Query: 1232 SIVDPLPDYLVNRRKKRMSLGPQHQADVPEWTGEAYESDNKWLGTKMWP 1378 S V ++ +GP QA +PEWTG E D+KWLGT++WP Sbjct: 385 SHV------------RKRPVGPAFQATIPEWTGVVSEIDSKWLGTRVWP 421 >gb|EOX95919.1| ARID/BRIGHT DNA-binding domain,ELM2 domain protein, putative isoform 3 [Theobroma cacao] Length = 676 Score = 262 bits (669), Expect = 3e-67 Identities = 179/509 (35%), Positives = 243/509 (47%), Gaps = 50/509 (9%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKKETRLDFQPVCVKAGEQQQDRRDELKSSFDEFLSVFLQ 181 GW R DGS L+ KTP+K +D +P G + D+L+ F++FL+ FL+ Sbjct: 3 GWSMRADGSSLDCAKTPEKLEPAGYWVDLEPF--SEGSFLKSEPDKLRFWFNKFLASFLK 60 Query: 182 ETFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDSSVGL 361 E C PLPPMLGDG+ VDLFKL+L VR++GGY VSE+ LWD VA+E G +V Sbjct: 61 EICAQGCFWPLPPMLGDGQPVDLFKLFLVVREKGGYNAVSESGLWDLVAEESGLGLNVAS 120 Query: 362 ALKLVYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRLMHLESNLEGFL------- 520 ++KLVY KYL +L L++ + ++DSKS + G LM L + L+GFL Sbjct: 121 SVKLVYVKYLVSLERWLERIIESEDSKS------ESDYSGHLMELGAELKGFLLASKKKV 174 Query: 521 -------------SGNSEKKKKGGVYTNIKMRQTQIEFFYDGKLSNTDEHMGIGKKSAGI 661 S EK K +I + + + + GKL N D+ + S G Sbjct: 175 VEYSQVEESVVAGSDGGEKCVKNEESMHIDLTKRVLNYEGVGKLQNDDDSKSVVVDSDGD 234 Query: 662 DH------------------VGENGVESINNRGAECLSEIRNDNDDCIAAASDLGEGRGX 787 V + VE I N E S I D DC + Sbjct: 235 KKCMDGDECEESPSDLAKSAVNSSDVEKICNED-EVKSAIMEDFVDCKKCTDSDDDDN-- 291 
Query: 788 XXXXXXXXXXXXXXXXXIRKRRREFVPKMLKWIHMVAKDPCDPAIGNLPERSKWKSYPSD 967 KR+RE + ML WI +AKDPCDP IG+LPERSKWKSY ++ Sbjct: 292 ---VVILDSNDTKEKFSSHKRKRESMWGMLNWITEIAKDPCDPVIGSLPERSKWKSYGNE 348 Query: 968 EVWKQVLFVREACL-RKDIDSSDQQGTGQKKQKMHPFMYDDHVR---------TSPEKLR 1117 E+WKQVL REA +KD S Q + QK QKMHP +YDD R + P+KL Sbjct: 349 ELWKQVLLFREAAFHKKDDHSGVDQSSWQKNQKMHPCLYDDPTRFGYNLRERLSCPKKLL 408 Query: 1118 FSQRLISAKD--TLKKTGGQIXXXXXXXGTDEDSVDPQSDSIVDPLPDYLVNRRKKRMSL 1291 + + K+ +G G D+ S + + DY ++ + Sbjct: 409 LGKMVSKGKNYSQSSSSGNHSDLDNSMVGIDKQSHGTYDSATPGSVFDY---DNDMQVPI 465 Query: 1292 GPQHQADVPEWTGEAYESDNKWLGTKMWP 1378 GP Q +VP+WTG A ESD KWLGT++WP Sbjct: 466 GPYFQVEVPDWTGLASESDPKWLGTRVWP 494 >ref|XP_004231189.1| PREDICTED: AT-rich interactive domain-containing protein 1-like [Solanum lycopersicum] Length = 603 Score = 258 bits (660), Expect = 3e-66 Identities = 179/480 (37%), Positives = 246/480 (51%), Gaps = 21/480 (4%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKKETRLDFQPVCVKAGEQQQDRRDELKSSFDEFLSVFLQ 181 GW RVDG GLE F++ + K ++ L ++ L+ Sbjct: 3 GWSMRVDGFGLESFRSVENVSKLDSVL---------------------------VNRLLK 35 Query: 182 ETFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDSSVGL 361 E G C RP P MLGDGK+VDL +L+L VR++GGYERV + W VA ECGFD + G Sbjct: 36 EVNGCLCFRPFPRMLGDGKAVDLCELFLVVREKGGYERVCASRSWGVVAVECGFDLTTGS 95 Query: 362 ALKLVYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRLMHLESNLEGFLSGNSEKK 541 ALKLVY KYL AL+ ++ K + KS V S+ F M LE +L+G L ++K Sbjct: 96 ALKLVYVKYLDALNRAMVKLEQPDNEKSEVKKSAL-GFSAVRMDLELDLKGVLMEICDEK 154 Query: 542 KKGGVYTNIKMRQTQIEFFYDGKLSNTDEHMGIGKKSAGIDHVGENGVESINNRGAECLS 721 KK + ++ DG LS+ E +K + V + V+ I++ LS Sbjct: 155 KKDEEHGKMEFDPAA-----DGNLSDHIEVQDFVEKQSLDGSVWD--VKGIHDNRLMKLS 207 Query: 722 EIRNDNDDCIAAASDLGEGR--------------------GXXXXXXXXXXXXXXXXXXI 841 E D DDCI +G+ G Sbjct: 208 E---DEDDCIIRKRKYSDGKIGVNYDEKHLKIDDGNGDVLGGSDDIGLTKFRGDKEDSIT 264 Query: 842 RKRRREFVPKMLKWIHMVAKDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVR-EACLRKD 1018 R+R+R+ MLKW+ +AKDPCDPAIGNLPE+SKWKSY ++ VWK+VL 
+R E L + Sbjct: 265 RQRKRDSNLDMLKWVIELAKDPCDPAIGNLPEKSKWKSYGNEVVWKKVLLLRDEMLLEGN 324 Query: 1019 IDSSDQQGTGQKKQKMHPFMYDDHVRTSPEKLRFSQRLISAKDTLKKTGGQIXXXXXXXG 1198 +D+S + Q+KQKMHP MYD++ + E+LR SQR+ SAKD KK+G I Sbjct: 325 VDTSTRNSIWQQKQKMHPSMYDNN--SGSEQLRCSQRVQSAKDYSKKSGSHIDATLINRP 382 Query: 1199 TDEDSVDPQSDSIVDPLPDYLVNRRKKRMSLGPQHQADVPEWTGEAYESDNKWLGTKMWP 1378 TD ++S V + RRKK S+G + QAD+PE + YESD+KWLGT++WP Sbjct: 383 TDS-----SAESGV-----WWNRRRKKIASIGSEFQADIPECNKDIYESDSKWLGTRIWP 432 >gb|EOX95917.1| ARID/BRIGHT DNA-binding domain,ELM2 domain protein, putative isoform 1 [Theobroma cacao] gi|508704022|gb|EOX95918.1| ARID/BRIGHT DNA-binding domain,ELM2 domain protein, putative isoform 1 [Theobroma cacao] Length = 671 Score = 256 bits (655), Expect = 1e-65 Identities = 177/505 (35%), Positives = 241/505 (47%), Gaps = 50/505 (9%) Frame = +2 Query: 14 RVDGSGLERFKTPQKFRKKETRLDFQPVCVKAGEQQQDRRDELKSSFDEFLSVFLQETFG 193 R DGS L+ KTP+K +D +P G + D+L+ F++FL+ FL+E Sbjct: 2 RADGSSLDCAKTPEKLEPAGYWVDLEPF--SEGSFLKSEPDKLRFWFNKFLASFLKEICA 59 Query: 194 TSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDSSVGLALKL 373 C PLPPMLGDG+ VDLFKL+L VR++GGY VSE+ LWD VA+E G +V ++KL Sbjct: 60 QGCFWPLPPMLGDGQPVDLFKLFLVVREKGGYNAVSESGLWDLVAEESGLGLNVASSVKL 119 Query: 374 VYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRLMHLESNLEGFL----------- 520 VY KYL +L L++ + ++DSKS + G LM L + L+GFL Sbjct: 120 VYVKYLVSLERWLERIIESEDSKS------ESDYSGHLMELGAELKGFLLASKKKVVEYS 173 Query: 521 ---------SGNSEKKKKGGVYTNIKMRQTQIEFFYDGKLSNTDEHMGIGKKSAGIDH-- 667 S EK K +I + + + + GKL N D+ + S G Sbjct: 174 QVEESVVAGSDGGEKCVKNEESMHIDLTKRVLNYEGVGKLQNDDDSKSVVVDSDGDKKCM 233 Query: 668 ----------------VGENGVESINNRGAECLSEIRNDNDDCIAAASDLGEGRGXXXXX 799 V + VE I N E S I D DC + Sbjct: 234 DGDECEESPSDLAKSAVNSSDVEKICNED-EVKSAIMEDFVDCKKCTDSDDDDN-----V 287 Query: 800 XXXXXXXXXXXXXIRKRRREFVPKMLKWIHMVAKDPCDPAIGNLPERSKWKSYPSDEVWK 979 KR+RE + ML WI +AKDPCDP IG+LPERSKWKSY ++E+WK Sbjct: 288 VILDSNDTKEKFSSHKRKRESMWGMLNWITEIAKDPCDPVIGSLPERSKWKSYGNEELWK 347 Query: 980 
QVLFVREACL-RKDIDSSDQQGTGQKKQKMHPFMYDDHVR---------TSPEKLRFSQR 1129 QVL REA +KD S Q + QK QKMHP +YDD R + P+KL + Sbjct: 348 QVLLFREAAFHKKDDHSGVDQSSWQKNQKMHPCLYDDPTRFGYNLRERLSCPKKLLLGKM 407 Query: 1130 LISAKD--TLKKTGGQIXXXXXXXGTDEDSVDPQSDSIVDPLPDYLVNRRKKRMSLGPQH 1303 + K+ +G G D+ S + + DY ++ +GP Sbjct: 408 VSKGKNYSQSSSSGNHSDLDNSMVGIDKQSHGTYDSATPGSVFDY---DNDMQVPIGPYF 464 Query: 1304 QADVPEWTGEAYESDNKWLGTKMWP 1378 Q +VP+WTG A ESD KWLGT++WP Sbjct: 465 QVEVPDWTGLASESDPKWLGTRVWP 489 >ref|XP_004308360.1| PREDICTED: AT-rich interactive domain-containing protein 1-like [Fragaria vesca subsp. vesca] Length = 595 Score = 242 bits (617), Expect = 3e-61 Identities = 165/469 (35%), Positives = 239/469 (50%), Gaps = 10/469 (2%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKKETRLDFQPVCVKAGEQQQDRRDELKSSFDEFLSVFLQ 181 GW DGS L+ K + E++ CV D+L+S FD+FL V L+ Sbjct: 3 GWSMLADGSVLDCGKKEESDCFSESKR-----CVSG------LPDKLRSWFDQFLGVLLR 51 Query: 182 ETFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDSSVGL 361 E RPLPPMLG+G+ VDLFKL+ VRK+GG++ VS+N +WD VA EC SS+G Sbjct: 52 EICAKDSARPLPPMLGNGQRVDLFKLFWAVRKKGGFDCVSDNGVWDLVAGECRLGSSLGG 111 Query: 362 ALKLVYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRLMHLESNLEGFLSGNSEKK 541 A+KLVYSKYL+ + + + N+D + + S D +L L+ +G ++K Sbjct: 112 AVKLVYSKYLYL----VDRLLENRDLEWSLGSSGSD-LKKQLTDLQDEFKGMFPEVGDRK 166 Query: 542 KKGGVYTNIKMRQTQIEFFYDGKLSNTDEHMGI-GKKSAGIDHVGENGVE----SINNRG 706 Y +M L++++E G GK S VG G++ ++ + G Sbjct: 167 VNKAGYLKEEMSPK--------SLNDSNEEAGAGGKGSRNKKRVGNVGLDWDSGTVEDAG 218 Query: 707 AECLSEIRNDNDDCIAAASDLGEGR----GXXXXXXXXXXXXXXXXXXIRKRRREFVPKM 874 N+N++ + + G G+ RKR+RE M Sbjct: 219 K------LNNNEEVKSEVVESGGGKKVDDDDDYVDVLIVNPATMEVSSCRKRKRESFCGM 272 Query: 875 LKWIHMVAKDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVREAC-LRKDIDSSDQQGTGQ 1051 L W+ M+A DPCDP++G+LPERSKWKS+ + E WKQVL REA L+K+ DS +Q GQ Sbjct: 273 LNWLRMIAADPCDPSVGSLPERSKWKSFGNKENWKQVLGAREAIYLKKNADSVAEQFNGQ 332 Query: 1052 KKQKMHPFMYDDHVRTSPEKLRFSQRLISAKDTLKKTGGQIXXXXXXXGTDEDSVDPQSD 1231 +MHP MYDDH+ T+ LR QRL + T+ ++G + G+D P 
+ Sbjct: 333 NNLRMHPSMYDDHLGTA-YNLRERQRLEKQQRTMSESGACL---YSSPGSDMSKSSPGME 388 Query: 1232 SIVDPLPDYLVNRRKKRMSLGPQHQADVPEWTGEAYESDNKWLGTKMWP 1378 ++ + +G + QA VPEWTGE ESD KWLGT++WP Sbjct: 389 DHIE-------------VRVGSKFQAHVPEWTGELEESDGKWLGTRVWP 424 >ref|XP_006339600.1| PREDICTED: AT-rich interactive domain-containing protein 1-like [Solanum tuberosum] Length = 578 Score = 240 bits (613), Expect = 9e-61 Identities = 163/429 (37%), Positives = 223/429 (51%), Gaps = 43/429 (10%) Frame = +2 Query: 221 MLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDSSVGLALKLVYSKYLHAL 400 MLGDGK+VDL KL L VR++GGYERV + W VA ECGFD + GLALKLVY KYL AL Sbjct: 1 MLGDGKAVDLCKLSLVVREKGGYERVCASRSWGVVAVECGFDLTSGLALKLVYVKYLDAL 60 Query: 401 SCSLQKAVANKDSKSGVTDSSPDSFGGRLMHLESNLEGFLSGNSEKKKKGGVYTNIKMRQ 580 + ++ K + KS V S+ F M LE +L+G L +K ++ G + Sbjct: 61 NRAMVKLEQPNNEKSEVKKSAL-GFSAVRMDLELDLKGVLMNEEKKDEEHG--------K 111 Query: 581 TQIEFFYDGKLSNTDEHMGIGKKSA---------GIDH-------------VGENGVESI 694 + + DGKLS+ E +K + GID V +N + + Sbjct: 112 MEFDPAADGKLSDGIEVQDFVEKRSLDGSVCDVKGIDEKFSVVNAEEDSKFVNQNEDDQL 171 Query: 695 NNRGAECLSEIRNDNDDCIAAASDLGEGR--------------------GXXXXXXXXXX 814 + L ++ D DDC +G+ G Sbjct: 172 DTNDNR-LMKLSGDEDDCDNRKRKYSDGKMIVGYDEEHLKIDDGNGDVQGGKEGIGLTKF 230 Query: 815 XXXXXXXXIRKRRREFVPKMLKWIHMVAKDPCDPAIGNLPERSKWKSYPSDEVWKQVLFV 994 RK +RE MLKW+ +AKDPCDPAIG+LPE+SKWKSY ++ VWK+VL + Sbjct: 231 SGDKEDSITRKGKRESNLDMLKWVTELAKDPCDPAIGHLPEKSKWKSYGNEVVWKKVLLL 290 Query: 995 R-EACLRKDIDSSDQQGTGQKKQKMHPFMYDDHVRTSPEKLRFSQRLISAKDTLKKTGGQ 1171 R E L ++D+S Q Q+KQKMHP MYDD+ + ++LR SQR+ SAKD+ KK+G Sbjct: 291 RDEMLLEGNVDTSTQNSIWQQKQKMHPSMYDDN--SGSDQLRCSQRVQSAKDSSKKSGSH 348 Query: 1172 IXXXXXXXGTDEDSVDPQSDSIVDPLPDYLVNRRKKRMSLGPQHQADVPEWTGEAYESDN 1351 I D +D +DS + + RRKK S+G + QAD+PEW + YESD+ Sbjct: 349 I---------DATLIDRPTDSSAES-GVWWNRRRKKIASIGSEFQADIPEWNKDIYESDS 398 Query: 1352 KWLGTKMWP 1378 KWLGT++WP Sbjct: 399 KWLGTRIWP 407 >ref|XP_002302593.2| hypothetical protein POPTR_0002s16250g [Populus trichocarpa] 
gi|550345138|gb|EEE81866.2| hypothetical protein POPTR_0002s16250g [Populus trichocarpa] Length = 655 Score = 231 bits (588), Expect = 7e-58 Identities = 158/456 (34%), Positives = 235/456 (51%), Gaps = 42/456 (9%) Frame = +2 Query: 137 ELKSSFDEFLSVFLQETFGT-SCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCL 313 +L+S FD+F F +E G SC+ LPPMLG+G+ VDL KL+L VR++GGY+ VS+N L Sbjct: 44 QLRSCFDKFSESFFKEVCGLDSCLWTLPPMLGNGQFVDLLKLFLVVREKGGYDVVSKNGL 103 Query: 314 WDSVAKECGFDSSVGLALKLVYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRLMH 493 W VA+E GF S+ A+KLVY KYL AL L++ + + + S + G +M Sbjct: 104 WGLVAQESGFGLSLVPAVKLVYIKYLDALERWLERLLVDSVELNTELSDSGVNVVGAVME 163 Query: 494 LESNLEGFLSGNSEKKKKGGVYTNIKMR---QTQIEFFYDGKLSNTDEHMGIGKKSAGID 664 L + +G LS EK+ + +K ++E + K +E + I +G+D Sbjct: 164 LGAEFKGLLSEMPEKE-----FLELKSELNVDAEVESYESEKFVEDEEPLHIDLTKSGVD 218 Query: 665 HV-----GENGVESI------NNRGAEC-------LSEIRN----DNDD-----CIAAAS 763 +V G+N V+S+ +N +C +S+ R +N+D + Sbjct: 219 YVEVGESGDNVVKSVMVDDSFSNWNVKCKDVVEKLISDSRKNEKVENEDEVKSVVVVEID 278 Query: 764 DLGEG-RGXXXXXXXXXXXXXXXXXXIRKRRREFVPKMLKWIHMVAKDPCDPAIGNLPER 940 GEG +G RKR+RE +P+ML W+ +A+DPCDP +G+LPE Sbjct: 279 GDGEGDKGDNSEVEELDLATYNESVSSRKRKRESIPRMLNWVTGIARDPCDPVVGSLPEW 338 Query: 941 SKWKSYPSDEVWKQVLFVREAC-LRKDIDSSD--QQGTGQKKQKMHPFMYDDHVRTSPEK 1111 SKWK Y ++E WKQVL REA L++++DS+ ++ QK KMHP YDDH +S Sbjct: 339 SKWKFYGNEECWKQVLLTREALFLKRNVDSTSIAERSFRQKNPKMHPCRYDDHAGSS--- 395 Query: 1112 LRFSQRLISAKDTLKKTGGQIXXXXXXXGTDEDSVDPQSDSIVD-------PLPDYLVNR 1270 +RL K L GG + D + D + D+ + Sbjct: 396 YNLRERLKCRKKPL--PGGTSSQAHVCSQSSSGETSSCMDGVYDGDSSTEHSVLDFPIT- 452 Query: 1271 RKKRMSLGPQHQADVPEWTGEAYESDNKWLGTKMWP 1378 KR+ +GP QA+VPEWTG +SD+KWLGT++WP Sbjct: 453 --KRIPVGPVFQAEVPEWTGVVSKSDSKWLGTQVWP 486 >ref|XP_006293913.1| hypothetical protein CARUB_v10022904mg [Capsella rubella] gi|482562621|gb|EOA26811.1| hypothetical protein CARUB_v10022904mg [Capsella rubella] Length = 560 Score = 199 bits (505), Expect = 3e-48 Identities = 145/460 (31%), Positives = 213/460 (46%), Gaps = 
1/460 (0%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKKETRLDFQPVCVKAGEQQQDRRDELKSSFDEFLSVFLQ 181 GW ++R KTP+ + P +G +++ + +L S F L FL Sbjct: 3 GWSMVAGEDAVDRSKTPKTLDANRSPESVNPEPTGSGFEKKIK--DLISLFRPLLESFLC 60 Query: 182 ETFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDSSVGL 361 E + RPLPPM GDG+ VDLF L+LNV +GG++ VSEN WD VA++ G S Sbjct: 61 EFCASDSFRPLPPMTGDGRVVDLFNLFLNVSHKGGFDAVSENESWDEVAQDSGLGSYNSA 120 Query: 362 ALKLVYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRLMHLESNLEGFLSGNSEKK 541 + KL+Y KYL AL+ L + +A G TD+S G L + LE FLS E K Sbjct: 121 SAKLIYVKYLDALARWLNRVIA------GDTDASSLELSGISDDLLARLESFLS---EVK 171 Query: 542 KKGGVYTNIKMRQTQIEF-FYDGKLSNTDEHMGIGKKSAGIDHVGENGVESINNRGAECL 718 +K + ++ E ++ K + IGKK D V E ++ + E + Sbjct: 172 RKYELRKGKTAKELGAELKWFISKTKRRYDKYHIGKKPGSNDAVKEIQGSTLTEKRLEKI 231 Query: 719 SEIRNDNDDCIAAASDLGEGRGXXXXXXXXXXXXXXXXXXIRKRRREFVPKMLKWIHMVA 898 + + N + + KR+RE + LKW+ VA Sbjct: 232 RILESGNQEYSSPG----------------------------KRKRECPLETLKWLSKVA 263 Query: 899 KDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVREACLRKDIDSSDQQGTGQKKQKMHPFM 1078 KDP DP+ G +P+RSKW++Y S+E WKQ+L R + + + S + QK QKMHP + Sbjct: 264 KDPFDPSTGRMPDRSKWEAYGSEEPWKQLLLFRAS---RTCNDSGCEKIWQKLQKMHPSL 320 Query: 1079 YDDHVRTSPEKLRFSQRLISAKDTLKKTGGQIXXXXXXXGTDEDSVDPQSDSIVDPLPDY 1258 YD+ P + + KK+ G G+D DS D + Sbjct: 321 YDN--SAGPSYNLRERTSYDGQFNDKKSSG--------IGSDSDSSDEED---------- 360 Query: 1259 LVNRRKKRMSLGPQHQADVPEWTGEAYESDNKWLGTKMWP 1378 + R +G + QA+VPEWT ESD+KWLGT++WP Sbjct: 361 -----RPRTLVGSEFQAEVPEWTEITPESDSKWLGTRIWP 395 >ref|NP_182128.2| ARID/BRIGHT AND ELM2 DNA-binding domain-containing protein [Arabidopsis thaliana] gi|75146722|sp|Q84JT7.1|ARID1_ARATH RecName: Full=AT-rich interactive domain-containing protein 1; Short=ARID domain-containing protein 1; AltName: Full=ARID and ELM2 domain-containing protein 1 gi|28393187|gb|AAO42024.1| unknown protein [Arabidopsis thaliana] gi|28827512|gb|AAO50600.1| unknown protein [Arabidopsis thaliana] gi|330255541|gb|AEC10635.1| ARID/BRIGHT AND ELM2 DNA-binding 
domain-containing protein [Arabidopsis thaliana] Length = 562 Score = 198 bits (504), Expect = 4e-48 Identities = 155/461 (33%), Positives = 215/461 (46%), Gaps = 2/461 (0%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKKETRLDFQPVCVKAGEQQQDRRDELKSSFDEFLSVFLQ 181 GW D ++ KTP+ + P G +++ + EL S F L FL Sbjct: 3 GWSMVADEDAVDYSKTPKSLDANRSPESVNPE--STGFEKKIK--ELISLFRPLLDSFLA 58 Query: 182 ETFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDSSVGL 361 E PLP M G+G++VDLF L+LNV +GG++ VSEN WD V +E G +S Sbjct: 59 EFCSADGFLPLPAMTGEGRTVDLFNLFLNVTHKGGFDAVSENGSWDEVVQESGLESYDSA 118 Query: 362 ALKLVYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRLMHLESNLEGFLSGNSEKK 541 + KL+Y KYL A L + VA G TD S G L + L GFLS E K Sbjct: 119 SAKLIYVKYLDAFGRWLNRVVA------GDTDVSSVELSGISDALVARLNGFLS---EVK 169 Query: 542 KKGGVYTNIKMRQ--TQIEFFYDGKLSNTDEHMGIGKKSAGIDHVGENGVESINNRGAEC 715 KK + ++ ++++F D+H +GK+SA D V E + R E Sbjct: 170 KKYELRKGRPAKELGAELKWFISKTKRRYDKHH-VGKESASNDAVKEFQGSKLAERRLEQ 228 Query: 716 LSEIRNDNDDCIAAASDLGEGRGXXXXXXXXXXXXXXXXXXIRKRRREFVPKMLKWIHMV 895 + + + +C + KR+RE + LKW+ V Sbjct: 229 IMILESVTQECSSPG----------------------------KRKRECPLETLKWLSDV 260 Query: 896 AKDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVREACLRKDIDSSDQQGTGQKKQKMHPF 1075 AKDPCDP++G +P+RS+W SY S+E WKQ+L R + R + DS+ ++ T QK QKMHP Sbjct: 261 AKDPCDPSLGIVPDRSEWVSYGSEEPWKQLLLFRAS--RTNNDSACEK-TWQKVQKMHPC 317 Query: 1076 MYDDHVRTSPEKLRFSQRLISAKDTLKKTGGQIXXXXXXXGTDEDSVDPQSDSIVDPLPD 1255 +YDD S +RL KTG G+D S D + P Sbjct: 318 LYDDSAGAS---YNLRERLSYEDYKRGKTGN---------GSDIGSSDEED------RPC 359 Query: 1256 YLVNRRKKRMSLGPQHQADVPEWTGEAYESDNKWLGTKMWP 1378 LV G + QA VPEWTG ESD+KWLGT++WP Sbjct: 360 ALV---------GSKFQAKVPEWTGITPESDSKWLGTRIWP 391 >gb|AAC62899.1| hypothetical protein [Arabidopsis thaliana] Length = 576 Score = 198 bits (504), Expect = 4e-48 Identities = 155/461 (33%), Positives = 215/461 (46%), Gaps = 2/461 (0%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKKETRLDFQPVCVKAGEQQQDRRDELKSSFDEFLSVFLQ 181 GW D ++ KTP+ + P G +++ + EL S F L FL Sbjct: 17 
GWSMVADEDAVDYSKTPKSLDANRSPESVNPE--STGFEKKIK--ELISLFRPLLDSFLA 72 Query: 182 ETFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDSSVGL 361 E PLP M G+G++VDLF L+LNV +GG++ VSEN WD V +E G +S Sbjct: 73 EFCSADGFLPLPAMTGEGRTVDLFNLFLNVTHKGGFDAVSENGSWDEVVQESGLESYDSA 132 Query: 362 ALKLVYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRLMHLESNLEGFLSGNSEKK 541 + KL+Y KYL A L + VA G TD S G L + L GFLS E K Sbjct: 133 SAKLIYVKYLDAFGRWLNRVVA------GDTDVSSVELSGISDALVARLNGFLS---EVK 183 Query: 542 KKGGVYTNIKMRQ--TQIEFFYDGKLSNTDEHMGIGKKSAGIDHVGENGVESINNRGAEC 715 KK + ++ ++++F D+H +GK+SA D V E + R E Sbjct: 184 KKYELRKGRPAKELGAELKWFISKTKRRYDKHH-VGKESASNDAVKEFQGSKLAERRLEQ 242 Query: 716 LSEIRNDNDDCIAAASDLGEGRGXXXXXXXXXXXXXXXXXXIRKRRREFVPKMLKWIHMV 895 + + + +C + KR+RE + LKW+ V Sbjct: 243 IMILESVTQECSSPG----------------------------KRKRECPLETLKWLSDV 274 Query: 896 AKDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVREACLRKDIDSSDQQGTGQKKQKMHPF 1075 AKDPCDP++G +P+RS+W SY S+E WKQ+L R + R + DS+ ++ T QK QKMHP Sbjct: 275 AKDPCDPSLGIVPDRSEWVSYGSEEPWKQLLLFRAS--RTNNDSACEK-TWQKVQKMHPC 331 Query: 1076 MYDDHVRTSPEKLRFSQRLISAKDTLKKTGGQIXXXXXXXGTDEDSVDPQSDSIVDPLPD 1255 +YDD S +RL KTG G+D S D + P Sbjct: 332 LYDDSAGAS---YNLRERLSYEDYKRGKTGN---------GSDIGSSDEED------RPC 373 Query: 1256 YLVNRRKKRMSLGPQHQADVPEWTGEAYESDNKWLGTKMWP 1378 LV G + QA VPEWTG ESD+KWLGT++WP Sbjct: 374 ALV---------GSKFQAKVPEWTGITPESDSKWLGTRIWP 405 >ref|XP_006293914.1| hypothetical protein CARUB_v10022904mg, partial [Capsella rubella] gi|482562622|gb|EOA26812.1| hypothetical protein CARUB_v10022904mg, partial [Capsella rubella] Length = 572 Score = 194 bits (493), Expect = 7e-47 Identities = 143/460 (31%), Positives = 211/460 (45%), Gaps = 1/460 (0%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKKETRLDFQPVCVKAGEQQQDRRDELKSSFDEFLSVFLQ 181 GW ++R KTP+ + P +G +++ + +L S F L FL Sbjct: 18 GWSMVAGEDAVDRSKTPKTLDANRSPESVNPEPTGSGFEKKIK--DLISLFRPLLESFLC 75 Query: 182 ETFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDSSVGL 361 E + RPLPPM GDG+ VDLF L+LNV +GG++ VSEN WD VA++ G S 
Sbjct: 76 EFCASDSFRPLPPMTGDGRVVDLFNLFLNVSHKGGFDAVSENESWDEVAQDSGLGSYNSA 135 Query: 362 ALKLVYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRLMHLESNLEGFLSGNSEKK 541 + KL+Y KYL AL+ L + +A G TD+S G L + LE FLS E K Sbjct: 136 SAKLIYVKYLDALARWLNRVIA------GDTDASSLELSGISDDLLARLESFLS---EVK 186 Query: 542 KKGGVYTNIKMRQTQIEF-FYDGKLSNTDEHMGIGKKSAGIDHVGENGVESINNRGAECL 718 +K + ++ E ++ K + IGKK D V E ++ + E + Sbjct: 187 RKYELRKGKTAKELGAELKWFISKTKRRYDKYHIGKKPGSNDAVKEIQGSTLTEKRLEKI 246 Query: 719 SEIRNDNDDCIAAASDLGEGRGXXXXXXXXXXXXXXXXXXIRKRRREFVPKMLKWIHMVA 898 + + N + + KR+RE + LKW+ VA Sbjct: 247 RILESGNQEYSSPG----------------------------KRKRECPLETLKWLSKVA 278 Query: 899 KDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVREACLRKDIDSSDQQGTGQKKQKMHPFM 1078 KDP DP+ G +P+RSKW++Y S+E WKQ+L R + + + G + QKMHP + Sbjct: 279 KDPFDPSTGRMPDRSKWEAYGSEEPWKQLLLFRAS------RTCNDSGCEKIWQKMHPSL 332 Query: 1079 YDDHVRTSPEKLRFSQRLISAKDTLKKTGGQIXXXXXXXGTDEDSVDPQSDSIVDPLPDY 1258 YD+ P + + KK+ G G+D DS D + Sbjct: 333 YDN--SAGPSYNLRERTSYDGQFNDKKSSG--------IGSDSDSSDEED---------- 372 Query: 1259 LVNRRKKRMSLGPQHQADVPEWTGEAYESDNKWLGTKMWP 1378 + R +G + QA+VPEWT ESD+KWLGT++WP Sbjct: 373 -----RPRTLVGSEFQAEVPEWTEITPESDSKWLGTRIWP 407 >ref|XP_006476534.1| PREDICTED: AT-rich interactive domain-containing protein 2-like isoform X1 [Citrus sinensis] Length = 625 Score = 192 bits (489), Expect = 2e-46 Identities = 155/486 (31%), Positives = 218/486 (44%), Gaps = 27/486 (5%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKKETRLDFQPVCVKAGEQQQDR---RDELKSSFDEFLSV 172 GW +GS L+ KT + + +K + +D DELK FD+ L Sbjct: 3 GWSILTNGSALDCGKTIGSVQSNDGCCPEADNYMKDDDSVEDSGGYEDELKCLFDKVLET 62 Query: 173 FLQE-TFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDS 349 L+E + IRP+P MLGDG+S+DLFKL+ VR+RGG+ VS+N LW V ++ G D Sbjct: 63 VLKEGSDRKGSIRPIPAMLGDGRSLDLFKLFCAVRERGGFCMVSKNGLWGFVLEDLGLDF 122 Query: 350 SVGLALKLVYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRL----MHLESNLEGF 517 V ++KLVY++YL L L G FGG + LE+ G Sbjct: 123 GVSASVKLVYARYLGELEKWLMGTSGLSLGNGGC------GFGGNSGLLPLELETRFRGL 176 Query: 
518 LSGNSEKKKKGGVYTNIKMRQTQIEFFYDGKLSNTDEHMGIGKKSAGIDHVG-ENGVESI 694 L S+KK K R +E+ D H+ + + +D + +N E Sbjct: 177 LMNWSKKKIKDD-------RLALLEY------KKNDNHVDMEIEKTELDLLDTKNRHERC 223 Query: 695 NNRGAECLSEIRN--DNDDCIAAASDLGEGRGXXXXXXXXXXXXXXXXXXIRKRRREFVP 868 N G +C R DNDD + RKR+RE + Sbjct: 224 NCLGKKCSDNNRKNYDNDDKLC----------------NDDPSITQKEYCYRKRKRESLS 267 Query: 869 KMLKWIHMVAKDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVREACL-RKDIDSSDQQGT 1045 ML W+ +AK P DP IG +PE SKWK+ E+W + R+A L RK ++S+ Q Sbjct: 268 GMLNWVIQIAKYPDDPLIGVIPEPSKWKNNEDKELWLHAIRARDALLQRKHVNSNIHQSL 327 Query: 1046 GQKKQKMHPFMYDDHVRT---SPEKLRFSQRL------------ISAKDTLKKTGGQIXX 1180 Q QKMHP MY+D S E+LR S+RL S T K Sbjct: 328 FQNGQKMHPSMYEDVTNQRHWSTERLRSSERLPTIMKSRVCSCCSSCSATENKLTSPHNA 387 Query: 1181 XXXXXGTDEDSVDPQSDSIVDPLPDYLVNRRKKRMSLGPQHQADVPEWTGEAYESDNKWL 1360 E + S ++ + ++K++S+GP QA VPEWTG ESD+KWL Sbjct: 388 ELETGPKGETPMTVTSSAMNIAVCSSGDEPQEKQVSVGPLFQASVPEWTGVVLESDSKWL 447 Query: 1361 GTKMWP 1378 GT++WP Sbjct: 448 GTRIWP 453 >ref|XP_006476535.1| PREDICTED: AT-rich interactive domain-containing protein 2-like isoform X2 [Citrus sinensis] Length = 590 Score = 190 bits (482), Expect = 1e-45 Identities = 146/439 (33%), Positives = 202/439 (46%), Gaps = 24/439 (5%) Frame = +2 Query: 134 DELKSSFDEFLSVFLQE-TFGTSCIRPLPPMLGDGKSVDLFKLYLNVRKRGGYERVSENC 310 DELK FD+ L L+E + IRP+P MLGDG+S+DLFKL+ VR+RGG+ VS+N Sbjct: 15 DELKCLFDKVLETVLKEGSDRKGSIRPIPAMLGDGRSLDLFKLFCAVRERGGFCMVSKNG 74 Query: 311 LWDSVAKECGFDSSVGLALKLVYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRL- 487 LW V ++ G D V ++KLVY++YL L L G FGG Sbjct: 75 LWGFVLEDLGLDFGVSASVKLVYARYLGELEKWLMGTSGLSLGNGGC------GFGGNSG 128 Query: 488 ---MHLESNLEGFLSGNSEKKKKGGVYTNIKMRQTQIEFFYDGKLSNTDEHMGIGKKSAG 658 + LE+ G L S+KK K R +E+ D H+ + + Sbjct: 129 LLPLELETRFRGLLMNWSKKKIKDD-------RLALLEY------KKNDNHVDMEIEKTE 175 Query: 659 IDHVG-ENGVESINNRGAECLSEIRN--DNDDCIAAASDLGEGRGXXXXXXXXXXXXXXX 829 +D + +N E N G +C R DNDD + Sbjct: 176 LDLLDTKNRHERCNCLGKKCSDNNRKNYDNDDKLC----------------NDDPSITQK 219 Query: 
830 XXXIRKRRREFVPKMLKWIHMVAKDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVREACL 1009 RKR+RE + ML W+ +AK P DP IG +PE SKWK+ E+W + R+A L Sbjct: 220 EYCYRKRKRESLSGMLNWVIQIAKYPDDPLIGVIPEPSKWKNNEDKELWLHAIRARDALL 279 Query: 1010 -RKDIDSSDQQGTGQKKQKMHPFMYDDHVRT---SPEKLRFSQRL------------ISA 1141 RK ++S+ Q Q QKMHP MY+D S E+LR S+RL S Sbjct: 280 QRKHVNSNIHQSLFQNGQKMHPSMYEDVTNQRHWSTERLRSSERLPTIMKSRVCSCCSSC 339 Query: 1142 KDTLKKTGGQIXXXXXXXGTDEDSVDPQSDSIVDPLPDYLVNRRKKRMSLGPQHQADVPE 1321 T K E + S ++ + ++K++S+GP QA VPE Sbjct: 340 SATENKLTSPHNAELETGPKGETPMTVTSSAMNIAVCSSGDEPQEKQVSVGPLFQASVPE 399 Query: 1322 WTGEAYESDNKWLGTKMWP 1378 WTG ESD+KWLGT++WP Sbjct: 400 WTGVVLESDSKWLGTRIWP 418 >gb|EOY24787.1| ARID/BRIGHT DNA-binding domain,ELM2 domain protein, putative [Theobroma cacao] Length = 616 Score = 189 bits (479), Expect = 3e-45 Identities = 149/477 (31%), Positives = 232/477 (48%), Gaps = 18/477 (3%) Frame = +2 Query: 2 GWLKRVDGSGLERFKTPQKFRKKETRLDFQPVCVKAGEQQQDRRDELKSSFDEFLSVFLQ 181 GW +GS L+ T + LD PV + E+ D R+ L+ FD LS +L+ Sbjct: 3 GWSILTNGSALDCVGTVNNCQSNGCHLDDDPVTKNSVEEFGDHRNRLRCLFDLVLSGYLK 62 Query: 182 ETFGTSCIRPLPPMLG-DGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDSSVG 358 E +R +P MLG DG S+DL KL+L VR+ GGYE VS+ LW V KE G D V Sbjct: 63 EVACKGFVRRMPAMLGNDGHSLDLLKLFLVVREIGGYEFVSKKGLWAFVVKELGLDLEVS 122 Query: 359 LALKLVYSKYLHALSCSLQKAVANKDSKSGVTDSSPDSFGGRLMHLESNLEGFLSGNSEK 538 ++KL+Y+KYL+ L L+ ++ +++ + + GG+ FLS E+ Sbjct: 123 ASVKLIYAKYLNELEKWLRNSLVDRNGEG--------AGGGKFR--------FLSLEQEE 166 Query: 539 KKKGGVYTNIKMRQTQIEFFYDGKLSNTDEHMGIGKKSAGIDHVGENGVESINNRGAECL 718 + + G++TN ++ + + D+ + K G+ N +++ G E Sbjct: 167 EFR-GLFTNGVDQKVVVNRVALSEYIKNDKCIAKDSKKNGLKISDANSRYRLHS-GVE-- 222 Query: 719 SEIRNDNDDCIAAASDLGEGRGXXXXXXXXXXXXXXXXXXIRKRRREFVPKMLKWIHMVA 898 E+ +DND+ + +DLG RKR+RE + ML W+ VA Sbjct: 223 -EVFSDNDEKV-CRNDLG----------VLDPPVARKEFSTRKRKRESLAGMLNWVTQVA 270 Query: 899 KDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVREACLRK-DIDSSDQQGTGQKKQKMHPF 1075 K DP++ + E SKWK + +E W Q + REA +K D S +Q Q +KMHP Sbjct: 271 
KCHDDPSVWAIAEPSKWKDHGGNEFWIQAIRAREAIRQKRDDHSVTEQSLLQNNKKMHPS 330 Query: 1076 MYDDHVRTS--PEKLRFSQRL----------ISAKDTLKKTGGQIXXXXXXXGTDEDS-- 1213 MY+D + + E+ R S++L S+ L+K G E S Sbjct: 331 MYEDGILSHHLTERSRCSEKLPTTQSRSCSCCSSDSALQKNSMCRHKTESECGLKEQSPV 390 Query: 1214 -VDPQS-DSIVDPLPDYLVNRRKKRMSLGPQHQADVPEWTGEAYESDNKWLGTKMWP 1378 +D S D V+P D + ++++S+G + QA+VPEWTG ++D+KWLGT+ WP Sbjct: 391 TIDSSSLDMTVEPSGD---DSLRRQVSVGLRFQAEVPEWTGMVSDTDSKWLGTQEWP 444 >ref|XP_004492764.1| PREDICTED: AT-rich interactive domain-containing protein 2-like isoform X1 [Cicer arietinum] gi|502105269|ref|XP_004492765.1| PREDICTED: AT-rich interactive domain-containing protein 2-like isoform X2 [Cicer arietinum] Length = 586 Score = 181 bits (458), Expect = 8e-43 Identities = 129/436 (29%), Positives = 205/436 (47%), Gaps = 50/436 (11%) Frame = +2 Query: 221 MLGDGKSVDLFKLYLNVRKRGGYERVSENCLWDSVAKECGFDSSVGLALKLVYSKYLHAL 400 MLG + +DL+KL++ V+ +GGY+ V +N LWD V +E G VG +++LVYSKYL L Sbjct: 1 MLGSEQPLDLYKLFMVVKDKGGYDVVCKNRLWDLVGEEYGLGVKVGSSVELVYSKYLSTL 60 Query: 401 SCSLQKAVANKDSKSGVTDSSPDSFGGRLMHLESNLEGFLSGNSEKKKKG----GVY--- 559 L+ V ++ +K G+ D FG RLM L++ FL + ++ G VY Sbjct: 61 ETPLKNVVDDEVAKCGLVDDRV-KFGERLMELQAE---FLLDDYGEEDAGDELESVYDCG 116 Query: 560 -----------TNIKMRQTQIEFFYDG------------KLSNTDEHMGIGKKSAGI--- 661 N K+ ++E YD K +N + + K+ G+ Sbjct: 117 RKLCGTNRVKGVNSKLNAAELERVYDYLDGRKLRGANRMKDANLESNAAKKVKNGGLVDM 176 Query: 662 --DHVGENGV------ESINNRGAECLSEIRNDNDDCIAAASDLGEGRGXXXXXXXXXXX 817 + + N + ++N R+D+DD + D+ Sbjct: 177 HMEELDGNKILAVDVSNTVNKMPGLSDGSKRHDSDDNADSVDDV----------LILDPS 226 Query: 818 XXXXXXXIRKRRREFVPKMLKWIHMVAKDPCDPAIGNLPERSKWKSYPSDEVWKQVLFVR 997 RKR+RE + ++L W+ AK+PCDP +G++PE+SKWKS ++E+WK+VL R Sbjct: 227 SVNRENFGRKRKRESMSEVLSWVTRTAKNPCDPVVGSIPEKSKWKSCRNEEIWKKVLLFR 286 Query: 998 EACLRKDIDSSDQQGTGQKKQKMHPFMYDDHVRTS---PEKLRFSQRLISAKDTLKKTGG 1168 E+ K S+ + Q+MHP MYDD+ + ++++ L++ K K Sbjct: 287 ESVFLKKDFGSNCEKLSWLAQRMHPAMYDDNFGATYNLRQRIKCDNGLLAGKSASKGIFS 346 Query: 1169 
QIXXXXXXXGTDE------DSVDPQSDSIVDPLPDYLVNRRKKRMSLGPQHQADVPEWTG 1330 ++ D DS P+S L + LGP HQA+VP+WTG Sbjct: 347 RMQRTPSPHTDDRAKKKLLDSSAPESS---------LDTPATVNIPLGPNHQAEVPKWTG 397 Query: 1331 EAYESDNKWLGTKMWP 1378 +ESD+KWLGT++WP Sbjct: 398 TTHESDSKWLGTQIWP 413