BLASTX nr result

ID: Akebia23_contig00006397 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00006397
         (3648 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI35803.3| unnamed protein product [Vitis vinifera]             1165   0.0  
ref|XP_006452906.1| hypothetical protein CICLE_v10007563mg [Citr...  1158   0.0  
ref|XP_006474564.1| PREDICTED: AT-rich interactive domain-contai...  1155   0.0  
ref|XP_002277324.1| PREDICTED: AT-rich interactive domain-contai...  1144   0.0  
ref|XP_006381551.1| hypothetical protein POPTR_0006s13780g [Popu...  1135   0.0  
ref|XP_002516200.1| DNA binding protein, putative [Ricinus commu...  1124   0.0  
ref|XP_007012520.1| ARID/BRIGHT DNA-binding domain-containing pr...  1122   0.0  
ref|XP_002324130.2| arid/bright DNA-binding domain-containing fa...  1120   0.0  
ref|XP_006362097.1| PREDICTED: AT-rich interactive domain-contai...  1108   0.0  
ref|XP_004252398.1| PREDICTED: AT-rich interactive domain-contai...  1100   0.0  
ref|XP_007217035.1| hypothetical protein PRUPE_ppa001668mg [Prun...  1097   0.0  
ref|XP_007012522.1| ARID/BRIGHT DNA-binding domain-containing pr...  1086   0.0  
gb|EXB64667.1| AT-rich interactive domain-containing protein 4 [...  1085   0.0  
ref|XP_004303747.1| PREDICTED: AT-rich interactive domain-contai...  1063   0.0  
ref|XP_003547888.1| PREDICTED: AT-rich interactive domain-contai...  1059   0.0  
ref|XP_003533805.1| PREDICTED: AT-rich interactive domain-contai...  1045   0.0  
ref|XP_006828651.1| hypothetical protein AMTR_s00129p00111730 [A...  1042   0.0  
gb|EYU21278.1| hypothetical protein MIMGU_mgv1a001736mg [Mimulus...  1038   0.0  
ref|XP_006587068.1| PREDICTED: AT-rich interactive domain-contai...  1037   0.0  
ref|XP_006587067.1| PREDICTED: AT-rich interactive domain-contai...  1037   0.0  

>emb|CBI35803.3| unnamed protein product [Vitis vinifera]
          Length = 746

 Score = 1165 bits (3015), Expect = 0.0
 Identities = 580/782 (74%), Positives = 634/782 (81%), Gaps = 1/782 (0%)
 Frame = -2

Query: 2597 MFHVQGPTKPMCGLLAVLC-EAPNSKQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPTT 2421
            M H QG +   CGLLAV C +    KQ+ + S D P  Y FP+ VSSGRLEVQTL SP+ 
Sbjct: 1    MLHTQGISNHTCGLLAVTCGKTSECKQEHETSNDRPR-YPFPDFVSSGRLEVQTLTSPSP 59

Query: 2420 DEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLEI 2241
            DEFRRV E  +PN VY QGEQL ND E+GSLVWGGV+LS+ E I GLFGS LPTTVYLEI
Sbjct: 60   DEFRRVFESVQPNFVYFQGEQLQND-EVGSLVWGGVELSSAEDICGLFGSKLPTTVYLEI 118

Query: 2240 PNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFR 2061
            PN E LA ALHSKG+PYVIYWKNAFSCYAACHFR ALFSVVQSS +HTWDAFQLA+ASFR
Sbjct: 119  PNGEKLAEALHSKGIPYVIYWKNAFSCYAACHFRNALFSVVQSSSTHTWDAFQLAYASFR 178

Query: 2060 LYCVRNNQVLPANNHKISGKLGPHLLGDPPKITIVPMVKEAGXXXXXXXXDLSGDLPAIK 1881
            LYCVRNN VLPAN+HK+SGKLGP LLGDP  I + P   +AG           G LPAIK
Sbjct: 179  LYCVRNNHVLPANSHKVSGKLGPRLLGDPATIDVPPPEVDAGEDEEGSL----GTLPAIK 234

Query: 1880 IYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAGTF 1701
            IYDDDV +RFLVCG PC LD+CL  SLEDGLNALL+IEIRGSKLHNRVSAPPPPLQAGTF
Sbjct: 235  IYDDDVGIRFLVCGEPCMLDSCLFESLEDGLNALLSIEIRGSKLHNRVSAPPPPLQAGTF 294

Query: 1700 SRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCGES 1521
            SRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLE +IK E+ E+SQLVHA P    +
Sbjct: 295  SRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLENNIKKEVTEQSQLVHALPYSEGN 354

Query: 1520 KPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSAVA 1341
            KP L EPR+S SIACGAA+FEVC KVP WASQVLRQLAP++SYRS V+LGIASIQG AVA
Sbjct: 355  KPPLSEPRRSASIACGAAVFEVCAKVPAWASQVLRQLAPDVSYRSLVALGIASIQGLAVA 414

Query: 1340 SFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGCSMV 1161
            SF+KDD  RLLFFCTR+ +  +P N  PS  P WLKPP PSRKR EP Q+T         
Sbjct: 415  SFEKDDANRLLFFCTRQGKYIHPNNFTPSRLPSWLKPPPPSRKRVEPSQDT--------- 465

Query: 1160 GEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGGTTE 981
                                ++G  +PL+P  Q+LK+AAMRPIPH RHHKMLPF  G +E
Sbjct: 466  --------------------MNGVTMPLLPAGQRLKVAAMRPIPHIRHHKMLPF-SGISE 504

Query: 980  VETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGCGRC 801
             + ++GGQ K NLSV P TKH+IVG     HRKS SSS+QAKQIISLNPLPLKKHGCGR 
Sbjct: 505  ADGHDGGQVKANLSVPPPTKHSIVGSTSAMHRKSFSSSYQAKQIISLNPLPLKKHGCGRS 564

Query: 800  PIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGF 621
            PI++CSEEEFL+DVMQFL LRGHTRL+PQGGLAEFPDAILNAKRLDLYNLYREVVSRGGF
Sbjct: 565  PIRVCSEEEFLKDVMQFLNLRGHTRLIPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGF 624

Query: 620  HVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHS 441
            HVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHS
Sbjct: 625  HVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHS 684

Query: 440  SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKAANG 261
            SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP CS+TNF++K  KA NG
Sbjct: 685  SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPQCSVTNFKKKANKAPNG 744

Query: 260  YS 255
            +S
Sbjct: 745  FS 746


>ref|XP_006452906.1| hypothetical protein CICLE_v10007563mg [Citrus clementina]
            gi|557556132|gb|ESR66146.1| hypothetical protein
            CICLE_v10007563mg [Citrus clementina]
          Length = 745

 Score = 1158 bits (2995), Expect = 0.0
 Identities = 579/782 (74%), Positives = 639/782 (81%), Gaps = 1/782 (0%)
 Frame = -2

Query: 2600 MMFHVQGPTKPMCGLLAVLCEA-PNSKQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPT 2424
            MMFH Q  ++  C LLAVL     + KQKQ  ++D P  Y FPE+ SSGRLEV  L SP+
Sbjct: 1    MMFHAQSSSRNHCSLLAVLSRKFVDDKQKQAATDDKPK-YPFPEIASSGRLEVHLLSSPS 59

Query: 2423 TDEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLE 2244
            TDEFRR++E SEPNIVYLQGE++ + +EIGSLVWG VDLS PEA+ GLFGSTLPTTVYLE
Sbjct: 60   TDEFRRLLESSEPNIVYLQGEKINDSEEIGSLVWGDVDLSTPEALCGLFGSTLPTTVYLE 119

Query: 2243 IPNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASF 2064
            IPN E+ A ALHS+GVPYVIYWK++FSCYAACHF QAL SVVQSSCSHTWDAFQLAHASF
Sbjct: 120  IPNGENFAEALHSRGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASF 179

Query: 2063 RLYCVRNNQVLPANNHKISGKLGPHLLGDPPKITIVPMVKEAGXXXXXXXXDLSGDLPAI 1884
            RLYCVRNN V+ +N+ K S KLGPHLLGDPPKI I     +              +LPAI
Sbjct: 180  RLYCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPE-----NLPAI 234

Query: 1883 KIYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAGT 1704
            KIYDDDV+MRFLVCGVPCTLD  LLG LEDGLNALLNIEIRGSKLHNR SAPPPPLQAG 
Sbjct: 235  KIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGA 294

Query: 1703 FSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCGE 1524
            FSRGVVTMRCDLSTCSSAHISLLVSGSAQTCF+DQLLE HIKNELIE SQLVHA P+ G+
Sbjct: 295  FSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGD 354

Query: 1523 SKPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSAV 1344
            ++    EPRKS SIACGA++FEV MKV TWASQVLRQLAP++SYRS V LGIASIQG +V
Sbjct: 355  NRLPPSEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSV 414

Query: 1343 ASFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGCSM 1164
            ASF+KDD +RLLFFCTR+ +  + +N++ + PP WL  PAPSRKRSEPC+E++ V     
Sbjct: 415  ASFEKDDAERLLFFCTRQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGV----- 469

Query: 1163 VGEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGGTT 984
                        E  N  N            VR KL  AAMRPIPHTRHHKMLPF  G +
Sbjct: 470  ------------ESENVCN------------VRPKLNAAAMRPIPHTRHHKMLPF-SGFS 504

Query: 983  EVETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGCGR 804
            E+E Y+G Q K NL V P  KH+  GP PVTHRKS SSS+QA+QIISLNPLPLKKHGCGR
Sbjct: 505  EIERYDGDQVKANLPVAP-LKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGR 563

Query: 803  CPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRGG 624
             PIQ+CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDL+NLYREVVSRGG
Sbjct: 564  APIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGG 623

Query: 623  FHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCH 444
            FHVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCH
Sbjct: 624  FHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCH 683

Query: 443  SSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKAAN 264
            SSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+CP CS+TNF++K QK +N
Sbjct: 684  SSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSN 743

Query: 263  GY 258
            GY
Sbjct: 744  GY 745


>ref|XP_006474564.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
            [Citrus sinensis]
          Length = 745

 Score = 1155 bits (2988), Expect = 0.0
 Identities = 578/782 (73%), Positives = 639/782 (81%), Gaps = 1/782 (0%)
 Frame = -2

Query: 2600 MMFHVQGPTKPMCGLLAVLCEA-PNSKQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPT 2424
            MMFH Q  ++  C LLAVL     + KQKQ  ++D P  Y FPE+ SSGRLEV  L SP+
Sbjct: 1    MMFHAQSSSRNHCSLLAVLSRKFVDDKQKQAATDDKPK-YPFPEIASSGRLEVHLLSSPS 59

Query: 2423 TDEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLE 2244
            TDEFRR++E SEPNIVYLQGE++ + +EIGSLVWG VDLS PEA+ GLFGSTLPTTVYLE
Sbjct: 60   TDEFRRLLESSEPNIVYLQGEKINDSEEIGSLVWGDVDLSTPEALCGLFGSTLPTTVYLE 119

Query: 2243 IPNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASF 2064
            IPN E+ A ALHS+GVPYVIYWK++FSCYAACHF QAL SVVQSSCSHTWDAFQLAHASF
Sbjct: 120  IPNGENFAEALHSRGVPYVIYWKHSFSCYAACHFLQALLSVVQSSCSHTWDAFQLAHASF 179

Query: 2063 RLYCVRNNQVLPANNHKISGKLGPHLLGDPPKITIVPMVKEAGXXXXXXXXDLSGDLPAI 1884
            RLYCVRNN V+ +N+ K S KLGPHLLGDPPKI I     +              +LPAI
Sbjct: 180  RLYCVRNNIVMASNSQKGSSKLGPHLLGDPPKIDIALSEMDVQGEENSPE-----NLPAI 234

Query: 1883 KIYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAGT 1704
            KIYDDDV+MRFLVCGVPCTLD  LLG LEDGLNALLNIEIRGSKLHNR SAPPPPLQAG 
Sbjct: 235  KIYDDDVTMRFLVCGVPCTLDTSLLGPLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGA 294

Query: 1703 FSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCGE 1524
            FSRGVVTMRCDLSTCSSAHISLLVSGSAQTCF+DQLLE HIKNELIE SQLVHA P+ G+
Sbjct: 295  FSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALPNSGD 354

Query: 1523 SKPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSAV 1344
            ++    EPRKS SIACGA++FEV MKV TWASQVLRQLAP++SYRS V LGIASIQG +V
Sbjct: 355  NRLPPSEPRKSASIACGASVFEVSMKVSTWASQVLRQLAPDVSYRSLVMLGIASIQGLSV 414

Query: 1343 ASFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGCSM 1164
            ASF+KDD +RLLFFCTR+ +  + +N++ + PP WL  PAPSRKRSEPC+E++ V     
Sbjct: 415  ASFEKDDAERLLFFCTRQGKADHTENSVLTRPPSWLTSPAPSRKRSEPCRESKGV----- 469

Query: 1163 VGEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGGTT 984
                        E  N  N            VR KL  AAMRPIPHTRH+KMLPF  G +
Sbjct: 470  ------------ESENVCN------------VRPKLNSAAMRPIPHTRHYKMLPF-SGFS 504

Query: 983  EVETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGCGR 804
            E+E Y+G Q K NL V P  KH+  GP PVTHRKS SSS+QA+QIISLNPLPLKKHGCGR
Sbjct: 505  EIERYDGDQVKANLPVAP-LKHSSAGPTPVTHRKSLSSSYQAQQIISLNPLPLKKHGCGR 563

Query: 803  CPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRGG 624
             PIQ+CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDL+NLYREVVSRGG
Sbjct: 564  APIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRGG 623

Query: 623  FHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCH 444
            FHVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCH
Sbjct: 624  FHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCH 683

Query: 443  SSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKAAN 264
            SSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+CP CS+TNF++K QK +N
Sbjct: 684  SSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPQCSVTNFKKKSQKTSN 743

Query: 263  GY 258
            GY
Sbjct: 744  GY 745


>ref|XP_002277324.1| PREDICTED: AT-rich interactive domain-containing protein 4 [Vitis
            vinifera] gi|297738501|emb|CBI27746.3| unnamed protein
            product [Vitis vinifera]
          Length = 739

 Score = 1144 bits (2959), Expect = 0.0
 Identities = 587/781 (75%), Positives = 636/781 (81%), Gaps = 1/781 (0%)
 Frame = -2

Query: 2597 MFHVQGPTKPMCGLLAVLC-EAPNSKQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPTT 2421
            MFHVQ  ++  C LLAV+C + P S+ +Q         Y FPELVSSGRLEVQ L +P+ 
Sbjct: 1    MFHVQAASRNHCALLAVVCGKIPVSEDQQQHP------YPFPELVSSGRLEVQILKNPSI 54

Query: 2420 DEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLEI 2241
             EF+R +E  EPN +YLQGEQLP  +EIGSL WGGVDLS+ EA+  LFG TLPTTVYLE 
Sbjct: 55   HEFQRSLESLEPNFLYLQGEQLPGSEEIGSLTWGGVDLSSAEALVELFGPTLPTTVYLET 114

Query: 2240 PNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFR 2061
            PN E LA ALHSKGV YVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFR
Sbjct: 115  PNGEKLAKALHSKGVSYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFR 174

Query: 2060 LYCVRNNQVLPANNHKISGKLGPHLLGDPPKITIVPMVKEAGXXXXXXXXDLSGDLPAIK 1881
            LYCV+NN V P+NN K+SGKLGP LLGDPPKI +VP   +           L   LP IK
Sbjct: 175  LYCVQNNTV-PSNNQKVSGKLGPCLLGDPPKINVVPPEVDE-------EESLPATLPVIK 226

Query: 1880 IYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAGTF 1701
            IYD DVSMRFLVCG P  LDACLLGSLEDGLNALL IEIRGSKLHNRVSAPPPPLQAGTF
Sbjct: 227  IYDADVSMRFLVCGAPSALDACLLGSLEDGLNALLCIEIRGSKLHNRVSAPPPPLQAGTF 286

Query: 1700 SRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCGES 1521
            SRGVVTMRCDLSTCSSAHISLLVSGSAQTC +DQLLE++IKNELIEKSQLVHA PSC ES
Sbjct: 287  SRGVVTMRCDLSTCSSAHISLLVSGSAQTCLNDQLLESYIKNELIEKSQLVHAVPSCEES 346

Query: 1520 KPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSAVA 1341
            K S  EPR+S SIACGA++FEV +KVPTWASQVLRQLAP++SYRS V+LGIASIQG +VA
Sbjct: 347  KLSSSEPRRSASIACGASVFEVRIKVPTWASQVLRQLAPDVSYRSLVTLGIASIQGLSVA 406

Query: 1340 SFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGCSMV 1161
            SF+KDD  RLLFFCTR  +  N  N++   PP WL  P  SRKRS PC ET+  +G  ++
Sbjct: 407  SFEKDDADRLLFFCTRHAKQLNQNNSILPRPPSWLIAPPASRKRSGPCHETKP-SGYKVL 465

Query: 1160 GEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGGTTE 981
            G                NG V         ++QK K+AAMRPIPHTR+HKMLPF  G +E
Sbjct: 466  GG--------------VNGGV---------LQQKPKIAAMRPIPHTRNHKMLPF-SGISE 501

Query: 980  VETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGCGRC 801
                +G QAK NLSVVP+ KHN  G  PVTHRK  SSSFQA+QIISLNPLPLKKHGCGR 
Sbjct: 502  ASRCDGDQAKGNLSVVPA-KHN--GTTPVTHRKLLSSSFQAQQIISLNPLPLKKHGCGRS 558

Query: 800  PIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGF 621
            PIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGF
Sbjct: 559  PIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGF 618

Query: 620  HVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHS 441
            HVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHS
Sbjct: 619  HVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHS 678

Query: 440  SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKAANG 261
            SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNF++K QK ANG
Sbjct: 679  SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFQKKSQKTANG 738

Query: 260  Y 258
            Y
Sbjct: 739  Y 739


>ref|XP_006381551.1| hypothetical protein POPTR_0006s13780g [Populus trichocarpa]
            gi|550336257|gb|ERP59348.1| hypothetical protein
            POPTR_0006s13780g [Populus trichocarpa]
          Length = 749

 Score = 1135 bits (2937), Expect = 0.0
 Identities = 572/783 (73%), Positives = 633/783 (80%), Gaps = 2/783 (0%)
 Frame = -2

Query: 2600 MMFHVQGPTKPMCGLLAVLC-EAPNSKQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPT 2424
            MMFH QGP +  C LLAVLC ++ ++KQKQ  S+D P  + FPEL S+GRLEVQ L +P+
Sbjct: 1    MMFHAQGPLRNHCTLLAVLCGKSGDNKQKQPLSDDKPR-FPFPELASAGRLEVQVLTNPS 59

Query: 2423 TDEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLE 2244
            TDEF+RV+   EP+IVY QGEQ+ + +EIG L WG +DLS PE++ GLFGSTLP TVYLE
Sbjct: 60   TDEFQRVLHSLEPSIVYFQGEQIEDSEEIGPLRWGDIDLSTPESLCGLFGSTLPPTVYLE 119

Query: 2243 IPNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASF 2064
            IPN E LA ALHSKGVPYVIYWK+ FSCYA  HFRQAL SVVQSSCSHT DAFQLA+ASF
Sbjct: 120  IPNGEKLAEALHSKGVPYVIYWKSMFSCYAVSHFRQALLSVVQSSCSHTCDAFQLAYASF 179

Query: 2063 RLYCVRNNQVLPANNHKISGKLGPHLLGDPPKITI-VPMVKEAGXXXXXXXXDLSGDLPA 1887
            RLYC RNN  L +N  K+ GK GP LLGDPPK  I +P   + G          SG LPA
Sbjct: 180  RLYCGRNNNTLASNGQKVGGKPGPQLLGDPPKFDITLPEADDQGEESS------SGALPA 233

Query: 1886 IKIYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAG 1707
            IKIYDDDV+MRFLVCG+ CTLDACLL SLEDGLNALLNIEIRGSKLHNR SAPPPPLQAG
Sbjct: 234  IKIYDDDVTMRFLVCGLSCTLDACLLESLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAG 293

Query: 1706 TFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCG 1527
            TFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCF+DQLLE HIKNELIE SQLVHA  S  
Sbjct: 294  TFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFNDQLLENHIKNELIENSQLVHALTSFE 353

Query: 1526 ESKPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSA 1347
            ESK    EPRKS SIACGA++FEV MKVPTWASQVLRQLAP++SYRS V LGIASIQG +
Sbjct: 354  ESKSPSSEPRKSASIACGASVFEVSMKVPTWASQVLRQLAPDVSYRSLVMLGIASIQGLS 413

Query: 1346 VASFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGCS 1167
            VASF+KDD  RLLFFC+ + ++ +P N   + PP WL PPAP RKRSEP +ET+ +    
Sbjct: 414  VASFEKDDADRLLFFCSEQGKESHPLNTFLTRPPTWLIPPAPCRKRSEPTRETKPLTS-- 471

Query: 1166 MVGEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGGT 987
                GR G+              +G N     V+ K  +AAMRPIPHT  HKMLPF  G 
Sbjct: 472  ----GRGGE--------------NGGN-----VKHKFHVAAMRPIPHTHRHKMLPF-SGF 507

Query: 986  TEVETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGCG 807
             + E Y+G QAK +L   P  KH++VGP PVTHRKS SSS+QA+QIISLNPLPLKKHGCG
Sbjct: 508  FDAERYDGEQAKPSLPP-PPPKHSVVGPAPVTHRKSLSSSYQAQQIISLNPLPLKKHGCG 566

Query: 806  RCPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRG 627
            R PIQ+CSEEEFLRDVMQFLILRGH+RLVPQGGLAEFPDAILNAKRLDL+NLYREVVSRG
Sbjct: 567  RSPIQVCSEEEFLRDVMQFLILRGHSRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRG 626

Query: 626  GFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC 447
            GFHVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC
Sbjct: 627  GFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC 686

Query: 446  HSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKAA 267
            HSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP+CSI NF++K QK  
Sbjct: 687  HSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPNCSIANFKKKSQKTT 746

Query: 266  NGY 258
            NGY
Sbjct: 747  NGY 749


>ref|XP_002516200.1| DNA binding protein, putative [Ricinus communis]
            gi|223544686|gb|EEF46202.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 749

 Score = 1124 bits (2907), Expect = 0.0
 Identities = 563/738 (76%), Positives = 610/738 (82%)
 Frame = -2

Query: 2471 LVSSGRLEVQTLISPTTDEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEA 2292
            L SSGRLEVQ L SP+TDEFRRV++ SEPNIVYLQGE + + +EIGSL W G DLS P+A
Sbjct: 43   LXSSGRLEVQILSSPSTDEFRRVLQSSEPNIVYLQGEIIEDSEEIGSLRWAGADLSTPDA 102

Query: 2291 ISGLFGSTLPTTVYLEIPNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQS 2112
            +  LFGSTLP TVYLEIPN E LA ALH KGVPYVIYWK+ FSCYAA HFRQAL SVVQS
Sbjct: 103  LCELFGSTLPPTVYLEIPNGEKLAEALHFKGVPYVIYWKSTFSCYAAAHFRQALLSVVQS 162

Query: 2111 SCSHTWDAFQLAHASFRLYCVRNNQVLPANNHKISGKLGPHLLGDPPKITIVPMVKEAGX 1932
            SCSHT DAFQLAHASF LYCVRNN  L +NN K+ GK GP LLG+PPKI I   + EA  
Sbjct: 163  SCSHTCDAFQLAHASFSLYCVRNNTGLSSNNQKVGGKPGPRLLGEPPKIDIT--LPEADV 220

Query: 1931 XXXXXXXDLSGDLPAIKIYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSK 1752
                     SG LPAIKIYDDDV+MRFLVC +P TLDACLLGSLEDGLNALLNIEIRGSK
Sbjct: 221  QDEESS---SGTLPAIKIYDDDVTMRFLVCELPSTLDACLLGSLEDGLNALLNIEIRGSK 277

Query: 1751 LHNRVSAPPPPLQAGTFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNE 1572
            LHNR SAPPPPLQAGTFSRGVVTMRCDLSTCSSAHISLLVSGSAQ CF+DQLLE HIKNE
Sbjct: 278  LHNRTSAPPPPLQAGTFSRGVVTMRCDLSTCSSAHISLLVSGSAQACFNDQLLENHIKNE 337

Query: 1571 LIEKSQLVHAFPSCGESKPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISY 1392
            LIE SQLVHA PS  ESK    EPRKS SI CGA++FEVC+KVP+WASQVLRQLAP++SY
Sbjct: 338  LIENSQLVHALPSSEESKLLTSEPRKSASIGCGASVFEVCLKVPSWASQVLRQLAPDVSY 397

Query: 1391 RSFVSLGIASIQGSAVASFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRK 1212
            RS V LGIASIQG +VASF+K+DT+RLLFFCTR+ ++  P N++   PP WL PPAPSRK
Sbjct: 398  RSLVMLGIASIQGLSVASFEKEDTERLLFFCTRQGKELYPNNSIIIKPPCWLIPPAPSRK 457

Query: 1211 RSEPCQETRSVNGCSMVGEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPI 1032
            RSEPC+ET+      +                +ENG           V+QKL +AAMRPI
Sbjct: 458  RSEPCRETKLFTSKGL---------------ERENGG---------SVKQKLNVAAMRPI 493

Query: 1031 PHTRHHKMLPFIGGTTEVETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQ 852
            PHTRHHKMLPF  G  E E Y+G Q K +L V P+ KH +VGP PV+HRKS SSS+QA+Q
Sbjct: 494  PHTRHHKMLPF-SGFAEGERYDGDQGKPSLPVAPA-KHGVVGPAPVSHRKSLSSSYQAQQ 551

Query: 851  IISLNPLPLKKHGCGRCPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAK 672
            IISLNPLPLKKHGCGR PIQ CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAK
Sbjct: 552  IISLNPLPLKKHGCGRAPIQACSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAK 611

Query: 671  RLDLYNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEY 492
            RLDL+NLYREVVSRGGFHVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLEY
Sbjct: 612  RLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEY 671

Query: 491  ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP 312
            ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP
Sbjct: 672  ELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICP 731

Query: 311  HCSITNFRRKPQKAANGY 258
            HCSI NFR+K QK ANGY
Sbjct: 732  HCSIANFRKKSQKTANGY 749


>ref|XP_007012520.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 1
            [Theobroma cacao] gi|590574848|ref|XP_007012521.1|
            ARID/BRIGHT DNA-binding domain-containing protein isoform
            1 [Theobroma cacao] gi|508782883|gb|EOY30139.1|
            ARID/BRIGHT DNA-binding domain-containing protein isoform
            1 [Theobroma cacao] gi|508782884|gb|EOY30140.1|
            ARID/BRIGHT DNA-binding domain-containing protein isoform
            1 [Theobroma cacao]
          Length = 746

 Score = 1122 bits (2902), Expect = 0.0
 Identities = 564/784 (71%), Positives = 628/784 (80%), Gaps = 3/784 (0%)
 Frame = -2

Query: 2600 MMFHVQGPTKPMCGLLAVLC--EAPNSKQKQDFSEDPPLGYSFPELVSSGRLEVQTLISP 2427
            MMF  QG ++  C LLAVL      ++KQKQ  S+D P  Y FPEL SSGRLEVQ L SP
Sbjct: 1    MMFSAQGSSRNHCSLLAVLSGGNVSDNKQKQPVSDDKPR-YPFPELASSGRLEVQLLNSP 59

Query: 2426 TTDEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYL 2247
              DE RRV+E +EPN+VYLQGEQ  + +EIG L+WG VDLS PE + GLF STLPTTVYL
Sbjct: 60   NIDELRRVLESTEPNVVYLQGEQNADSEEIGPLIWGDVDLSTPETLCGLFDSTLPTTVYL 119

Query: 2246 EIPNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHAS 2067
            E PN + LA ALHS+GVPYVIYWKN FS +AACHFRQAL SV+QSSCSHTWDAFQLAHAS
Sbjct: 120  ETPNGDKLAEALHSQGVPYVIYWKNTFSRFAACHFRQALLSVIQSSCSHTWDAFQLAHAS 179

Query: 2066 FRLYCVRNNQVLPANNHKISGKLGPHLLGDPPKITIV-PMVKEAGXXXXXXXXDLSGDLP 1890
            FRLYCVRNN V+ +N+ K S K GP LLG+ PKI +  P V   G            +LP
Sbjct: 180  FRLYCVRNNNVVSSNSQKQSVKPGPRLLGEAPKIDVSQPEVDMQGEESSPE------NLP 233

Query: 1889 AIKIYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQA 1710
            AIKIYDDDV++RFLVCG PC LDA LLGSLEDGLNALL+IEIRGSKLHNR SAPPPPLQA
Sbjct: 234  AIKIYDDDVTVRFLVCGSPCILDAFLLGSLEDGLNALLSIEIRGSKLHNRASAPPPPLQA 293

Query: 1709 GTFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSC 1530
            GTFSRGVVTMRCD STCSSAHISLLVSGSAQTCF+DQLLE HIKNE+IEKSQLVHA  S 
Sbjct: 294  GTFSRGVVTMRCDFSTCSSAHISLLVSGSAQTCFNDQLLENHIKNEIIEKSQLVHAQSSS 353

Query: 1529 GESKPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGS 1350
             ESK    EPR+S SIACGA++FEVCMKVPTWASQVLRQLAP++SYRS V LGIASIQG 
Sbjct: 354  EESKLPSSEPRRSASIACGASVFEVCMKVPTWASQVLRQLAPDVSYRSLVMLGIASIQGL 413

Query: 1349 AVASFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGC 1170
            +VASF+KDD +RLLFFC R+++D    +++ +  P WL PPAPSRKRSEPC++++ +N  
Sbjct: 414  SVASFEKDDAERLLFFCMRQDKDPLQDSSVIAISPSWLVPPAPSRKRSEPCKDSKPLNCT 473

Query: 1169 SMVGEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGG 990
             M GE  +                          R K  +AAMRPIPHT  HK++PF  G
Sbjct: 474  GMEGENGIA-------------------------RPKSNVAAMRPIPHTHRHKIIPF-SG 507

Query: 989  TTEVETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGC 810
             +E E Y+G Q K NL VVP     +  P PVTHRK+ SSS+QA+QIISLNPLPLKKHGC
Sbjct: 508  FSEAERYDGDQGKVNLPVVP-----VKQPAPVTHRKALSSSYQAQQIISLNPLPLKKHGC 562

Query: 809  GRCPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSR 630
            GR PIQ+CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDL+NLYREVVSR
Sbjct: 563  GRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSR 622

Query: 629  GGFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLL 450
            GGFHVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLL
Sbjct: 623  GGFHVGNGINWKGQVFSKMRNHTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLL 682

Query: 449  CHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKA 270
            CHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+CPHCSI+NF++KPQK 
Sbjct: 683  CHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPHCSISNFKKKPQKT 742

Query: 269  ANGY 258
             NGY
Sbjct: 743  VNGY 746


>ref|XP_002324130.2| arid/bright DNA-binding domain-containing family protein [Populus
            trichocarpa] gi|550318261|gb|EEF02695.2| arid/bright
            DNA-binding domain-containing family protein [Populus
            trichocarpa]
          Length = 746

 Score = 1120 bits (2898), Expect = 0.0
 Identities = 569/783 (72%), Positives = 629/783 (80%), Gaps = 2/783 (0%)
 Frame = -2

Query: 2600 MMFHVQGPTKPMCGLLAVLCEAPNSKQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPTT 2421
            MMFH QGP +  C LLAVLC   + +QK   S+D P  Y  PEL S+GRLEVQ L +P+T
Sbjct: 1    MMFHAQGPLRNHCTLLAVLC-GKSGEQKLPLSDDKPR-YPLPELESTGRLEVQVLNNPST 58

Query: 2420 DEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLEI 2241
            DEFR+V++  EP+IVY QGEQ+ + +EIGSL W  V LS PE++ GLFGSTLP TVYLE+
Sbjct: 59   DEFRQVLQSLEPSIVYFQGEQVEDREEIGSLRWADVGLSTPESLCGLFGSTLPPTVYLEM 118

Query: 2240 PNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFR 2061
            PN E LA ALHSKGVPYVIYWK+AFSCYAA HFRQAL SVVQSSCSHT DAFQLAHASFR
Sbjct: 119  PNGEKLAEALHSKGVPYVIYWKSAFSCYAASHFRQALLSVVQSSCSHTCDAFQLAHASFR 178

Query: 2060 LYCVRNNQVLPANNHKISGKLGPHLLGDPPKITI-VPMVKEAGXXXXXXXXDLSGDLPAI 1884
            LYCV+NN    +N+ K+ GK GP LLGDPPK  I +P   + G          SG LPAI
Sbjct: 179  LYCVQNNNTPASNSQKVGGKPGPRLLGDPPKFDISLPEADDQGEEGS------SGALPAI 232

Query: 1883 KIYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAGT 1704
            KIYDDDV+MRFLVCG+  TLDAC LGSLEDGLNALLNIEIRGSKLHNR SAPPPPLQAGT
Sbjct: 233  KIYDDDVTMRFLVCGLTGTLDACALGSLEDGLNALLNIEIRGSKLHNRTSAPPPPLQAGT 292

Query: 1703 FSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCGE 1524
            FSRGVVTMRCDLSTCSSAHISLLVSGSAQ CF+DQLLE HIK+ELIE SQLVHA  S  E
Sbjct: 293  FSRGVVTMRCDLSTCSSAHISLLVSGSAQNCFNDQLLENHIKSELIENSQLVHASTSSDE 352

Query: 1523 SKPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSAV 1344
             K    EPRKS SIACGA++FEV MKVPTWASQVLRQLAP+++YRS V LGIASIQG +V
Sbjct: 353  IKSPSSEPRKSASIACGASVFEVSMKVPTWASQVLRQLAPDVTYRSLVMLGIASIQGLSV 412

Query: 1343 ASFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVN-GCS 1167
            ASF+KDD  RLLFFCT++ +D +P+N + +  P WL PPAP RKR EP +ET+ +  GC 
Sbjct: 413  ASFEKDDADRLLFFCTKQSKDPHPRNPVLTRHPSWLIPPAPCRKRYEPSRETKPLTFGC- 471

Query: 1166 MVGEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGGT 987
                                G  +G N      +QKL +AAMRPIPHTR HKMLPF  G 
Sbjct: 472  --------------------GGENGGNF-----KQKLYVAAMRPIPHTRRHKMLPF-SGF 505

Query: 986  TEVETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGCG 807
             E E Y+G Q K +L   P  KH++VGP PVTHRKS S+S+QA+QIISLNPLPLKKHGCG
Sbjct: 506  LEAERYDGEQTKPSLP--PPPKHSVVGPAPVTHRKSLSNSYQAQQIISLNPLPLKKHGCG 563

Query: 806  RCPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRG 627
            R PIQ CSEEEFLRDVMQFLILRGH+RLVPQGGLAEFPDAILNAKRLDL+NLYREVVSRG
Sbjct: 564  RSPIQACSEEEFLRDVMQFLILRGHSRLVPQGGLAEFPDAILNAKRLDLFNLYREVVSRG 623

Query: 626  GFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC 447
            GFHVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC
Sbjct: 624  GFHVGNGINWKGQVFSKMRNHTLTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC 683

Query: 446  HSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKAA 267
            HSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSI NF++K QK A
Sbjct: 684  HSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSIANFKKKSQKNA 743

Query: 266  NGY 258
            NGY
Sbjct: 744  NGY 746


>ref|XP_006362097.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
            [Solanum tuberosum]
          Length = 770

 Score = 1108 bits (2865), Expect = 0.0
 Identities = 556/783 (71%), Positives = 630/783 (80%), Gaps = 2/783 (0%)
 Frame = -2

Query: 2597 MFHVQGPTKPMCGLLAVLCEAPNS-KQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPTT 2421
            MFH QG ++  C LLAVLC + +   QK+D  +  P  Y FPE+VSSGRLEVQ L +P+T
Sbjct: 1    MFHCQGTSRQSCSLLAVLCGSTSEYDQKKDVHDGKPR-YCFPEIVSSGRLEVQVLKNPST 59

Query: 2420 DEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLEI 2241
            DEF +V++  +PNIVYLQGE L ND E+GSLVWGG+DLS+ EAISGLF S LPT VYLE+
Sbjct: 60   DEFHKVLDSWQPNIVYLQGEHLSND-EVGSLVWGGLDLSSAEAISGLFSSALPTAVYLEL 118

Query: 2240 PNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFR 2061
            PN E LA ALH+KG+PYV+YWK+AFSCYAA HFR A   V QSS  H WDAFQLA ASFR
Sbjct: 119  PNGEKLAEALHAKGIPYVMYWKSAFSCYAASHFRHAFLCVAQSSTCHVWDAFQLAQASFR 178

Query: 2060 LYCVRNNQVLPANNHKISGKLGPHLLGDPPKITIVPMVKEAGXXXXXXXXDLSGDLPAIK 1881
            LYCV+NN VLP  + + S  +GPHLLGDPP I + P   EAG          S  LPAIK
Sbjct: 179  LYCVQNNFVLPEMSQRDSDNMGPHLLGDPPNIDVPP--PEAGPDDDEESN--SDALPAIK 234

Query: 1880 IYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAGTF 1701
            IYDDDV+MRFLVCG+PC+LD CLLGS+ DGLNALLNIE+RGSKLHNRVSA PPPLQAGTF
Sbjct: 235  IYDDDVTMRFLVCGLPCSLDECLLGSIADGLNALLNIEMRGSKLHNRVSALPPPLQAGTF 294

Query: 1700 SRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCGES 1521
            SRGVVTMRCDLST SSAHISLLVSGSAQTCFDD LLE HIK+E+IE S LVH  PS  E+
Sbjct: 295  SRGVVTMRCDLSTSSSAHISLLVSGSAQTCFDDLLLENHIKSEIIENSTLVHVLPSDEEN 354

Query: 1520 KPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSAVA 1341
            +P +  PR+S S+ACG+ +FEVCMKVP WASQVLRQLAP++SYRS V+LGIASIQG AVA
Sbjct: 355  RPPISAPRRSMSVACGSEVFEVCMKVPMWASQVLRQLAPDVSYRSLVALGIASIQGLAVA 414

Query: 1340 SFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGCSMV 1161
            SF+KDD +RLLFF T++ +D    N     PP WL+PPAPSRKRS+  Q      G S +
Sbjct: 415  SFEKDDAQRLLFFYTKQGKDGFFGNFKIGDPPAWLRPPAPSRKRSDFYQ------GASYI 468

Query: 1160 GE-GRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGGTT 984
             + G      +     KE+   +G   PLV  RQKLK+AAMRPIPH RH KMLPF    +
Sbjct: 469  CQNGSTPGNHVAVKEEKESRLGNGVATPLVTARQKLKVAAMRPIPHVRHQKMLPF-SRIS 527

Query: 983  EVETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGCGR 804
            E+++ +G Q KTNL ++PSTK + VG  PVTHRKS+SSS QAKQIISLNPLPLKKHGCGR
Sbjct: 528  ELDSLDGNQVKTNLPIIPSTKGSNVGVTPVTHRKSASSSHQAKQIISLNPLPLKKHGCGR 587

Query: 803  CPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRGG 624
             PI +CSEEEFL+DVMQFLILRGHTRL+PQGGLAEFPDAILNAKRLDL+NLYREVVSRGG
Sbjct: 588  SPIHVCSEEEFLKDVMQFLILRGHTRLIPQGGLAEFPDAILNAKRLDLFNLYREVVSRGG 647

Query: 623  FHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCH 444
            FHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC+
Sbjct: 648  FHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCN 707

Query: 443  SSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKAAN 264
            SSAAGDWVNCGICGEWAHFGCDRR GLGAFKDYAKTDGLEYICP CS+TNF++K  + AN
Sbjct: 708  SSAAGDWVNCGICGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVTNFKKKVLRTAN 767

Query: 263  GYS 255
            GYS
Sbjct: 768  GYS 770


>ref|XP_004252398.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
            [Solanum lycopersicum]
          Length = 771

 Score = 1100 bits (2844), Expect = 0.0
 Identities = 552/784 (70%), Positives = 628/784 (80%), Gaps = 3/784 (0%)
 Frame = -2

Query: 2597 MFHVQGPTKPMCGLLAVLCEAPNS-KQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPTT 2421
            MFH QG ++  C LLAVLC   +   QK+D  +  P  Y FPE+VSSGRLEVQ L +P+T
Sbjct: 1    MFHCQGASRQSCSLLAVLCGRTSEYDQKKDVHDGKPR-YCFPEIVSSGRLEVQVLKNPST 59

Query: 2420 DEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLEI 2241
            DEF +V++  +PNIVYLQGE L ND E+GSLVWGG+DLS+ EAISGLF S LPT VYLE+
Sbjct: 60   DEFHKVLDSWQPNIVYLQGEHLSND-EVGSLVWGGLDLSSAEAISGLFSSVLPTAVYLEL 118

Query: 2240 PNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFR 2061
            PN E LA ALH+KG+PYV+YWK+AFSCYAA HFR A   V QSS  H WDAFQLAHASFR
Sbjct: 119  PNGEKLAEALHAKGIPYVMYWKSAFSCYAASHFRHAFLCVAQSSTCHVWDAFQLAHASFR 178

Query: 2060 LYCVRNNQVLPANNHKISGKLGPHLLGDPPKITIVPMVKEAGXXXXXXXXDLSGDLPAIK 1881
            LYCVRNN  L   + + S  +GPHLLGDPP I +   + EAG          S  LPAIK
Sbjct: 179  LYCVRNNFALSEMSQRDSDNVGPHLLGDPPNIDVP--LPEAGPEDDEESN--SDALPAIK 234

Query: 1880 IYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAGTF 1701
            IYDDDV+MRFLVCG+PC+LD CLLGS+ DGLNALLNIE+RGSKLHNRVSA PPPLQAGTF
Sbjct: 235  IYDDDVTMRFLVCGLPCSLDECLLGSIADGLNALLNIEMRGSKLHNRVSALPPPLQAGTF 294

Query: 1700 SRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCGES 1521
            SRGVVTMRCDLST SSAHISLLVSGSAQTCFDD LLE HIK+E+IE S LVH  PS  E+
Sbjct: 295  SRGVVTMRCDLSTSSSAHISLLVSGSAQTCFDDLLLENHIKSEIIENSTLVHVLPSDEEN 354

Query: 1520 KPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSAVA 1341
            +P +  PR+S S+ACG+ +FEVCMKVP WASQVLRQLAP++SYRS V+LGIASIQG AVA
Sbjct: 355  RPPISAPRRSMSVACGSEVFEVCMKVPMWASQVLRQLAPDVSYRSLVALGIASIQGLAVA 414

Query: 1340 SFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGCSMV 1161
            SF+KDD +RLLFFCT++ +D    N    +PP WL+PPAPSRKRS+  Q      G S +
Sbjct: 415  SFEKDDAQRLLFFCTKQGKDGFFGNFKMGNPPAWLRPPAPSRKRSDFYQ------GASYI 468

Query: 1160 GEGRMGDAK-IDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGGTT 984
             +  +     +     KE+   +G   PLV  RQKLK+AAMRPIPH RH KMLPF    +
Sbjct: 469  CQNGLTPGNHVAVKEEKESRLGNGVATPLVTARQKLKVAAMRPIPHVRHQKMLPF-SRIS 527

Query: 983  EVETYNGGQAKTNLSVVPS-TKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGCG 807
            E+++ +G Q KTNL ++PS TK + VG  P THRKS+SSS QAKQIISLNPLPLKKHGCG
Sbjct: 528  ELDSLDGNQVKTNLPIIPSSTKGSNVGVTPATHRKSASSSHQAKQIISLNPLPLKKHGCG 587

Query: 806  RCPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRG 627
            R PI +CSEEEFL+DVMQFLILRGHTRL+PQ G+AEFPDAILNAKRLDL+NLYREVVSRG
Sbjct: 588  RSPIHVCSEEEFLKDVMQFLILRGHTRLIPQSGIAEFPDAILNAKRLDLFNLYREVVSRG 647

Query: 626  GFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC 447
            GFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC
Sbjct: 648  GFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC 707

Query: 446  HSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKAA 267
            +SSAAGDWVNCGICGEWAHFGCDRR GLGAFKDYAKTDGLEYICP CS+TNF++K  + A
Sbjct: 708  NSSAAGDWVNCGICGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVTNFKKKVLRTA 767

Query: 266  NGYS 255
            NGYS
Sbjct: 768  NGYS 771


>ref|XP_007217035.1| hypothetical protein PRUPE_ppa001668mg [Prunus persica]
            gi|462413185|gb|EMJ18234.1| hypothetical protein
            PRUPE_ppa001668mg [Prunus persica]
          Length = 783

 Score = 1097 bits (2836), Expect = 0.0
 Identities = 551/786 (70%), Positives = 627/786 (79%), Gaps = 1/786 (0%)
 Frame = -2

Query: 2597 MFHVQGPTKPMCGLLAVLCEAPNSKQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPTTD 2418
            M H QG +K  C LL V C   + ++  + + D  L Y FPELVS GRLEVQTL  P+ +
Sbjct: 1    MNHSQGASKQTCSLLVVTCGKISEEKPNEDTLDEKLKYPFPELVSLGRLEVQTLTKPSKE 60

Query: 2417 EFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLEIP 2238
            EF +++E  +PN+VYLQGEQL N+ EIGS VW  VDLS  EAIS +F +TLPTTVYLE+P
Sbjct: 61   EFCKMLESYKPNLVYLQGEQLENN-EIGSPVWEDVDLSTAEAISEIFSATLPTTVYLEVP 119

Query: 2237 NSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFRL 2058
            N E+LA ALHSKG+PYVIYWK+ FS YAACHFR AL SVVQSS +HTWDAFQLA+ASFRL
Sbjct: 120  NGENLAAALHSKGIPYVIYWKHEFSSYAACHFRHALLSVVQSSSTHTWDAFQLAYASFRL 179

Query: 2057 YCVRNNQVLPANNHKISG-KLGPHLLGDPPKITIVPMVKEAGXXXXXXXXDLSGDLPAIK 1881
            YCV N+  +PAN HK S  +LGP LLGD  KI + P   +             G LPAIK
Sbjct: 180  YCVENSHAIPANRHKSSSAELGPCLLGDRLKINVDPPEADVEEDEEGSL----GTLPAIK 235

Query: 1880 IYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAGTF 1701
            I+DDDV +RFLVCG P TLDA LL  LEDGLNALLNIE+RGSKLH + SAPPPPLQAGTF
Sbjct: 236  IHDDDVILRFLVCGEPSTLDASLLEPLEDGLNALLNIEMRGSKLHGKFSAPPPPLQAGTF 295

Query: 1700 SRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCGES 1521
            SRGVVTMRCD+STCSSAHISLLVSGSAQTCFDDQLLE HIKNE+IE+ QLV A P+   +
Sbjct: 296  SRGVVTMRCDVSTCSSAHISLLVSGSAQTCFDDQLLENHIKNEVIEEIQLVRALPNNEGN 355

Query: 1520 KPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSAVA 1341
            K  L EPRKS SIACGA +FEVCMKVP WASQVLRQLAP++SY S V+LGIASIQG  VA
Sbjct: 356  KVPLAEPRKSASIACGATVFEVCMKVPAWASQVLRQLAPDVSYHSLVALGIASIQGLPVA 415

Query: 1340 SFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGCSMV 1161
            SF+K+D +RLLFFC+   +D    + +  SPP WL+PP PSRKRS+PCQET   +  S  
Sbjct: 416  SFEKEDAERLLFFCSSLGKDNKSNDFILGSPPTWLRPPPPSRKRSQPCQETSRGSNYSQR 475

Query: 1160 GEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGGTTE 981
                +  +KIDED NKE G ++G + PL+P RQ+LK+AAMRPIPH R  KM PF  G +E
Sbjct: 476  LPS-LAASKIDED-NKEAGAMNGVSTPLLPPRQRLKIAAMRPIPHVRRPKMTPF-SGMSE 532

Query: 980  VETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGCGRC 801
            ++ ++GGQ K NL   P TK NIVG  P T RKS SSS  +KQIISLNPLPLKKHGCGR 
Sbjct: 533  LDGHDGGQFKANLPPAPPTKLNIVGLTPTTQRKSYSSSSHSKQIISLNPLPLKKHGCGRS 592

Query: 800  PIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGF 621
            PI  C EEEFL+DVMQFLILRGH+RL+PQGGLAEFPDAILN KRLDLYNLY+EVV+RGGF
Sbjct: 593  PIHSCLEEEFLKDVMQFLILRGHSRLIPQGGLAEFPDAILNGKRLDLYNLYKEVVTRGGF 652

Query: 620  HVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHS 441
            HVGNGINWKGQ+FSKMRN+T+TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHS
Sbjct: 653  HVGNGINWKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHS 712

Query: 440  SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKAANG 261
            SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSI+NF++KPQK ANG
Sbjct: 713  SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSISNFKKKPQKIANG 772

Query: 260  YS*GLT 243
            +S G T
Sbjct: 773  FSQGST 778


>ref|XP_007012522.1| ARID/BRIGHT DNA-binding domain-containing protein isoform 3, partial
            [Theobroma cacao] gi|508782885|gb|EOY30141.1| ARID/BRIGHT
            DNA-binding domain-containing protein isoform 3, partial
            [Theobroma cacao]
          Length = 708

 Score = 1086 bits (2809), Expect = 0.0
 Identities = 541/738 (73%), Positives = 600/738 (81%), Gaps = 1/738 (0%)
 Frame = -2

Query: 2471 LVSSGRLEVQTLISPTTDEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEA 2292
            L SSGRLEVQ L SP  DE RRV+E +EPN+VYLQGEQ  + +EIG L+WG VDLS PE 
Sbjct: 1    LASSGRLEVQLLNSPNIDELRRVLESTEPNVVYLQGEQNADSEEIGPLIWGDVDLSTPET 60

Query: 2291 ISGLFGSTLPTTVYLEIPNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQS 2112
            + GLF STLPTTVYLE PN + LA ALHS+GVPYVIYWKN FS +AACHFRQAL SV+QS
Sbjct: 61   LCGLFDSTLPTTVYLETPNGDKLAEALHSQGVPYVIYWKNTFSRFAACHFRQALLSVIQS 120

Query: 2111 SCSHTWDAFQLAHASFRLYCVRNNQVLPANNHKISGKLGPHLLGDPPKITIV-PMVKEAG 1935
            SCSHTWDAFQLAHASFRLYCVRNN V+ +N+ K S K GP LLG+ PKI +  P V   G
Sbjct: 121  SCSHTWDAFQLAHASFRLYCVRNNNVVSSNSQKQSVKPGPRLLGEAPKIDVSQPEVDMQG 180

Query: 1934 XXXXXXXXDLSGDLPAIKIYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGS 1755
                        +LPAIKIYDDDV++RFLVCG PC LDA LLGSLEDGLNALL+IEIRGS
Sbjct: 181  EESSPE------NLPAIKIYDDDVTVRFLVCGSPCILDAFLLGSLEDGLNALLSIEIRGS 234

Query: 1754 KLHNRVSAPPPPLQAGTFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKN 1575
            KLHNR SAPPPPLQAGTFSRGVVTMRCD STCSSAHISLLVSGSAQTCF+DQLLE HIKN
Sbjct: 235  KLHNRASAPPPPLQAGTFSRGVVTMRCDFSTCSSAHISLLVSGSAQTCFNDQLLENHIKN 294

Query: 1574 ELIEKSQLVHAFPSCGESKPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEIS 1395
            E+IEKSQLVHA  S  ESK    EPR+S SIACGA++FEVCMKVPTWASQVLRQLAP++S
Sbjct: 295  EIIEKSQLVHAQSSSEESKLPSSEPRRSASIACGASVFEVCMKVPTWASQVLRQLAPDVS 354

Query: 1394 YRSFVSLGIASIQGSAVASFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSR 1215
            YRS V LGIASIQG +VASF+KDD +RLLFFC R+++D    +++ +  P WL PPAPSR
Sbjct: 355  YRSLVMLGIASIQGLSVASFEKDDAERLLFFCMRQDKDPLQDSSVIAISPSWLVPPAPSR 414

Query: 1214 KRSEPCQETRSVNGCSMVGEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRP 1035
            KRSEPC++++ +N   M GE  +                          R K  +AAMRP
Sbjct: 415  KRSEPCKDSKPLNCTGMEGENGIA-------------------------RPKSNVAAMRP 449

Query: 1034 IPHTRHHKMLPFIGGTTEVETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAK 855
            IPHT  HK++PF  G +E E Y+G Q K NL VVP     +  P PVTHRK+ SSS+QA+
Sbjct: 450  IPHTHRHKIIPF-SGFSEAERYDGDQGKVNLPVVP-----VKQPAPVTHRKALSSSYQAQ 503

Query: 854  QIISLNPLPLKKHGCGRCPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNA 675
            QIISLNPLPLKKHGCGR PIQ+CSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNA
Sbjct: 504  QIISLNPLPLKKHGCGRAPIQVCSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNA 563

Query: 674  KRLDLYNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLE 495
            KRLDL+NLYREVVSRGGFHVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLE
Sbjct: 564  KRLDLFNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTMTNRMTGVGNTLKRHYETYLLE 623

Query: 494  YELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYIC 315
            YELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+C
Sbjct: 624  YELAHDDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVC 683

Query: 314  PHCSITNFRRKPQKAANG 261
            PHCSI+NF++KPQK  NG
Sbjct: 684  PHCSISNFKKKPQKTVNG 701


>gb|EXB64667.1| AT-rich interactive domain-containing protein 4 [Morus notabilis]
          Length = 779

 Score = 1085 bits (2807), Expect = 0.0
 Identities = 547/789 (69%), Positives = 632/789 (80%), Gaps = 4/789 (0%)
 Frame = -2

Query: 2597 MFHVQGPTKPMCGLLAVLC-EAPNSKQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPTT 2421
            MFH QG +K  C LLAV C     SK+K+D  E+  L Y FPEL+SSGRLEVQTL SP+ 
Sbjct: 1    MFHSQGSSKQTCSLLAVTCGNVSESKRKKDVPENRSL-YPFPELISSGRLEVQTLTSPSK 59

Query: 2420 DEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLEI 2241
            +EF +++E  +PN+VYLQGEQL ND E+G LVWG VDLS PE++S LFG+TLPTTVYLEI
Sbjct: 60   EEFSKLLESYKPNLVYLQGEQLAND-EVGPLVWGDVDLSTPESVSELFGTTLPTTVYLEI 118

Query: 2240 PNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFR 2061
            P+ E+LA  LHSKGVPYVIYWK+ FS +AACHFR AL SVV+SS +H WDAFQLA+ASFR
Sbjct: 119  PDCEELAEELHSKGVPYVIYWKDRFSRHAACHFRNALLSVVKSSSTHAWDAFQLAYASFR 178

Query: 2060 LYCVRNNQVLPANNHKISGKLGPHLLGDPPKITIVPMVKEAGXXXXXXXXDLSGDLPAIK 1881
            LYCVRNN VLP+  H+IS + GP LLGD  KI + P   +           L    PAIK
Sbjct: 179  LYCVRNNHVLPSKGHEISDEQGPCLLGDRLKINVDPPAADVEDDEDGSLDTL----PAIK 234

Query: 1880 IYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAGTF 1701
            I+DDD+S+RFLVCGVP TLD  +L  LEDGLNALLNIEIRG +LH + SAPPPPLQAGTF
Sbjct: 235  IHDDDLSLRFLVCGVPSTLDESVLEPLEDGLNALLNIEIRGGRLHGKFSAPPPPLQAGTF 294

Query: 1700 SRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCGE- 1524
            SRGVVTMRCDLSTCS AHIS+L+SGSAQTCFDDQLLE HIKNE+IE SQLV A P+  E 
Sbjct: 295  SRGVVTMRCDLSTCSCAHISILLSGSAQTCFDDQLLENHIKNEIIENSQLVRALPTASEG 354

Query: 1523 SKPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSAV 1344
            +K  L EPRKS SIACGA +FEVCMKVP WASQVLRQLAP++SY S V+LGIASIQG  V
Sbjct: 355  NKLPLSEPRKSASIACGATVFEVCMKVPAWASQVLRQLAPDVSYHSLVALGIASIQGIPV 414

Query: 1343 ASFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETR--SVNGC 1170
            ASF+K+D +RLLFFC+ + ++ +  + + S+PP WL+PPAPSRKRS   QET   S +G 
Sbjct: 415  ASFEKEDAERLLFFCSSQGKEIS-NDLVFSNPPPWLRPPAPSRKRS---QETSPGSHDGH 470

Query: 1169 SMVGEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGG 990
             +  +         E+ +KE GP +G ++PL+P RQ+LK+AAMRPIPH R  KM PF  G
Sbjct: 471  RVPNQV----VSKSEEEDKERGPSNGVSLPLLPARQRLKVAAMRPIPHVRRPKMTPF-SG 525

Query: 989  TTEVETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGC 810
             +E + ++GGQ K  + V P TK +IVG  P   RKS SSS QAKQIISLNPLPLKKHGC
Sbjct: 526  ISEADGHDGGQVKAIVPVAPPTKLSIVGLTPSAQRKSFSSSSQAKQIISLNPLPLKKHGC 585

Query: 809  GRCPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSR 630
            GR  I  CSEEEFL+DVMQFLILRGHTRL+PQ GLAEFPDAILN KRLDLYNLY+EVV+R
Sbjct: 586  GRSSIHTCSEEEFLKDVMQFLILRGHTRLIPQSGLAEFPDAILNGKRLDLYNLYKEVVTR 645

Query: 629  GGFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLL 450
            GGFHVGNGINWKGQ+FSKMRN+T+TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLL
Sbjct: 646  GGFHVGNGINWKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLL 705

Query: 449  CHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKA 270
            CHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCS++NF++K QK 
Sbjct: 706  CHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVSNFKKKSQKV 765

Query: 269  ANGYS*GLT 243
            +NG+S GLT
Sbjct: 766  SNGFSQGLT 774


>ref|XP_004303747.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
            [Fragaria vesca subsp. vesca]
          Length = 779

 Score = 1063 bits (2749), Expect = 0.0
 Identities = 541/788 (68%), Positives = 624/788 (79%), Gaps = 3/788 (0%)
 Frame = -2

Query: 2597 MFHVQGPTKPMCGLLAVLC-EAPNSKQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPTT 2421
            MFH QG     C +L V C E    K+ ++  ED  L Y FPELVSSGRLEVQTL +P+ 
Sbjct: 1    MFHAQGT----CSVLVVTCGEISEDKRGKETPEDK-LRYPFPELVSSGRLEVQTLTNPSE 55

Query: 2420 DEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLEI 2241
            +EF +++E  +PN+VYLQGEQL ND E+G LVW    LS  E++S +F +TLPTTVYLE+
Sbjct: 56   EEFCKLLESYKPNLVYLQGEQLEND-EVGPLVWRDAYLSTAESMSDIFDATLPTTVYLEV 114

Query: 2240 PNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFR 2061
            PN E+LA AL SKG+PYVIYWK+A S YAACHFR AL SVVQSS +HTWDAFQLAHASFR
Sbjct: 115  PNGEELAVALQSKGIPYVIYWKDAISTYAACHFRHALLSVVQSSSTHTWDAFQLAHASFR 174

Query: 2060 LYCVRNNQVLPANNHKISGKLGPHLLGDPPKITIVPMVKEAGXXXXXXXXDLSGDLPAIK 1881
            LYCV+N+ V+  N  K S +LGP +LG+  KI++ P   +            +G LPAIK
Sbjct: 175  LYCVQNDHVVRVNLDKPSAELGPCILGEHLKISVDPPEADM----EEDEEGATGSLPAIK 230

Query: 1880 IYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAGTF 1701
            I+DDDVS+RFLVCG P TLDA +L  LEDGLNALLNIE+RGSKLH + SAPPPPLQAGTF
Sbjct: 231  IHDDDVSLRFLVCGQPSTLDAGILEPLEDGLNALLNIEMRGSKLHGKFSAPPPPLQAGTF 290

Query: 1700 SRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCGES 1521
            SRGVVTMRCD+STCSSAHISLLVSGSAQTCFDDQLLE HIK+E+IE +QLVHA P+   +
Sbjct: 291  SRGVVTMRCDISTCSSAHISLLVSGSAQTCFDDQLLENHIKHEVIEINQLVHAVPNNDRN 350

Query: 1520 KPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSAVA 1341
            K  L EPRKS +IACGA +FEV MKVP WASQVLRQLAP++SYRS VSLGIASIQG  VA
Sbjct: 351  KLPLVEPRKSAAIACGATVFEVSMKVPVWASQVLRQLAPDVSYRSLVSLGIASIQGLPVA 410

Query: 1340 SFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGC-SM 1164
            SF+KDD  RLLFFC+ + +D    +   S+PP WL+PPAPS+KRS  CQE  ++ G  + 
Sbjct: 411  SFEKDDADRLLFFCSSRTKDSQLNDLFLSTPPAWLRPPAPSKKRSRLCQE--AIPGFRNR 468

Query: 1163 VGEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGGTT 984
             G   +  +K++E+  K  G V+G + PL+P RQ+LK AAMRPIPH R  KM PF  G +
Sbjct: 469  QGLPNLAASKVEENE-KALGAVNGFSTPLLPARQRLKTAAMRPIPHVRRPKMTPF-SGIS 526

Query: 983  EVETYNGGQA-KTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGCG 807
            EV  ++G Q  K +L  VP TK NIVG  P T RKS SSS QAKQIISLNPLPLKKHGCG
Sbjct: 527  EVNGHDGSQVVKAHLPPVPPTKLNIVGLTPTTQRKSYSSSSQAKQIISLNPLPLKKHGCG 586

Query: 806  RCPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRG 627
            R PI  C EEEFL+DVMQFLILRGH+RL+PQGGL EFPDAILN KRLDLYNLY+EVV+RG
Sbjct: 587  RGPIHSCLEEEFLKDVMQFLILRGHSRLIPQGGLTEFPDAILNGKRLDLYNLYKEVVTRG 646

Query: 626  GFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC 447
            GFHVGNGINWKGQ+FSKMRN+T+TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC
Sbjct: 647  GFHVGNGINWKGQIFSKMRNYTMTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLC 706

Query: 446  HSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKAA 267
            HSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSI+NF++KPQK  
Sbjct: 707  HSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSISNFKKKPQKVT 766

Query: 266  NGYS*GLT 243
            NG+  G T
Sbjct: 767  NGFPQGST 774


>ref|XP_003547888.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
            [Glycine max]
          Length = 782

 Score = 1059 bits (2738), Expect = 0.0
 Identities = 528/784 (67%), Positives = 617/784 (78%), Gaps = 2/784 (0%)
 Frame = -2

Query: 2594 FHVQGPTKPMCGLLAVLCEAPNSKQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPTTDE 2415
            FH QG  K  C LLAV C   +++ K   ++     Y FPELVS+GRLEVQTL SP  ++
Sbjct: 4    FHSQGTPKHTCTLLAVTCRTSSAEHKLSHAQRT---YPFPELVSAGRLEVQTLCSPEKEQ 60

Query: 2414 FRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLEIPN 2235
            FR+V+E  +PN VYL+G+QL N  E+GSLVW GV+LS  E I+ LFGSTLPT VYLEIPN
Sbjct: 61   FRKVLESFQPNFVYLRGDQLENG-EVGSLVWQGVELSTCEDITELFGSTLPTAVYLEIPN 119

Query: 2234 SEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFRLY 2055
             E  A ALH KG+PYVI+WKN FSCYAACHFRQA  SVVQSS +HTWDAF LA ASF LY
Sbjct: 120  GESFAEALHLKGIPYVIFWKNTFSCYAACHFRQAFLSVVQSSSTHTWDAFHLARASFELY 179

Query: 2054 CVRNNQVLPANNHKISGKLGPHLLGDPPKITIVPMVKEAGXXXXXXXXDLSGDLPAIKIY 1875
            CV+NNQVLP+++   S ++GPHLLGD  KI + P   +            SG LPAIKI+
Sbjct: 180  CVQNNQVLPSDSDDASSEMGPHLLGDCLKINVDPPEIDEEDDDESS----SGSLPAIKIH 235

Query: 1874 DDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAGTFSR 1695
            +D+V++RFL+CG P T+D  LL SLEDGL ALL IEIRG KLH + SAPPPPLQA  FSR
Sbjct: 236  EDEVNLRFLICGAPSTVDESLLRSLEDGLRALLTIEIRGCKLHGKFSAPPPPLQAAAFSR 295

Query: 1694 GVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCGESKP 1515
            GVVTMRCD+STCSSAHISLLVSGSAQTCF+DQLLE HIKNE+IEKSQLVHA  +   +K 
Sbjct: 296  GVVTMRCDISTCSSAHISLLVSGSAQTCFNDQLLENHIKNEIIEKSQLVHAQLNNEGNKE 355

Query: 1514 SLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSAVASF 1335
            ++CEPR+S SIACGA++FE+CMK+P WA Q+LRQLAPE+SYRS V+LGIASIQG  +ASF
Sbjct: 356  NICEPRRSASIACGASVFEICMKLPQWALQILRQLAPEVSYRSLVALGIASIQGLPIASF 415

Query: 1334 DKDDTKRLLFFCTRKERDF--NPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGCSMV 1161
            +KDD +RLLFF    E+D   N  N + SSPP WLKPP P+RKR EP QE  S      V
Sbjct: 416  EKDDAERLLFFYQNCEKDSCTNKNNIIFSSPPGWLKPPPPTRKRCEPRQEA-SPGLHEGV 474

Query: 1160 GEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGGTTE 981
              G+ G  K++E+  K+   V+G ++PL P RQ+LK++AMRPIPH R H+M PF G  +E
Sbjct: 475  FAGQGGVCKLNEEE-KDRKIVNGISMPLTPARQRLKVSAMRPIPHIRRHRMTPFCG-PSE 532

Query: 980  VETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGCGRC 801
             + ++G Q +  L +V  TK   +G    THRKS SS+ Q+KQ+ISLNPLPLKKHGCGR 
Sbjct: 533  TDGFDGTQVEAILPLVAPTKRTSIGSTSGTHRKSFSSAAQSKQVISLNPLPLKKHGCGRG 592

Query: 800  PIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGF 621
            P+Q CSEEEFL+DVM+FLILRGH RL+PQGGL EFPDAILN KRLDLYNLY+EVV+RGGF
Sbjct: 593  PVQTCSEEEFLKDVMEFLILRGHNRLIPQGGLTEFPDAILNGKRLDLYNLYKEVVTRGGF 652

Query: 620  HVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHS 441
            HVGNGINWKGQ+FSKMRN+T TNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHS
Sbjct: 653  HVGNGINWKGQIFSKMRNYTTTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHS 712

Query: 440  SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKAANG 261
            SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCS+TNF++K Q  ANG
Sbjct: 713  SAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSVTNFKKK-QNVANG 771

Query: 260  YS*G 249
            YS G
Sbjct: 772  YSQG 775


>ref|XP_003533805.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
            isoform X1 [Glycine max]
          Length = 752

 Score = 1045 bits (2702), Expect = 0.0
 Identities = 532/784 (67%), Positives = 599/784 (76%), Gaps = 3/784 (0%)
 Frame = -2

Query: 2600 MMFHVQGPTKPMCGLLAVLCEAPNS---KQKQDFSEDPPLGYSFPELVSSGRLEVQTLIS 2430
            MMFH QG ++  C LLAVL         KQKQ  + +    Y FPEL SSGRLEV+ LI 
Sbjct: 1    MMFHSQGVSRH-CSLLAVLSGKSRDIKQKQKQGNASEDQFPYPFPELSSSGRLEVKVLIE 59

Query: 2429 PTTDEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVY 2250
            PT DE    +E  +P+ VYLQG+QL +  EIG L W   DLS PEA+ GLF S LP TVY
Sbjct: 60   PTADELGLALEQLQPDFVYLQGQQLEDRGEIGPLGWEDFDLSVPEALCGLFSSKLPNTVY 119

Query: 2249 LEIPNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHA 2070
            LE P  E LA AL SKGVPY IYWKN FS YAA HFR +LFSV QS+ SHTWDAFQLA A
Sbjct: 120  LETPKGEKLAEALRSKGVPYTIYWKNDFSKYAASHFRHSLFSVAQSTSSHTWDAFQLALA 179

Query: 2069 SFRLYCVRNNQVLPANNHKISGKLGPHLLGDPPKITIVPMVKEAGXXXXXXXXDLSGDLP 1890
            SFRLYC+ NN VLP+N HK +GKLGP +LG PP I + P V +           +S    
Sbjct: 180  SFRLYCIHNN-VLPSNCHKGAGKLGPQILGVPPNIDVSPCVADMKEEEEDSPETIS---- 234

Query: 1889 AIKIYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQA 1710
            A+KIYDDDV+MRFL+CGVPCTLDACLLGSLEDGLNALL  EIRG KLHNR SA PPPLQA
Sbjct: 235  AVKIYDDDVNMRFLICGVPCTLDACLLGSLEDGLNALLFAEIRGCKLHNRTSATPPPLQA 294

Query: 1709 GTFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSC 1530
            GTFSRGVVTMRCD+STCSSAHISLLVSGSA TCF+DQLLE HIK ELIEKSQLV AFP+ 
Sbjct: 295  GTFSRGVVTMRCDISTCSSAHISLLVSGSADTCFNDQLLENHIKKELIEKSQLVQAFPNH 354

Query: 1529 GESKPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGS 1350
             +SK    EPR+S S+ACG+++FEVCM+VP WASQVLRQLAP +SYRS V LGIASIQG 
Sbjct: 355  EQSKAPSSEPRRSASVACGSSVFEVCMQVPAWASQVLRQLAPNLSYRSLVMLGIASIQGL 414

Query: 1349 AVASFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGC 1170
             VASF+KDD +RLLFFCTR+E++  P + + S  P WLKPP+ SRKRSEPC  ++S+N  
Sbjct: 415  PVASFNKDDAERLLFFCTRQEKENCPNDHVFSGIPSWLKPPSTSRKRSEPCSSSKSIN-- 472

Query: 1169 SMVGEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGG 990
                 GR  +A                   +   RQK  LA+MRPIPH+  HK+LPF  G
Sbjct: 473  ---DSGRGVEA-------------------IGSHRQKFNLASMRPIPHSNRHKILPF-SG 509

Query: 989  TTEVETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGC 810
             +E   Y+G   K+NL + P  KHN+ GP  VT+RKS S+SFQA QIISLNPLP+KKHGC
Sbjct: 510  LSEGTRYDGDHGKSNLPLAP-IKHNVSGPTSVTNRKSVSNSFQAHQIISLNPLPMKKHGC 568

Query: 809  GRCPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSR 630
             R PI+ CSEEEFLRDVMQFLILRGH RL+P GGLAEFPDAILNAKRLDL+NLYREVVSR
Sbjct: 569  DRAPIRACSEEEFLRDVMQFLILRGHNRLIPPGGLAEFPDAILNAKRLDLFNLYREVVSR 628

Query: 629  GGFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLL 450
            GGFHVGNGINWKGQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLEYEL+HDDVDGECCL+
Sbjct: 629  GGFHVGNGINWKGQVFSKMRNHTMTNRMTGVGNTLKRHYETYLLEYELSHDDVDGECCLM 688

Query: 449  CHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKA 270
            CHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+CP CS   F +K QK 
Sbjct: 689  CHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPRCSALKFSKKSQKT 748

Query: 269  ANGY 258
            ANG+
Sbjct: 749  ANGF 752


>ref|XP_006828651.1| hypothetical protein AMTR_s00129p00111730 [Amborella trichopoda]
            gi|548833441|gb|ERM96067.1| hypothetical protein
            AMTR_s00129p00111730 [Amborella trichopoda]
          Length = 810

 Score = 1042 bits (2695), Expect = 0.0
 Identities = 536/803 (66%), Positives = 619/803 (77%), Gaps = 26/803 (3%)
 Frame = -2

Query: 2573 KPMCGLLAVLCEAPNSKQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPTTDEFRRVVEI 2394
            K  C LL VLC   + K+KQ+ +ED P+ Y FPELVSSGRLEVQ + +P+++EF+RV+E 
Sbjct: 12   KQSCILLGVLCGKRSDKEKQENAEDRPV-YPFPELVSSGRLEVQIITNPSSEEFKRVLES 70

Query: 2393 SEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLEIPNSEDLANA 2214
            S+ + VYLQGEQ  +  E+G LV G V++S+ +AI+ LFGS LP+TVYLEIPN E LA A
Sbjct: 71   SDFDFVYLQGEQSLHKDEVGPLVLGDVNISSADAITRLFGSKLPSTVYLEIPNGEKLAEA 130

Query: 2213 LHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFRLYCVRNNQV 2034
            LHSKGVPYVIYW+++FSCYAACHFRQAL S +QSS  HTWD FQLA ASFRLYCVRNN  
Sbjct: 131  LHSKGVPYVIYWRHSFSCYAACHFRQALVSTLQSSSCHTWDVFQLAQASFRLYCVRNNHN 190

Query: 2033 LPANNHKISGKLGPHLLGDPPKITIVPMVKEAGXXXXXXXXDLSGDLPAIKIYDDDVSMR 1854
            L  N  K+SGKLGP LLG+ PKI + P++++ G             L AIKIYDD+VS+R
Sbjct: 191  LVLNGQKVSGKLGPRLLGEAPKI-LTPILQDTGESEGSP-----STLSAIKIYDDEVSLR 244

Query: 1853 FLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAGTFSRGVVTMRC 1674
            FLVCG PCTLDACLLGSLEDGLNALL+IEIRGSKLHNRVSA PPPL AGTFSRGV+TMRC
Sbjct: 245  FLVCGEPCTLDACLLGSLEDGLNALLSIEIRGSKLHNRVSALPPPLAAGTFSRGVITMRC 304

Query: 1673 DLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCGESKPSLCEPRK 1494
            DLSTCSSA +SLLVSGSAQTCFD+QLLE HIKNELIEKS LV A PSC ESKPSL  PRK
Sbjct: 305  DLSTCSSARLSLLVSGSAQTCFDEQLLECHIKNELIEKSPLVRALPSCEESKPSLSVPRK 364

Query: 1493 STSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSAVASFDKDDTKR 1314
            S  +ACGAA+FEV MKVP+WA+QVL QLAPEI YRS V+LGIASIQG+ VASF+K D  R
Sbjct: 365  SACVACGAAVFEVWMKVPSWAAQVLCQLAPEIPYRSLVTLGIASIQGTPVASFEKADADR 424

Query: 1313 LLFFCTRKERDFNP-----QNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGCSMVGEGR 1149
            LLFFCT++ +  +      +N++ S+P  WL+P  P + R      + +    S   +G 
Sbjct: 425  LLFFCTKQGKSSDILLQLFRNSL-STPANWLRPTPPRKIRLNLWSGSSNTTNTSNQVQGD 483

Query: 1148 MGDAKI-DEDHNKENGPVDGA----------------NIPLVPVRQKLKLAAMRPIPHTR 1020
                K+ DE +     P++                  N  ++P R+++ L A+RPIPH+R
Sbjct: 484  RKRIKLKDEKNTPPRSPIEQKVLQNVNEEEPKLKIEENGSILPTRKRMVLRALRPIPHSR 543

Query: 1019 HHKMLPFIGGTTEVETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISL 840
             HK+LPF G   E++ ++G   K +  VV S KHN     PV+HRK+ +SSFQA+QI+SL
Sbjct: 544  RHKLLPFTG-VPELDPHDGSPLKASGPVVASVKHNYGASAPVSHRKNLTSSFQAQQIVSL 602

Query: 839  NPLPLKKHGCGRCPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDL 660
            NPLPLKKHGC R PIQ CSEEEFLRDVMQFLILRGHTRLVP GGLAEFPDAILNAKRLDL
Sbjct: 603  NPLPLKKHGCSRGPIQECSEEEFLRDVMQFLILRGHTRLVPAGGLAEFPDAILNAKRLDL 662

Query: 659  YNLYREVVSRGGFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAH 480
            YNLYREVVSRGGF+VGNGINWKGQVFSKMRNHT TNRMTGVGNTLKRHYETYLLEYELAH
Sbjct: 663  YNLYREVVSRGGFNVGNGINWKGQVFSKMRNHTTTNRMTGVGNTLKRHYETYLLEYELAH 722

Query: 479  DDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSI 300
            DDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRR GLGAFKDYAKTDGLEYICP CS 
Sbjct: 723  DDVDGECCLLCHSSAAGDWVNCGICGEWAHFGCDRRLGLGAFKDYAKTDGLEYICPRCSA 782

Query: 299  TNFR----RKPQKAANGYS*GLT 243
            +NFR    RK QK  NGYS  LT
Sbjct: 783  SNFRGASARKTQKMGNGYSQALT 805


>gb|EYU21278.1| hypothetical protein MIMGU_mgv1a001736mg [Mimulus guttatus]
          Length = 767

 Score = 1038 bits (2685), Expect = 0.0
 Identities = 525/789 (66%), Positives = 617/789 (78%), Gaps = 8/789 (1%)
 Frame = -2

Query: 2597 MFHVQGPTKPMCGLLAVLCE-APNSKQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPTT 2421
            MFH QG  K  C LLAVLC  A  +K  Q+  ++ P  + FPE+VSSGRLEVQTL +PT 
Sbjct: 1    MFHTQGALKNTCNLLAVLCNRAAENKHSQNVLDERP-NFPFPEIVSSGRLEVQTLKNPTV 59

Query: 2420 DEFRRVVEISEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLEI 2241
            DEF +V++ S+ N+VYLQGE L NDK IGS+VWGG +LS+PEAI+GLF S LPTTVYLE+
Sbjct: 60   DEFSKVLDSSQANLVYLQGEHLENDK-IGSIVWGGFELSSPEAITGLFNSKLPTTVYLEV 118

Query: 2240 PNSEDLANALHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFR 2061
            PN E LA +LHSKG+PYVIYW N+FSCY A HFR ALFS +QSS  HTWD+F+LA ASFR
Sbjct: 119  PNGERLAKSLHSKGIPYVIYWNNSFSCYEASHFRHALFSSIQSSSCHTWDSFKLADASFR 178

Query: 2060 LYCVRNNQVLPANNHKISGKLGPHLLGDPPKITI-VPMVKE--AGXXXXXXXXDLSGDLP 1890
            L+C+R N +       ++ ++GP L+G+ PKIT+  P ++E              SG LP
Sbjct: 179  LHCLRGNNL-------VNDEVGPTLIGEAPKITVDAPEMEEDRVNDEDEDEESLSSGPLP 231

Query: 1889 AIKIYDDDVSMRFLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQA 1710
            AIKIYDDDV+ RFLVCG   +LDA LLGSLEDGLNALLNIE+RGSKLHNRVSA PPPLQA
Sbjct: 232  AIKIYDDDVNTRFLVCGRTTSLDASLLGSLEDGLNALLNIEMRGSKLHNRVSALPPPLQA 291

Query: 1709 GTFSRGVVTMRCDLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSC 1530
            G+FSRGVVTMRCDLST SSAHISLLVSGSAQTCFDDQLLE HIK+E+I+KS+L+ A P+ 
Sbjct: 292  GSFSRGVVTMRCDLSTTSSAHISLLVSGSAQTCFDDQLLENHIKSEIIDKSRLIQAMPNS 351

Query: 1529 GESKPSLCEPRKSTSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGS 1350
             E+KP L EPR+S SIACGA +FEVCMKVP+WA+QVLRQLAP+ISYRS V+LGIA IQG 
Sbjct: 352  DENKPPLSEPRRSVSIACGATVFEVCMKVPSWATQVLRQLAPDISYRSLVALGIAGIQGL 411

Query: 1349 AVASFDKDDTKRLLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSV--N 1176
            AVASF+K+D++RLLFFCT++E      +   ++PP WL+ P PSRKR    QE   V  N
Sbjct: 412  AVASFEKEDSERLLFFCTKQENISRSNDFKLTTPPSWLRAPPPSRKRPSIYQEIVPVTLN 471

Query: 1175 GCSMVGEGRMGDAKIDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPF- 999
            G S         + ++E++NKE    +G N  L   ++K+K+AA+RPIPH RH KMLPF 
Sbjct: 472  GLS---------SSVNENNNKEIKFSNGVNTSLSSAKRKIKIAALRPIPHVRHQKMLPFS 522

Query: 998  -IGGTTEVETYNGGQAKTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLK 822
             I         +G   K +L   P+ KH  V PV    RKS S S+QAKQ+ISLNPLPLK
Sbjct: 523  RIADFDLHHHLDGSYVKASLPSAPA-KHVSVTPVS---RKSGSGSYQAKQVISLNPLPLK 578

Query: 821  KHGCGRCPIQICSEEEFLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYRE 642
            KHGCGR P+ +CSEEEFL+DVMQFLILRGH RL+PQ G+ EFPDAILNAKRLDL+NLYRE
Sbjct: 579  KHGCGRSPLHVCSEEEFLKDVMQFLILRGHNRLIPQNGIDEFPDAILNAKRLDLFNLYRE 638

Query: 641  VVSRGGFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE 462
            VV+RGGFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE
Sbjct: 639  VVTRGGFHVGNGINWKGQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGE 698

Query: 461  CCLLCHSSAAGDWVNCGICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRK 282
            CCLLCHSSA GDWVNCG+CGEWAHFGCDRR GLGAFKDYAKTDGLEYICP CS++N+++K
Sbjct: 699  CCLLCHSSAPGDWVNCGLCGEWAHFGCDRRPGLGAFKDYAKTDGLEYICPQCSVSNYKKK 758

Query: 281  PQKAANGYS 255
              K+ NGYS
Sbjct: 759  IPKSGNGYS 767


>ref|XP_006587068.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
            isoform X3 [Glycine max]
          Length = 772

 Score = 1037 bits (2682), Expect = 0.0
 Identities = 526/772 (68%), Positives = 591/772 (76%), Gaps = 3/772 (0%)
 Frame = -2

Query: 2564 CGLLAVLCEAPNS---KQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPTTDEFRRVVEI 2394
            C LLAVL         KQKQ  + +    Y FPEL SSGRLEV+ LI PT DE    +E 
Sbjct: 32   CSLLAVLSGKSRDIKQKQKQGNASEDQFPYPFPELSSSGRLEVKVLIEPTADELGLALEQ 91

Query: 2393 SEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLEIPNSEDLANA 2214
             +P+ VYLQG+QL +  EIG L W   DLS PEA+ GLF S LP TVYLE P  E LA A
Sbjct: 92   LQPDFVYLQGQQLEDRGEIGPLGWEDFDLSVPEALCGLFSSKLPNTVYLETPKGEKLAEA 151

Query: 2213 LHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFRLYCVRNNQV 2034
            L SKGVPY IYWKN FS YAA HFR +LFSV QS+ SHTWDAFQLA ASFRLYC+ NN V
Sbjct: 152  LRSKGVPYTIYWKNDFSKYAASHFRHSLFSVAQSTSSHTWDAFQLALASFRLYCIHNN-V 210

Query: 2033 LPANNHKISGKLGPHLLGDPPKITIVPMVKEAGXXXXXXXXDLSGDLPAIKIYDDDVSMR 1854
            LP+N HK +GKLGP +LG PP I + P V +           +S    A+KIYDDDV+MR
Sbjct: 211  LPSNCHKGAGKLGPQILGVPPNIDVSPCVADMKEEEEDSPETIS----AVKIYDDDVNMR 266

Query: 1853 FLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAGTFSRGVVTMRC 1674
            FL+CGVPCTLDACLLGSLEDGLNALL  EIRG KLHNR SA PPPLQAGTFSRGVVTMRC
Sbjct: 267  FLICGVPCTLDACLLGSLEDGLNALLFAEIRGCKLHNRTSATPPPLQAGTFSRGVVTMRC 326

Query: 1673 DLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCGESKPSLCEPRK 1494
            D+STCSSAHISLLVSGSA TCF+DQLLE HIK ELIEKSQLV AFP+  +SK    EPR+
Sbjct: 327  DISTCSSAHISLLVSGSADTCFNDQLLENHIKKELIEKSQLVQAFPNHEQSKAPSSEPRR 386

Query: 1493 STSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSAVASFDKDDTKR 1314
            S S+ACG+++FEVCM+VP WASQVLRQLAP +SYRS V LGIASIQG  VASF+KDD +R
Sbjct: 387  SASVACGSSVFEVCMQVPAWASQVLRQLAPNLSYRSLVMLGIASIQGLPVASFNKDDAER 446

Query: 1313 LLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGCSMVGEGRMGDAK 1134
            LLFFCTR+E++  P + + S  P WLKPP+ SRKRSEPC  ++S+N       GR  +A 
Sbjct: 447  LLFFCTRQEKENCPNDHVFSGIPSWLKPPSTSRKRSEPCSSSKSIN-----DSGRGVEA- 500

Query: 1133 IDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGGTTEVETYNGGQA 954
                              +   RQK  LA+MRPIPH+  HK+LPF  G +E   Y+G   
Sbjct: 501  ------------------IGSHRQKFNLASMRPIPHSNRHKILPF-SGLSEGTRYDGDHG 541

Query: 953  KTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGCGRCPIQICSEEE 774
            K+NL + P  KHN+ GP  VT+RKS S+SFQA QIISLNPLP+KKHGC R PI+ CSEEE
Sbjct: 542  KSNLPLAP-IKHNVSGPTSVTNRKSVSNSFQAHQIISLNPLPMKKHGCDRAPIRACSEEE 600

Query: 773  FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGFHVGNGINWK 594
            FLRDVMQFLILRGH RL+P GGLAEFPDAILNAKRLDL+NLYREVVSRGGFHVGNGINWK
Sbjct: 601  FLRDVMQFLILRGHNRLIPPGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 660

Query: 593  GQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 414
            GQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLEYEL+HDDVDGECCL+CHSSAAGDWVNC
Sbjct: 661  GQVFSKMRNHTMTNRMTGVGNTLKRHYETYLLEYELSHDDVDGECCLMCHSSAAGDWVNC 720

Query: 413  GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKAANGY 258
            GICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+CP CS   F +K QK ANG+
Sbjct: 721  GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPRCSALKFSKKSQKTANGF 772


>ref|XP_006587067.1| PREDICTED: AT-rich interactive domain-containing protein 4-like
            isoform X2 [Glycine max]
          Length = 795

 Score = 1037 bits (2682), Expect = 0.0
 Identities = 526/772 (68%), Positives = 591/772 (76%), Gaps = 3/772 (0%)
 Frame = -2

Query: 2564 CGLLAVLCEAPNS---KQKQDFSEDPPLGYSFPELVSSGRLEVQTLISPTTDEFRRVVEI 2394
            C LLAVL         KQKQ  + +    Y FPEL SSGRLEV+ LI PT DE    +E 
Sbjct: 55   CSLLAVLSGKSRDIKQKQKQGNASEDQFPYPFPELSSSGRLEVKVLIEPTADELGLALEQ 114

Query: 2393 SEPNIVYLQGEQLPNDKEIGSLVWGGVDLSNPEAISGLFGSTLPTTVYLEIPNSEDLANA 2214
             +P+ VYLQG+QL +  EIG L W   DLS PEA+ GLF S LP TVYLE P  E LA A
Sbjct: 115  LQPDFVYLQGQQLEDRGEIGPLGWEDFDLSVPEALCGLFSSKLPNTVYLETPKGEKLAEA 174

Query: 2213 LHSKGVPYVIYWKNAFSCYAACHFRQALFSVVQSSCSHTWDAFQLAHASFRLYCVRNNQV 2034
            L SKGVPY IYWKN FS YAA HFR +LFSV QS+ SHTWDAFQLA ASFRLYC+ NN V
Sbjct: 175  LRSKGVPYTIYWKNDFSKYAASHFRHSLFSVAQSTSSHTWDAFQLALASFRLYCIHNN-V 233

Query: 2033 LPANNHKISGKLGPHLLGDPPKITIVPMVKEAGXXXXXXXXDLSGDLPAIKIYDDDVSMR 1854
            LP+N HK +GKLGP +LG PP I + P V +           +S    A+KIYDDDV+MR
Sbjct: 234  LPSNCHKGAGKLGPQILGVPPNIDVSPCVADMKEEEEDSPETIS----AVKIYDDDVNMR 289

Query: 1853 FLVCGVPCTLDACLLGSLEDGLNALLNIEIRGSKLHNRVSAPPPPLQAGTFSRGVVTMRC 1674
            FL+CGVPCTLDACLLGSLEDGLNALL  EIRG KLHNR SA PPPLQAGTFSRGVVTMRC
Sbjct: 290  FLICGVPCTLDACLLGSLEDGLNALLFAEIRGCKLHNRTSATPPPLQAGTFSRGVVTMRC 349

Query: 1673 DLSTCSSAHISLLVSGSAQTCFDDQLLETHIKNELIEKSQLVHAFPSCGESKPSLCEPRK 1494
            D+STCSSAHISLLVSGSA TCF+DQLLE HIK ELIEKSQLV AFP+  +SK    EPR+
Sbjct: 350  DISTCSSAHISLLVSGSADTCFNDQLLENHIKKELIEKSQLVQAFPNHEQSKAPSSEPRR 409

Query: 1493 STSIACGAAIFEVCMKVPTWASQVLRQLAPEISYRSFVSLGIASIQGSAVASFDKDDTKR 1314
            S S+ACG+++FEVCM+VP WASQVLRQLAP +SYRS V LGIASIQG  VASF+KDD +R
Sbjct: 410  SASVACGSSVFEVCMQVPAWASQVLRQLAPNLSYRSLVMLGIASIQGLPVASFNKDDAER 469

Query: 1313 LLFFCTRKERDFNPQNAMPSSPPVWLKPPAPSRKRSEPCQETRSVNGCSMVGEGRMGDAK 1134
            LLFFCTR+E++  P + + S  P WLKPP+ SRKRSEPC  ++S+N       GR  +A 
Sbjct: 470  LLFFCTRQEKENCPNDHVFSGIPSWLKPPSTSRKRSEPCSSSKSIN-----DSGRGVEA- 523

Query: 1133 IDEDHNKENGPVDGANIPLVPVRQKLKLAAMRPIPHTRHHKMLPFIGGTTEVETYNGGQA 954
                              +   RQK  LA+MRPIPH+  HK+LPF  G +E   Y+G   
Sbjct: 524  ------------------IGSHRQKFNLASMRPIPHSNRHKILPF-SGLSEGTRYDGDHG 564

Query: 953  KTNLSVVPSTKHNIVGPVPVTHRKSSSSSFQAKQIISLNPLPLKKHGCGRCPIQICSEEE 774
            K+NL + P  KHN+ GP  VT+RKS S+SFQA QIISLNPLP+KKHGC R PI+ CSEEE
Sbjct: 565  KSNLPLAP-IKHNVSGPTSVTNRKSVSNSFQAHQIISLNPLPMKKHGCDRAPIRACSEEE 623

Query: 773  FLRDVMQFLILRGHTRLVPQGGLAEFPDAILNAKRLDLYNLYREVVSRGGFHVGNGINWK 594
            FLRDVMQFLILRGH RL+P GGLAEFPDAILNAKRLDL+NLYREVVSRGGFHVGNGINWK
Sbjct: 624  FLRDVMQFLILRGHNRLIPPGGLAEFPDAILNAKRLDLFNLYREVVSRGGFHVGNGINWK 683

Query: 593  GQVFSKMRNHTVTNRMTGVGNTLKRHYETYLLEYELAHDDVDGECCLLCHSSAAGDWVNC 414
            GQVFSKMRNHT+TNRMTGVGNTLKRHYETYLLEYEL+HDDVDGECCL+CHSSAAGDWVNC
Sbjct: 684  GQVFSKMRNHTMTNRMTGVGNTLKRHYETYLLEYELSHDDVDGECCLMCHSSAAGDWVNC 743

Query: 413  GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYICPHCSITNFRRKPQKAANGY 258
            GICGEWAHFGCDRRQGLGAFKDYAKTDGLEY+CP CS   F +K QK ANG+
Sbjct: 744  GICGEWAHFGCDRRQGLGAFKDYAKTDGLEYVCPRCSALKFSKKSQKTANGF 795

