BLASTX nr result
ID: Cornus23_contig00010558
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00010558
         (1048 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_010652083.1| PREDICTED: uncharacterized protein LOC100259...  292  4e-76
emb|CAN67843.1| hypothetical protein VITISV_016666 [Vitis vinifera]  270  1e-69
ref|XP_012092343.1| PREDICTED: uncharacterized protein LOC105650...  258  7e-66
ref|XP_012092341.1| PREDICTED: uncharacterized protein LOC105650...  258  7e-66
ref|XP_011033644.1| PREDICTED: uncharacterized protein LOC105132...  254  6e-65
ref|XP_011033643.1| PREDICTED: uncharacterized protein LOC105132...  254  6e-65
ref|XP_002520708.1| conserved hypothetical protein [Ricinus comm...  241  5e-61
ref|XP_010244631.1| PREDICTED: uncharacterized protein LOC104588...  241  9e-61
ref|XP_010244630.1| PREDICTED: uncharacterized protein LOC104588...  241  9e-61
gb|KJB63070.1| hypothetical protein B456_009G451700, partial [Go...  237  1e-59
gb|KJB63069.1| hypothetical protein B456_009G451700 [Gossypium r...  237  1e-59
gb|KJB63067.1| hypothetical protein B456_009G451700 [Gossypium r...  237  1e-59
ref|XP_012444042.1| PREDICTED: uncharacterized protein LOC105768...  237  1e-59
ref|XP_010112707.1| hypothetical protein L484_020433 [Morus nota...  235  5e-59
gb|KHG24791.1| hypothetical protein F383_07105 [Gossypium arboreum]  234  7e-59
ref|XP_011658033.1| PREDICTED: uncharacterized protein LOC101207...  234  9e-59
ref|XP_011658036.1| PREDICTED: uncharacterized protein LOC101207...  234  9e-59
ref|XP_007020458.1| Sequence-specific DNA binding,sequence-speci...
234 1e-58 ref|XP_007020457.1| NDX1 homeobox protein, putative isoform 2 [T... 234 1e-58 ref|XP_007020456.1| Sequence-specific DNA binding,sequence-speci... 234 1e-58 >ref|XP_010652083.1| PREDICTED: uncharacterized protein LOC100259581 [Vitis vinifera] Length = 950 Score = 292 bits (747), Expect = 4e-76 Identities = 153/238 (64%), Positives = 179/238 (75%), Gaps = 2/238 (0%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M Q+TLIEKALVDEPDMQRNAALIQSW+DKLS +GPE++ SQLKNW Sbjct: 713 RKRKRTIMNDTQMTLIEKALVDEPDMQRNAALIQSWADKLSFHGPELTASQLKNWLNNRK 772 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHC-DSPASPVEDAYVPLTARG-IHQTGIAE 647 KDVR SE D+TFPDKQ SG+ DSP SP ED + P TARG HQ+ I Sbjct: 773 ARLARAAKDVRVASEVDSTFPDKQVGSGVGSLHDSPESPGEDFFAPSTARGGTHQSAIGG 832 Query: 646 STLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGS 467 S R + +EA AE ++I PAEFVR EPGQ VL+DG+G++IGKGKV+QVQGKWYG Sbjct: 833 SVSRAGA-DNAEAATAEFVDINPAEFVRREPGQYVVLLDGQGDDIGKGKVHQVQGKWYGK 891 Query: 466 NLEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQS 293 NLEES+ CVVDV+ELKAER RLPHP E TGT+FDEA TKLG+MRV WDSNKL +L+S Sbjct: 892 NLEESQTCVVDVMELKAERWSRLPHPSETTGTSFDEAETKLGVMRVSWDSNKLCILRS 949 >emb|CAN67843.1| hypothetical protein VITISV_016666 [Vitis vinifera] Length = 1134 Score = 270 bits (690), Expect = 1e-69 Identities = 142/222 (63%), Positives = 165/222 (74%), Gaps = 2/222 (0%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M Q+TLIEKALVDEPDMQRNAALIQSW+DKLS +GPE++ SQLKNW Sbjct: 818 RKRKRTIMNDTQMTLIEKALVDEPDMQRNAALIQSWADKLSFHGPELTASQLKNWLNNRK 877 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHC-DSPASPVEDAYVPLTARG-IHQTGIAE 647 KDVR SE D+TFPDKQ SG+ DSP SP ED + P TARG HQ+ I Sbjct: 878 ARLARAAKDVRVASEVDSTFPDKQVGSGVGSLHDSPESPGEDFFAPSTARGGTHQSAIGG 937 Query: 646 STLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGS 467 S R + +EA AE ++I PAEFVR EPGQ VL+DG+G++IGKGKV+QVQGKWYG Sbjct: 938 
SVSRAGA-DNAEAATAEFVDINPAEFVRREPGQYVVLLDGQGDDIGKGKVHQVQGKWYGK 996 Query: 466 NLEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLG 341 NLEES+ CVVDV+ELKAER RLPHP E TGT+FDEA TKLG Sbjct: 997 NLEESQTCVVDVMELKAERWSRLPHPSETTGTSFDEAETKLG 1038 >ref|XP_012092343.1| PREDICTED: uncharacterized protein LOC105650070 isoform X2 [Jatropha curcas] Length = 949 Score = 258 bits (658), Expect = 7e-66 Identities = 141/235 (60%), Positives = 167/235 (71%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M Q++LIEKALVDEPDMQRN+A IQ W+DKLS++G E++ SQLKNW Sbjct: 722 RKRKRTIMNDYQMSLIEKALVDEPDMQRNSASIQRWADKLSIHGSEVTFSQLKNWLNNRK 781 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAEST 641 KDVRAP E D+ KQG S +H DSP S ED P AR + ST Sbjct: 782 ARLARAGKDVRAPVEFDSAHSVKQGMSTHSH-DSPESRGEDN-APSGAR------LVPST 833 Query: 640 LRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSNL 461 R +E +E LAE + I AEFV+C+PGQ VLVD +GEEIGK KVYQVQGKWYG NL Sbjct: 834 SRIGTSENAETSLAEFVGIGAAEFVQCKPGQYVVLVDKQGEEIGKAKVYQVQGKWYGKNL 893 Query: 460 EESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296 EESE CVVDV ELKA+R +RLP+P EATGT+F EA TKLG+MRVLWDSNK+FM + Sbjct: 894 EESETCVVDVTELKADRWVRLPYPSEATGTSFSEAETKLGVMRVLWDSNKIFMFR 948 >ref|XP_012092341.1| PREDICTED: uncharacterized protein LOC105650070 isoform X1 [Jatropha curcas] gi|802794853|ref|XP_012092342.1| PREDICTED: uncharacterized protein LOC105650070 isoform X1 [Jatropha curcas] gi|643704475|gb|KDP21539.1| hypothetical protein JCGZ_22010 [Jatropha curcas] Length = 952 Score = 258 bits (658), Expect = 7e-66 Identities = 141/235 (60%), Positives = 167/235 (71%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M Q++LIEKALVDEPDMQRN+A IQ W+DKLS++G E++ SQLKNW Sbjct: 725 RKRKRTIMNDYQMSLIEKALVDEPDMQRNSASIQRWADKLSIHGSEVTFSQLKNWLNNRK 784 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAEST 641 KDVRAP E D+ KQG S +H DSP S ED P AR + ST Sbjct: 785 
ARLARAGKDVRAPVEFDSAHSVKQGMSTHSH-DSPESRGEDN-APSGAR------LVPST 836 Query: 640 LRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSNL 461 R +E +E LAE + I AEFV+C+PGQ VLVD +GEEIGK KVYQVQGKWYG NL Sbjct: 837 SRIGTSENAETSLAEFVGIGAAEFVQCKPGQYVVLVDKQGEEIGKAKVYQVQGKWYGKNL 896 Query: 460 EESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296 EESE CVVDV ELKA+R +RLP+P EATGT+F EA TKLG+MRVLWDSNK+FM + Sbjct: 897 EESETCVVDVTELKADRWVRLPYPSEATGTSFSEAETKLGVMRVLWDSNKIFMFR 951 >ref|XP_011033644.1| PREDICTED: uncharacterized protein LOC105132061 isoform X2 [Populus euphratica] Length = 807 Score = 254 bits (650), Expect = 6e-65 Identities = 133/233 (57%), Positives = 169/233 (72%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M QITL+EKAL+DEP+MQRNAA +QSW+DKLS+NG E++ SQLKNW Sbjct: 576 RKRKRTIMNDYQITLMEKALLDEPEMQRNAAALQSWADKLSLNGSEVTPSQLKNWLNNRK 635 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAEST 641 KDVRAP E DNTFP+KQ + D+P SP ED L+A+G+ T S Sbjct: 636 ARLARAGKDVRAPMEVDNTFPEKQ-VGQVQRQDTPESPSEDN-TTLSAKGLQNT----SE 689 Query: 640 LRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSNL 461 + + ++ LA+ ++I +EFV+C+PGQ VLVDG+GEEIGKGKVYQVQGKWYG L Sbjct: 690 IGVFGDPEAGIGLADFVDIGASEFVQCKPGQFVVLVDGQGEEIGKGKVYQVQGKWYGRIL 749 Query: 460 EESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFM 302 EESE+CVVDV ELK E+ +RLP+P E TG +F EA K+G+MRVLWDSNK++M Sbjct: 750 EESEMCVVDVTELKTEKWVRLPYPSETTGMSFYEAEQKIGVMRVLWDSNKIYM 802 >ref|XP_011033643.1| PREDICTED: uncharacterized protein LOC105132061 isoform X1 [Populus euphratica] Length = 955 Score = 254 bits (650), Expect = 6e-65 Identities = 133/233 (57%), Positives = 169/233 (72%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M QITL+EKAL+DEP+MQRNAA +QSW+DKLS+NG E++ SQLKNW Sbjct: 724 RKRKRTIMNDYQITLMEKALLDEPEMQRNAAALQSWADKLSLNGSEVTPSQLKNWLNNRK 783 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAEST 641 KDVRAP E 
DNTFP+KQ + D+P SP ED L+A+G+ T S Sbjct: 784 ARLARAGKDVRAPMEVDNTFPEKQ-VGQVQRQDTPESPSEDN-TTLSAKGLQNT----SE 837 Query: 640 LRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSNL 461 + + ++ LA+ ++I +EFV+C+PGQ VLVDG+GEEIGKGKVYQVQGKWYG L Sbjct: 838 IGVFGDPEAGIGLADFVDIGASEFVQCKPGQFVVLVDGQGEEIGKGKVYQVQGKWYGRIL 897 Query: 460 EESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFM 302 EESE+CVVDV ELK E+ +RLP+P E TG +F EA K+G+MRVLWDSNK++M Sbjct: 898 EESEMCVVDVTELKTEKWVRLPYPSETTGMSFYEAEQKIGVMRVLWDSNKIYM 950 >ref|XP_002520708.1| conserved hypothetical protein [Ricinus communis] gi|223540093|gb|EEF41670.1| conserved hypothetical protein [Ricinus communis] Length = 957 Score = 241 bits (616), Expect = 5e-61 Identities = 129/235 (54%), Positives = 158/235 (67%), Gaps = 2/235 (0%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M Q++LIE+ALVDEPDM RNAA +QSW+DKLS++G E+++SQLKNW Sbjct: 727 RKRKRTIMNEYQMSLIEEALVDEPDMHRNAASLQSWADKLSLHGSEVTSSQLKNWLNNRK 786 Query: 820 XXXXXXXK--DVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAE 647 DVR P E D+ +KQ + H + + VP AR Sbjct: 787 ARLARAGAGKDVRTPMEVDHALSEKQSVPALRHSHDSSESHGEVNVPAGAR--------L 838 Query: 646 STLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGS 467 ST R E +E LA+ I AE V+C+PGQ VLVD +G+EIGKGKVYQVQGKWYG Sbjct: 839 STARIGSAENAEISLAQFFGIDAAELVQCKPGQYVVLVDKQGDEIGKGKVYQVQGKWYGK 898 Query: 466 NLEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFM 302 +LEESE CVVDV ELKAER +RLP+P EATGT+F EA TKLG+MRVLWDSNK+FM Sbjct: 899 SLEESETCVVDVTELKAERWVRLPYPSEATGTSFSEAETKLGVMRVLWDSNKIFM 953 >ref|XP_010244631.1| PREDICTED: uncharacterized protein LOC104588414 isoform X2 [Nelumbo nucifera] Length = 916 Score = 241 bits (614), Expect = 9e-61 Identities = 129/250 (51%), Positives = 165/250 (66%), Gaps = 15/250 (6%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M QITLIE+AL+DEP+MQRNA L+QSW+DKLSV+G E+++SQLKNW Sbjct: 666 
RKRKRNIMNDTQITLIERALLDEPEMQRNATLLQSWADKLSVHGSELTSSQLKNWLNNRK 725 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAH-CDSPASPVEDAYVP--LTARGIHQT--G 656 ++ RAPSEGDNTFPDKQG SG A DSP SP ED YVP T G +Q+ Sbjct: 726 ARLARAAREARAPSEGDNTFPDKQGGSGQAQFYDSPESPSEDFYVPPSTTRAGSNQSTPK 785 Query: 655 IAESTLRTVVNEKSEAVLAELIEITPAE----------FVRCEPGQCAVLVDGKGEEIGK 506 TLRT E SE + ++ + + + EPGQ L+DG+G+E+G+ Sbjct: 786 FGGVTLRTGSGEASEMTPTDFVDFAAKQSMQMDCSSLGYAQYEPGQYVSLIDGEGKEVGR 845 Query: 505 GKVYQVQGKWYGSNLEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVL 326 G VYQV+G+W+G +L E+ C+VDV ELK ER RL HP EA GTTFDEA +K G+MRV Sbjct: 846 GNVYQVEGRWHGKSLAEAGTCIVDVHELKVERGTRLQHPVEAAGTTFDEAESKNGVMRVA 905 Query: 325 WDSNKLFMLQ 296 WD NK+ L+ Sbjct: 906 WDVNKILPLR 915 >ref|XP_010244630.1| PREDICTED: uncharacterized protein LOC104588414 isoform X1 [Nelumbo nucifera] Length = 991 Score = 241 bits (614), Expect = 9e-61 Identities = 129/250 (51%), Positives = 165/250 (66%), Gaps = 15/250 (6%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M QITLIE+AL+DEP+MQRNA L+QSW+DKLSV+G E+++SQLKNW Sbjct: 741 RKRKRNIMNDTQITLIERALLDEPEMQRNATLLQSWADKLSVHGSELTSSQLKNWLNNRK 800 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAH-CDSPASPVEDAYVP--LTARGIHQT--G 656 ++ RAPSEGDNTFPDKQG SG A DSP SP ED YVP T G +Q+ Sbjct: 801 ARLARAAREARAPSEGDNTFPDKQGGSGQAQFYDSPESPSEDFYVPPSTTRAGSNQSTPK 860 Query: 655 IAESTLRTVVNEKSEAVLAELIEITPAE----------FVRCEPGQCAVLVDGKGEEIGK 506 TLRT E SE + ++ + + + EPGQ L+DG+G+E+G+ Sbjct: 861 FGGVTLRTGSGEASEMTPTDFVDFAAKQSMQMDCSSLGYAQYEPGQYVSLIDGEGKEVGR 920 Query: 505 GKVYQVQGKWYGSNLEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVL 326 G VYQV+G+W+G +L E+ C+VDV ELK ER RL HP EA GTTFDEA +K G+MRV Sbjct: 921 GNVYQVEGRWHGKSLAEAGTCIVDVHELKVERGTRLQHPVEAAGTTFDEAESKNGVMRVA 980 Query: 325 WDSNKLFMLQ 296 WD NK+ L+ Sbjct: 981 WDVNKILPLR 990 >gb|KJB63070.1| hypothetical protein B456_009G451700, partial [Gossypium raimondii] Length = 913 Score = 237 bits (604), Expect = 1e-59 Identities = 121/236 (51%), 
Positives = 162/236 (68%), Gaps = 1/236 (0%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M EQ+T++E+AL+DEP+MQRN ALIQSW+DKLS +G E++ SQL+NW Sbjct: 685 RKRKRTIMNDEQVTIMERALLDEPEMQRNTALIQSWADKLSHHGSEVTCSQLRNWLNNRK 744 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHC-DSPASPVEDAYVPLTARGIHQTGIAES 644 KD R P E DN F KQG H +P SP ++ P RG Sbjct: 745 ARLARLSKDARPPPEPDNAFAGKQGGPQQGHSLRAPDSPGQET-TPSNTRGTRSM----- 798 Query: 643 TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464 + +N V E ++ AEFV+C+PGQ VLVDG+G+EIGKGKV+QVQGKW+G + Sbjct: 799 ---SRMNTSENPVAPEFVDYGAAEFVQCKPGQFIVLVDGRGQEIGKGKVHQVQGKWWGKS 855 Query: 463 LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296 LEES CVVDV++LKA+R ++LP+P E+TGT+F++A KLG+MRV+WDSNK+FML+ Sbjct: 856 LEESGTCVVDVVDLKADRWVKLPYPSESTGTSFEDAEKKLGVMRVMWDSNKIFMLR 911 >gb|KJB63069.1| hypothetical protein B456_009G451700 [Gossypium raimondii] Length = 750 Score = 237 bits (604), Expect = 1e-59 Identities = 121/236 (51%), Positives = 162/236 (68%), Gaps = 1/236 (0%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M EQ+T++E+AL+DEP+MQRN ALIQSW+DKLS +G E++ SQL+NW Sbjct: 522 RKRKRTIMNDEQVTIMERALLDEPEMQRNTALIQSWADKLSHHGSEVTCSQLRNWLNNRK 581 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHC-DSPASPVEDAYVPLTARGIHQTGIAES 644 KD R P E DN F KQG H +P SP ++ P RG Sbjct: 582 ARLARLSKDARPPPEPDNAFAGKQGGPQQGHSLRAPDSPGQET-TPSNTRGTRSM----- 635 Query: 643 TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464 + +N V E ++ AEFV+C+PGQ VLVDG+G+EIGKGKV+QVQGKW+G + Sbjct: 636 ---SRMNTSENPVAPEFVDYGAAEFVQCKPGQFIVLVDGRGQEIGKGKVHQVQGKWWGKS 692 Query: 463 LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296 LEES CVVDV++LKA+R ++LP+P E+TGT+F++A KLG+MRV+WDSNK+FML+ Sbjct: 693 LEESGTCVVDVVDLKADRWVKLPYPSESTGTSFEDAEKKLGVMRVMWDSNKIFMLR 748 >gb|KJB63067.1| hypothetical protein B456_009G451700 [Gossypium raimondii] Length = 894 Score = 237 bits (604), Expect = 1e-59 Identities = 
121/236 (51%), Positives = 162/236 (68%), Gaps = 1/236 (0%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M EQ+T++E+AL+DEP+MQRN ALIQSW+DKLS +G E++ SQL+NW Sbjct: 666 RKRKRTIMNDEQVTIMERALLDEPEMQRNTALIQSWADKLSHHGSEVTCSQLRNWLNNRK 725 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHC-DSPASPVEDAYVPLTARGIHQTGIAES 644 KD R P E DN F KQG H +P SP ++ P RG Sbjct: 726 ARLARLSKDARPPPEPDNAFAGKQGGPQQGHSLRAPDSPGQET-TPSNTRGTRSM----- 779 Query: 643 TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464 + +N V E ++ AEFV+C+PGQ VLVDG+G+EIGKGKV+QVQGKW+G + Sbjct: 780 ---SRMNTSENPVAPEFVDYGAAEFVQCKPGQFIVLVDGRGQEIGKGKVHQVQGKWWGKS 836 Query: 463 LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296 LEES CVVDV++LKA+R ++LP+P E+TGT+F++A KLG+MRV+WDSNK+FML+ Sbjct: 837 LEESGTCVVDVVDLKADRWVKLPYPSESTGTSFEDAEKKLGVMRVMWDSNKIFMLR 892 >ref|XP_012444042.1| PREDICTED: uncharacterized protein LOC105768587 [Gossypium raimondii] gi|823222646|ref|XP_012444043.1| PREDICTED: uncharacterized protein LOC105768587 [Gossypium raimondii] gi|763796069|gb|KJB63065.1| hypothetical protein B456_009G451700 [Gossypium raimondii] gi|763796070|gb|KJB63066.1| hypothetical protein B456_009G451700 [Gossypium raimondii] gi|763796072|gb|KJB63068.1| hypothetical protein B456_009G451700 [Gossypium raimondii] Length = 924 Score = 237 bits (604), Expect = 1e-59 Identities = 121/236 (51%), Positives = 162/236 (68%), Gaps = 1/236 (0%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M EQ+T++E+AL+DEP+MQRN ALIQSW+DKLS +G E++ SQL+NW Sbjct: 696 RKRKRTIMNDEQVTIMERALLDEPEMQRNTALIQSWADKLSHHGSEVTCSQLRNWLNNRK 755 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHC-DSPASPVEDAYVPLTARGIHQTGIAES 644 KD R P E DN F KQG H +P SP ++ P RG Sbjct: 756 ARLARLSKDARPPPEPDNAFAGKQGGPQQGHSLRAPDSPGQET-TPSNTRGTRSM----- 809 Query: 643 TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464 + +N V E ++ AEFV+C+PGQ VLVDG+G+EIGKGKV+QVQGKW+G + Sbjct: 810 
---SRMNTSENPVAPEFVDYGAAEFVQCKPGQFIVLVDGRGQEIGKGKVHQVQGKWWGKS 866 Query: 463 LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296 LEES CVVDV++LKA+R ++LP+P E+TGT+F++A KLG+MRV+WDSNK+FML+ Sbjct: 867 LEESGTCVVDVVDLKADRWVKLPYPSESTGTSFEDAEKKLGVMRVMWDSNKIFMLR 922 >ref|XP_010112707.1| hypothetical protein L484_020433 [Morus notabilis] gi|587948407|gb|EXC34665.1| hypothetical protein L484_020433 [Morus notabilis] Length = 965 Score = 235 bits (599), Expect = 5e-59 Identities = 127/236 (53%), Positives = 159/236 (67%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M +Q+ L+E+ALVDEPDMQRNA+LIQ+W+DKLS +G E+++SQLKNW Sbjct: 734 RKRKRTIMNDKQVELMERALVDEPDMQRNASLIQAWADKLSFHGSEVTSSQLKNWLNNRK 793 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAEST 641 KDVR E +N+F +KQG + SP SP EDA V Q T Sbjct: 794 ARLARTGKDVRPTLEAENSFLEKQGGPILRSNYSPESPGEDATVQPNVGRDPQA----MT 849 Query: 640 LRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSNL 461 RT E SE AE P+EFV+CEPGQ V+VD GEEI KGKV+QV GKWYG NL Sbjct: 850 WRTNAAETSEVAPAEAA-FGPSEFVQCEPGQQVVIVDAAGEEIAKGKVFQVHGKWYGKNL 908 Query: 460 EESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQS 293 +E CVVDV +LK +R RLPHP ATG +F+EA TK+G+MRVLWDS+K+F+L+S Sbjct: 909 DELRTCVVDVKDLKVKRGTRLPHPSVATGGSFEEAETKIGVMRVLWDSSKIFVLRS 964 >gb|KHG24791.1| hypothetical protein F383_07105 [Gossypium arboreum] Length = 924 Score = 234 bits (598), Expect = 7e-59 Identities = 120/236 (50%), Positives = 161/236 (68%), Gaps = 1/236 (0%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M EQ+T++E+AL+DEP+MQRN LIQSW+DKLS +G E++ SQL+NW Sbjct: 696 RKRKRTIMNDEQVTIMERALLDEPEMQRNTTLIQSWADKLSHHGSEVTCSQLRNWLNNRK 755 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHC-DSPASPVEDAYVPLTARGIHQTGIAES 644 KD R P E DN F KQG H +P SP ++ P RG Sbjct: 756 ARLARLSKDARPPPEPDNAFAGKQGGPQQGHSLRAPDSPGQET-TPSNTRGTRSM----- 809 Query: 643 TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464 
+ +N V E ++ AEFV+C+PGQ VLVDG+G+EIGKGKV+QVQGKW+G + Sbjct: 810 ---SRMNTSENPVAPEFVDYGAAEFVQCKPGQFIVLVDGRGQEIGKGKVHQVQGKWWGKS 866 Query: 463 LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296 LEES CVVDV++LKA+R ++LP+P E+TGT+F++A KLG+MRV+WDSNK+FML+ Sbjct: 867 LEESGSCVVDVVDLKADRWVKLPYPSESTGTSFEDAEKKLGVMRVMWDSNKIFMLR 922 >ref|XP_011658033.1| PREDICTED: uncharacterized protein LOC101207456 isoform X1 [Cucumis sativus] Length = 939 Score = 234 bits (597), Expect = 9e-59 Identities = 125/236 (52%), Positives = 153/236 (64%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M +QI++IE+AL+DEP+MQRN A IQ W+D+L G E+++SQLKNW Sbjct: 709 RKRKRTVMNEKQISVIERALLDEPEMQRNPASIQFWADELIRYGSEVASSQLKNWLNNRK 768 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAEST 641 +D RA E DN PDKQG CDSP SP ED +VP T R S Sbjct: 769 ARLARTARDSRATLEADNAIPDKQGGMTAGSCDSPDSPCEDKHVPNTGRD------RRSA 822 Query: 640 LRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSNL 461 RT S+ E + P EFV +PGQ +LVD GEEI KGKV+QV GKWYG NL Sbjct: 823 SRTNTANNSKNSTTEFNDSGPTEFVHFKPGQYVILVDVLGEEIAKGKVHQVHGKWYGRNL 882 Query: 460 EESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQS 293 EE E VVD+ ELKA++ LP+P EATGT+F EA TK+G+MRVLWD NK+FMLQS Sbjct: 883 EELETLVVDIDELKADKNTVLPYPYEATGTSFHEAETKIGVMRVLWDFNKIFMLQS 938 >ref|XP_011658036.1| PREDICTED: uncharacterized protein LOC101207456 isoform X2 [Cucumis sativus] gi|778661408|ref|XP_011658040.1| PREDICTED: uncharacterized protein LOC101207456 isoform X2 [Cucumis sativus] gi|700210602|gb|KGN65698.1| hypothetical protein Csa_1G502860 [Cucumis sativus] Length = 932 Score = 234 bits (597), Expect = 9e-59 Identities = 125/236 (52%), Positives = 153/236 (64%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M +QI++IE+AL+DEP+MQRN A IQ W+D+L G E+++SQLKNW Sbjct: 702 RKRKRTVMNEKQISVIERALLDEPEMQRNPASIQFWADELIRYGSEVASSQLKNWLNNRK 761 Query: 820 
XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAEST 641 +D RA E DN PDKQG CDSP SP ED +VP T R S Sbjct: 762 ARLARTARDSRATLEADNAIPDKQGGMTAGSCDSPDSPCEDKHVPNTGRD------RRSA 815 Query: 640 LRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSNL 461 RT S+ E + P EFV +PGQ +LVD GEEI KGKV+QV GKWYG NL Sbjct: 816 SRTNTANNSKNSTTEFNDSGPTEFVHFKPGQYVILVDVLGEEIAKGKVHQVHGKWYGRNL 875 Query: 460 EESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQS 293 EE E VVD+ ELKA++ LP+P EATGT+F EA TK+G+MRVLWD NK+FMLQS Sbjct: 876 EELETLVVDIDELKADKNTVLPYPYEATGTSFHEAETKIGVMRVLWDFNKIFMLQS 931 >ref|XP_007020458.1| Sequence-specific DNA binding,sequence-specific DNA binding transcription factors, putative isoform 3 [Theobroma cacao] gi|508720086|gb|EOY11983.1| Sequence-specific DNA binding,sequence-specific DNA binding transcription factors, putative isoform 3 [Theobroma cacao] Length = 874 Score = 234 bits (596), Expect = 1e-58 Identities = 124/236 (52%), Positives = 160/236 (67%), Gaps = 1/236 (0%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M EQ+T+IE+AL+DEP+MQRN A IQSW+DKL +G E++ SQL+NW Sbjct: 646 RKRKRTIMNDEQVTIIERALLDEPEMQRNTASIQSWADKLCHHGSEVTCSQLRNWLNNRK 705 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAH-CDSPASPVEDAYVPLTARGIHQTGIAES 644 KD R P E DN F KQG H +P S E+A P RG S Sbjct: 706 ARLARASKDARPPPEPDNAFAGKQGGPQPGHPFKAPDSSGEEA-APSNTRG------TRS 758 Query: 643 TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464 R +E EA E ++ AEFV+C+PGQ VLVDG+GEEIGKGKV+QVQGKW G + Sbjct: 759 MSRISTSENPEA--PEFVDFGAAEFVQCKPGQFVVLVDGRGEEIGKGKVHQVQGKWCGKS 816 Query: 463 LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296 LEES CVVD ++LKA++ ++LP+P EATGT+F+EA TK G+MRV+WDSNK+F+L+ Sbjct: 817 LEESGTCVVDAVDLKADKWVKLPYPSEATGTSFEEAETKFGVMRVMWDSNKIFLLR 872 >ref|XP_007020457.1| NDX1 homeobox protein, putative isoform 2 [Theobroma cacao] gi|508720085|gb|EOY11982.1| NDX1 homeobox protein, putative isoform 2 [Theobroma cacao] Length = 926 Score = 234 bits 
(596), Expect = 1e-58 Identities = 124/236 (52%), Positives = 160/236 (67%), Gaps = 1/236 (0%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M EQ+T+IE+AL+DEP+MQRN A IQSW+DKL +G E++ SQL+NW Sbjct: 698 RKRKRTIMNDEQVTIIERALLDEPEMQRNTASIQSWADKLCHHGSEVTCSQLRNWLNNRK 757 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAH-CDSPASPVEDAYVPLTARGIHQTGIAES 644 KD R P E DN F KQG H +P S E+A P RG S Sbjct: 758 ARLARASKDARPPPEPDNAFAGKQGGPQPGHPFKAPDSSGEEA-APSNTRG------TRS 810 Query: 643 TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464 R +E EA E ++ AEFV+C+PGQ VLVDG+GEEIGKGKV+QVQGKW G + Sbjct: 811 MSRISTSENPEA--PEFVDFGAAEFVQCKPGQFVVLVDGRGEEIGKGKVHQVQGKWCGKS 868 Query: 463 LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296 LEES CVVD ++LKA++ ++LP+P EATGT+F+EA TK G+MRV+WDSNK+F+L+ Sbjct: 869 LEESGTCVVDAVDLKADKWVKLPYPSEATGTSFEEAETKFGVMRVMWDSNKIFLLR 924 >ref|XP_007020456.1| Sequence-specific DNA binding,sequence-specific DNA binding transcription factors, putative isoform 1 [Theobroma cacao] gi|508720084|gb|EOY11981.1| Sequence-specific DNA binding,sequence-specific DNA binding transcription factors, putative isoform 1 [Theobroma cacao] Length = 1035 Score = 234 bits (596), Expect = 1e-58 Identities = 124/236 (52%), Positives = 160/236 (67%), Gaps = 1/236 (0%) Frame = -1 Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821 RKRKR++M EQ+T+IE+AL+DEP+MQRN A IQSW+DKL +G E++ SQL+NW Sbjct: 807 RKRKRTIMNDEQVTIIERALLDEPEMQRNTASIQSWADKLCHHGSEVTCSQLRNWLNNRK 866 Query: 820 XXXXXXXKDVRAPSEGDNTFPDKQGESGIAH-CDSPASPVEDAYVPLTARGIHQTGIAES 644 KD R P E DN F KQG H +P S E+A P RG S Sbjct: 867 ARLARASKDARPPPEPDNAFAGKQGGPQPGHPFKAPDSSGEEA-APSNTRG------TRS 919 Query: 643 TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464 R +E EA E ++ AEFV+C+PGQ VLVDG+GEEIGKGKV+QVQGKW G + Sbjct: 920 MSRISTSENPEA--PEFVDFGAAEFVQCKPGQFVVLVDGRGEEIGKGKVHQVQGKWCGKS 977 Query: 463 LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296 
LEES CVVD ++LKA++ ++LP+P EATGT+F+EA TK G+MRV+WDSNK+F+L+ Sbjct: 978 LEESGTCVVDAVDLKADKWVKLPYPSEATGTSFEEAETKFGVMRVMWDSNKIFLLR 1033