BLASTX nr result

ID: Cornus23_contig00010558 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cornus23_contig00010558
         (1048 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010652083.1| PREDICTED: uncharacterized protein LOC100259...   292   4e-76
emb|CAN67843.1| hypothetical protein VITISV_016666 [Vitis vinifera]   270   1e-69
ref|XP_012092343.1| PREDICTED: uncharacterized protein LOC105650...   258   7e-66
ref|XP_012092341.1| PREDICTED: uncharacterized protein LOC105650...   258   7e-66
ref|XP_011033644.1| PREDICTED: uncharacterized protein LOC105132...   254   6e-65
ref|XP_011033643.1| PREDICTED: uncharacterized protein LOC105132...   254   6e-65
ref|XP_002520708.1| conserved hypothetical protein [Ricinus comm...   241   5e-61
ref|XP_010244631.1| PREDICTED: uncharacterized protein LOC104588...   241   9e-61
ref|XP_010244630.1| PREDICTED: uncharacterized protein LOC104588...   241   9e-61
gb|KJB63070.1| hypothetical protein B456_009G451700, partial [Go...   237   1e-59
gb|KJB63069.1| hypothetical protein B456_009G451700 [Gossypium r...   237   1e-59
gb|KJB63067.1| hypothetical protein B456_009G451700 [Gossypium r...   237   1e-59
ref|XP_012444042.1| PREDICTED: uncharacterized protein LOC105768...   237   1e-59
ref|XP_010112707.1| hypothetical protein L484_020433 [Morus nota...   235   5e-59
gb|KHG24791.1| hypothetical protein F383_07105 [Gossypium arboreum]   234   7e-59
ref|XP_011658033.1| PREDICTED: uncharacterized protein LOC101207...   234   9e-59
ref|XP_011658036.1| PREDICTED: uncharacterized protein LOC101207...   234   9e-59
ref|XP_007020458.1| Sequence-specific DNA binding,sequence-speci...   234   1e-58
ref|XP_007020457.1| NDX1 homeobox protein, putative isoform 2 [T...   234   1e-58
ref|XP_007020456.1| Sequence-specific DNA binding,sequence-speci...   234   1e-58
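
The summary table above can be read programmatically. The following is a minimal sketch, assuming each row is "identifier, description, bit score, E-value" separated by whitespace as printed here; the names SUMMARY_RE and parse_summary_line are hypothetical and are not part of BLAST itself. Descriptions truncated with "..." in the report are kept as printed.

```python
import re

# Assumed row layout: <id> <description> <bit score> <E-value>
# Legacy BLAST may print very small E-values as "e-100" (no mantissa).
SUMMARY_RE = re.compile(
    r"^(\S+)\s+(.*?)\s+(\d+)\s+(\d*e-\d+|\d+(?:\.\d+)?)\s*$"
)

def parse_summary_line(line):
    """Split one summary row into (id, description, bits, evalue).

    Returns None if the line does not look like a summary row.
    """
    match = SUMMARY_RE.match(line)
    if match is None:
        return None
    seq_id, description, bits, evalue = match.groups()
    if evalue.startswith("e"):
        # "e-100" means 1e-100; add the implied mantissa.
        evalue = "1" + evalue
    return seq_id, description, int(bits), float(evalue)

row = ("ref|XP_010652083.1| PREDICTED: uncharacterized protein "
       "LOC100259...   292   4e-76")
hit = parse_summary_line(row)
```

For real pipelines, an established parser or BLAST's tabular or XML output is usually a safer choice than a regular expression over the human-readable report.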

>ref|XP_010652083.1| PREDICTED: uncharacterized protein LOC100259581 [Vitis vinifera]
          Length = 950

 Score =  292 bits (747), Expect = 4e-76
 Identities = 153/238 (64%), Positives = 179/238 (75%), Gaps = 2/238 (0%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M   Q+TLIEKALVDEPDMQRNAALIQSW+DKLS +GPE++ SQLKNW     
Sbjct: 713  RKRKRTIMNDTQMTLIEKALVDEPDMQRNAALIQSWADKLSFHGPELTASQLKNWLNNRK 772

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHC-DSPASPVEDAYVPLTARG-IHQTGIAE 647
                   KDVR  SE D+TFPDKQ  SG+    DSP SP ED + P TARG  HQ+ I  
Sbjct: 773  ARLARAAKDVRVASEVDSTFPDKQVGSGVGSLHDSPESPGEDFFAPSTARGGTHQSAIGG 832

Query: 646  STLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGS 467
            S  R    + +EA  AE ++I PAEFVR EPGQ  VL+DG+G++IGKGKV+QVQGKWYG 
Sbjct: 833  SVSRAGA-DNAEAATAEFVDINPAEFVRREPGQYVVLLDGQGDDIGKGKVHQVQGKWYGK 891

Query: 466  NLEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQS 293
            NLEES+ CVVDV+ELKAER  RLPHP E TGT+FDEA TKLG+MRV WDSNKL +L+S
Sbjct: 892  NLEESQTCVVDVMELKAERWSRLPHPSETTGTSFDEAETKLGVMRVSWDSNKLCILRS 949
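
The statistics of the alignment above can be re-derived from its counts and coordinates. This is a minimal sketch with hypothetical helper names. It assumes the printed percentages are truncated rather than rounded, which is an inference from this report (for example, 169/233 is printed as 72%) and not a documented rule. It also relies on Frame = -1 meaning the query was translated from the reverse strand, so query coordinates run downward (1000 to 293) and every three nucleotides correspond to one aligned residue.

```python
import re

STATS_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")

def parse_stats(line):
    """Return {name: (count, alignment_length, printed_percent)}."""
    return {
        name: (int(count), int(length), int(percent))
        for name, count, length, percent in STATS_RE.findall(line)
    }

def truncated_percent(count, length):
    """Integer percentage, truncated (assumed to match the report)."""
    return (100 * count) // length

def query_codons(start, end):
    """Codons spanned by query nucleotide coordinates on either strand."""
    return (abs(start - end) + 1) // 3

stats = parse_stats(
    " Identities = 153/238 (64%), Positives = 179/238 (75%), Gaps = 2/238 (0%)"
)
identity = truncated_percent(*stats["Identities"][:2])
codons = query_codons(1000, 293)
```

Here the query span of 708 nucleotides gives 236 translated residues, which is shorter than the alignment length of 238 because alignment columns containing gaps are also counted.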


>emb|CAN67843.1| hypothetical protein VITISV_016666 [Vitis vinifera]
          Length = 1134

 Score =  270 bits (690), Expect = 1e-69
 Identities = 142/222 (63%), Positives = 165/222 (74%), Gaps = 2/222 (0%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M   Q+TLIEKALVDEPDMQRNAALIQSW+DKLS +GPE++ SQLKNW     
Sbjct: 818  RKRKRTIMNDTQMTLIEKALVDEPDMQRNAALIQSWADKLSFHGPELTASQLKNWLNNRK 877

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHC-DSPASPVEDAYVPLTARG-IHQTGIAE 647
                   KDVR  SE D+TFPDKQ  SG+    DSP SP ED + P TARG  HQ+ I  
Sbjct: 878  ARLARAAKDVRVASEVDSTFPDKQVGSGVGSLHDSPESPGEDFFAPSTARGGTHQSAIGG 937

Query: 646  STLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGS 467
            S  R    + +EA  AE ++I PAEFVR EPGQ  VL+DG+G++IGKGKV+QVQGKWYG 
Sbjct: 938  SVSRAGA-DNAEAATAEFVDINPAEFVRREPGQYVVLLDGQGDDIGKGKVHQVQGKWYGK 996

Query: 466  NLEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLG 341
            NLEES+ CVVDV+ELKAER  RLPHP E TGT+FDEA TKLG
Sbjct: 997  NLEESQTCVVDVMELKAERWSRLPHPSETTGTSFDEAETKLG 1038


>ref|XP_012092343.1| PREDICTED: uncharacterized protein LOC105650070 isoform X2 [Jatropha
            curcas]
          Length = 949

 Score =  258 bits (658), Expect = 7e-66
 Identities = 141/235 (60%), Positives = 167/235 (71%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M   Q++LIEKALVDEPDMQRN+A IQ W+DKLS++G E++ SQLKNW     
Sbjct: 722  RKRKRTIMNDYQMSLIEKALVDEPDMQRNSASIQRWADKLSIHGSEVTFSQLKNWLNNRK 781

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAEST 641
                   KDVRAP E D+    KQG S  +H DSP S  ED   P  AR      +  ST
Sbjct: 782  ARLARAGKDVRAPVEFDSAHSVKQGMSTHSH-DSPESRGEDN-APSGAR------LVPST 833

Query: 640  LRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSNL 461
             R   +E +E  LAE + I  AEFV+C+PGQ  VLVD +GEEIGK KVYQVQGKWYG NL
Sbjct: 834  SRIGTSENAETSLAEFVGIGAAEFVQCKPGQYVVLVDKQGEEIGKAKVYQVQGKWYGKNL 893

Query: 460  EESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296
            EESE CVVDV ELKA+R +RLP+P EATGT+F EA TKLG+MRVLWDSNK+FM +
Sbjct: 894  EESETCVVDVTELKADRWVRLPYPSEATGTSFSEAETKLGVMRVLWDSNKIFMFR 948


>ref|XP_012092341.1| PREDICTED: uncharacterized protein LOC105650070 isoform X1 [Jatropha
            curcas] gi|802794853|ref|XP_012092342.1| PREDICTED:
            uncharacterized protein LOC105650070 isoform X1 [Jatropha
            curcas] gi|643704475|gb|KDP21539.1| hypothetical protein
            JCGZ_22010 [Jatropha curcas]
          Length = 952

 Score =  258 bits (658), Expect = 7e-66
 Identities = 141/235 (60%), Positives = 167/235 (71%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M   Q++LIEKALVDEPDMQRN+A IQ W+DKLS++G E++ SQLKNW     
Sbjct: 725  RKRKRTIMNDYQMSLIEKALVDEPDMQRNSASIQRWADKLSIHGSEVTFSQLKNWLNNRK 784

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAEST 641
                   KDVRAP E D+    KQG S  +H DSP S  ED   P  AR      +  ST
Sbjct: 785  ARLARAGKDVRAPVEFDSAHSVKQGMSTHSH-DSPESRGEDN-APSGAR------LVPST 836

Query: 640  LRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSNL 461
             R   +E +E  LAE + I  AEFV+C+PGQ  VLVD +GEEIGK KVYQVQGKWYG NL
Sbjct: 837  SRIGTSENAETSLAEFVGIGAAEFVQCKPGQYVVLVDKQGEEIGKAKVYQVQGKWYGKNL 896

Query: 460  EESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296
            EESE CVVDV ELKA+R +RLP+P EATGT+F EA TKLG+MRVLWDSNK+FM +
Sbjct: 897  EESETCVVDVTELKADRWVRLPYPSEATGTSFSEAETKLGVMRVLWDSNKIFMFR 951


>ref|XP_011033644.1| PREDICTED: uncharacterized protein LOC105132061 isoform X2 [Populus
            euphratica]
          Length = 807

 Score =  254 bits (650), Expect = 6e-65
 Identities = 133/233 (57%), Positives = 169/233 (72%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M   QITL+EKAL+DEP+MQRNAA +QSW+DKLS+NG E++ SQLKNW     
Sbjct: 576  RKRKRTIMNDYQITLMEKALLDEPEMQRNAAALQSWADKLSLNGSEVTPSQLKNWLNNRK 635

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAEST 641
                   KDVRAP E DNTFP+KQ    +   D+P SP ED    L+A+G+  T    S 
Sbjct: 636  ARLARAGKDVRAPMEVDNTFPEKQ-VGQVQRQDTPESPSEDN-TTLSAKGLQNT----SE 689

Query: 640  LRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSNL 461
            +    + ++   LA+ ++I  +EFV+C+PGQ  VLVDG+GEEIGKGKVYQVQGKWYG  L
Sbjct: 690  IGVFGDPEAGIGLADFVDIGASEFVQCKPGQFVVLVDGQGEEIGKGKVYQVQGKWYGRIL 749

Query: 460  EESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFM 302
            EESE+CVVDV ELK E+ +RLP+P E TG +F EA  K+G+MRVLWDSNK++M
Sbjct: 750  EESEMCVVDVTELKTEKWVRLPYPSETTGMSFYEAEQKIGVMRVLWDSNKIYM 802


>ref|XP_011033643.1| PREDICTED: uncharacterized protein LOC105132061 isoform X1 [Populus
            euphratica]
          Length = 955

 Score =  254 bits (650), Expect = 6e-65
 Identities = 133/233 (57%), Positives = 169/233 (72%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M   QITL+EKAL+DEP+MQRNAA +QSW+DKLS+NG E++ SQLKNW     
Sbjct: 724  RKRKRTIMNDYQITLMEKALLDEPEMQRNAAALQSWADKLSLNGSEVTPSQLKNWLNNRK 783

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAEST 641
                   KDVRAP E DNTFP+KQ    +   D+P SP ED    L+A+G+  T    S 
Sbjct: 784  ARLARAGKDVRAPMEVDNTFPEKQ-VGQVQRQDTPESPSEDN-TTLSAKGLQNT----SE 837

Query: 640  LRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSNL 461
            +    + ++   LA+ ++I  +EFV+C+PGQ  VLVDG+GEEIGKGKVYQVQGKWYG  L
Sbjct: 838  IGVFGDPEAGIGLADFVDIGASEFVQCKPGQFVVLVDGQGEEIGKGKVYQVQGKWYGRIL 897

Query: 460  EESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFM 302
            EESE+CVVDV ELK E+ +RLP+P E TG +F EA  K+G+MRVLWDSNK++M
Sbjct: 898  EESEMCVVDVTELKTEKWVRLPYPSETTGMSFYEAEQKIGVMRVLWDSNKIYM 950


>ref|XP_002520708.1| conserved hypothetical protein [Ricinus communis]
            gi|223540093|gb|EEF41670.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 957

 Score =  241 bits (616), Expect = 5e-61
 Identities = 129/235 (54%), Positives = 158/235 (67%), Gaps = 2/235 (0%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M   Q++LIE+ALVDEPDM RNAA +QSW+DKLS++G E+++SQLKNW     
Sbjct: 727  RKRKRTIMNEYQMSLIEEALVDEPDMHRNAASLQSWADKLSLHGSEVTSSQLKNWLNNRK 786

Query: 820  XXXXXXXK--DVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAE 647
                      DVR P E D+   +KQ    + H    +    +  VP  AR         
Sbjct: 787  ARLARAGAGKDVRTPMEVDHALSEKQSVPALRHSHDSSESHGEVNVPAGAR--------L 838

Query: 646  STLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGS 467
            ST R    E +E  LA+   I  AE V+C+PGQ  VLVD +G+EIGKGKVYQVQGKWYG 
Sbjct: 839  STARIGSAENAEISLAQFFGIDAAELVQCKPGQYVVLVDKQGDEIGKGKVYQVQGKWYGK 898

Query: 466  NLEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFM 302
            +LEESE CVVDV ELKAER +RLP+P EATGT+F EA TKLG+MRVLWDSNK+FM
Sbjct: 899  SLEESETCVVDVTELKAERWVRLPYPSEATGTSFSEAETKLGVMRVLWDSNKIFM 953


>ref|XP_010244631.1| PREDICTED: uncharacterized protein LOC104588414 isoform X2 [Nelumbo
            nucifera]
          Length = 916

 Score =  241 bits (614), Expect = 9e-61
 Identities = 129/250 (51%), Positives = 165/250 (66%), Gaps = 15/250 (6%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M   QITLIE+AL+DEP+MQRNA L+QSW+DKLSV+G E+++SQLKNW     
Sbjct: 666  RKRKRNIMNDTQITLIERALLDEPEMQRNATLLQSWADKLSVHGSELTSSQLKNWLNNRK 725

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAH-CDSPASPVEDAYVP--LTARGIHQT--G 656
                   ++ RAPSEGDNTFPDKQG SG A   DSP SP ED YVP   T  G +Q+   
Sbjct: 726  ARLARAAREARAPSEGDNTFPDKQGGSGQAQFYDSPESPSEDFYVPPSTTRAGSNQSTPK 785

Query: 655  IAESTLRTVVNEKSEAVLAELIEITPAE----------FVRCEPGQCAVLVDGKGEEIGK 506
                TLRT   E SE    + ++    +          + + EPGQ   L+DG+G+E+G+
Sbjct: 786  FGGVTLRTGSGEASEMTPTDFVDFAAKQSMQMDCSSLGYAQYEPGQYVSLIDGEGKEVGR 845

Query: 505  GKVYQVQGKWYGSNLEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVL 326
            G VYQV+G+W+G +L E+  C+VDV ELK ER  RL HP EA GTTFDEA +K G+MRV 
Sbjct: 846  GNVYQVEGRWHGKSLAEAGTCIVDVHELKVERGTRLQHPVEAAGTTFDEAESKNGVMRVA 905

Query: 325  WDSNKLFMLQ 296
            WD NK+  L+
Sbjct: 906  WDVNKILPLR 915


>ref|XP_010244630.1| PREDICTED: uncharacterized protein LOC104588414 isoform X1 [Nelumbo
            nucifera]
          Length = 991

 Score =  241 bits (614), Expect = 9e-61
 Identities = 129/250 (51%), Positives = 165/250 (66%), Gaps = 15/250 (6%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M   QITLIE+AL+DEP+MQRNA L+QSW+DKLSV+G E+++SQLKNW     
Sbjct: 741  RKRKRNIMNDTQITLIERALLDEPEMQRNATLLQSWADKLSVHGSELTSSQLKNWLNNRK 800

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAH-CDSPASPVEDAYVP--LTARGIHQT--G 656
                   ++ RAPSEGDNTFPDKQG SG A   DSP SP ED YVP   T  G +Q+   
Sbjct: 801  ARLARAAREARAPSEGDNTFPDKQGGSGQAQFYDSPESPSEDFYVPPSTTRAGSNQSTPK 860

Query: 655  IAESTLRTVVNEKSEAVLAELIEITPAE----------FVRCEPGQCAVLVDGKGEEIGK 506
                TLRT   E SE    + ++    +          + + EPGQ   L+DG+G+E+G+
Sbjct: 861  FGGVTLRTGSGEASEMTPTDFVDFAAKQSMQMDCSSLGYAQYEPGQYVSLIDGEGKEVGR 920

Query: 505  GKVYQVQGKWYGSNLEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVL 326
            G VYQV+G+W+G +L E+  C+VDV ELK ER  RL HP EA GTTFDEA +K G+MRV 
Sbjct: 921  GNVYQVEGRWHGKSLAEAGTCIVDVHELKVERGTRLQHPVEAAGTTFDEAESKNGVMRVA 980

Query: 325  WDSNKLFMLQ 296
            WD NK+  L+
Sbjct: 981  WDVNKILPLR 990


>gb|KJB63070.1| hypothetical protein B456_009G451700, partial [Gossypium raimondii]
          Length = 913

 Score =  237 bits (604), Expect = 1e-59
 Identities = 121/236 (51%), Positives = 162/236 (68%), Gaps = 1/236 (0%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M  EQ+T++E+AL+DEP+MQRN ALIQSW+DKLS +G E++ SQL+NW     
Sbjct: 685  RKRKRTIMNDEQVTIMERALLDEPEMQRNTALIQSWADKLSHHGSEVTCSQLRNWLNNRK 744

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHC-DSPASPVEDAYVPLTARGIHQTGIAES 644
                   KD R P E DN F  KQG     H   +P SP ++   P   RG         
Sbjct: 745  ARLARLSKDARPPPEPDNAFAGKQGGPQQGHSLRAPDSPGQET-TPSNTRGTRSM----- 798

Query: 643  TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464
               + +N     V  E ++   AEFV+C+PGQ  VLVDG+G+EIGKGKV+QVQGKW+G +
Sbjct: 799  ---SRMNTSENPVAPEFVDYGAAEFVQCKPGQFIVLVDGRGQEIGKGKVHQVQGKWWGKS 855

Query: 463  LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296
            LEES  CVVDV++LKA+R ++LP+P E+TGT+F++A  KLG+MRV+WDSNK+FML+
Sbjct: 856  LEESGTCVVDVVDLKADRWVKLPYPSESTGTSFEDAEKKLGVMRVMWDSNKIFMLR 911


>gb|KJB63069.1| hypothetical protein B456_009G451700 [Gossypium raimondii]
          Length = 750

 Score =  237 bits (604), Expect = 1e-59
 Identities = 121/236 (51%), Positives = 162/236 (68%), Gaps = 1/236 (0%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M  EQ+T++E+AL+DEP+MQRN ALIQSW+DKLS +G E++ SQL+NW     
Sbjct: 522  RKRKRTIMNDEQVTIMERALLDEPEMQRNTALIQSWADKLSHHGSEVTCSQLRNWLNNRK 581

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHC-DSPASPVEDAYVPLTARGIHQTGIAES 644
                   KD R P E DN F  KQG     H   +P SP ++   P   RG         
Sbjct: 582  ARLARLSKDARPPPEPDNAFAGKQGGPQQGHSLRAPDSPGQET-TPSNTRGTRSM----- 635

Query: 643  TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464
               + +N     V  E ++   AEFV+C+PGQ  VLVDG+G+EIGKGKV+QVQGKW+G +
Sbjct: 636  ---SRMNTSENPVAPEFVDYGAAEFVQCKPGQFIVLVDGRGQEIGKGKVHQVQGKWWGKS 692

Query: 463  LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296
            LEES  CVVDV++LKA+R ++LP+P E+TGT+F++A  KLG+MRV+WDSNK+FML+
Sbjct: 693  LEESGTCVVDVVDLKADRWVKLPYPSESTGTSFEDAEKKLGVMRVMWDSNKIFMLR 748


>gb|KJB63067.1| hypothetical protein B456_009G451700 [Gossypium raimondii]
          Length = 894

 Score =  237 bits (604), Expect = 1e-59
 Identities = 121/236 (51%), Positives = 162/236 (68%), Gaps = 1/236 (0%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M  EQ+T++E+AL+DEP+MQRN ALIQSW+DKLS +G E++ SQL+NW     
Sbjct: 666  RKRKRTIMNDEQVTIMERALLDEPEMQRNTALIQSWADKLSHHGSEVTCSQLRNWLNNRK 725

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHC-DSPASPVEDAYVPLTARGIHQTGIAES 644
                   KD R P E DN F  KQG     H   +P SP ++   P   RG         
Sbjct: 726  ARLARLSKDARPPPEPDNAFAGKQGGPQQGHSLRAPDSPGQET-TPSNTRGTRSM----- 779

Query: 643  TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464
               + +N     V  E ++   AEFV+C+PGQ  VLVDG+G+EIGKGKV+QVQGKW+G +
Sbjct: 780  ---SRMNTSENPVAPEFVDYGAAEFVQCKPGQFIVLVDGRGQEIGKGKVHQVQGKWWGKS 836

Query: 463  LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296
            LEES  CVVDV++LKA+R ++LP+P E+TGT+F++A  KLG+MRV+WDSNK+FML+
Sbjct: 837  LEESGTCVVDVVDLKADRWVKLPYPSESTGTSFEDAEKKLGVMRVMWDSNKIFMLR 892


>ref|XP_012444042.1| PREDICTED: uncharacterized protein LOC105768587 [Gossypium raimondii]
            gi|823222646|ref|XP_012444043.1| PREDICTED:
            uncharacterized protein LOC105768587 [Gossypium
            raimondii] gi|763796069|gb|KJB63065.1| hypothetical
            protein B456_009G451700 [Gossypium raimondii]
            gi|763796070|gb|KJB63066.1| hypothetical protein
            B456_009G451700 [Gossypium raimondii]
            gi|763796072|gb|KJB63068.1| hypothetical protein
            B456_009G451700 [Gossypium raimondii]
          Length = 924

 Score =  237 bits (604), Expect = 1e-59
 Identities = 121/236 (51%), Positives = 162/236 (68%), Gaps = 1/236 (0%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M  EQ+T++E+AL+DEP+MQRN ALIQSW+DKLS +G E++ SQL+NW     
Sbjct: 696  RKRKRTIMNDEQVTIMERALLDEPEMQRNTALIQSWADKLSHHGSEVTCSQLRNWLNNRK 755

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHC-DSPASPVEDAYVPLTARGIHQTGIAES 644
                   KD R P E DN F  KQG     H   +P SP ++   P   RG         
Sbjct: 756  ARLARLSKDARPPPEPDNAFAGKQGGPQQGHSLRAPDSPGQET-TPSNTRGTRSM----- 809

Query: 643  TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464
               + +N     V  E ++   AEFV+C+PGQ  VLVDG+G+EIGKGKV+QVQGKW+G +
Sbjct: 810  ---SRMNTSENPVAPEFVDYGAAEFVQCKPGQFIVLVDGRGQEIGKGKVHQVQGKWWGKS 866

Query: 463  LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296
            LEES  CVVDV++LKA+R ++LP+P E+TGT+F++A  KLG+MRV+WDSNK+FML+
Sbjct: 867  LEESGTCVVDVVDLKADRWVKLPYPSESTGTSFEDAEKKLGVMRVMWDSNKIFMLR 922


>ref|XP_010112707.1| hypothetical protein L484_020433 [Morus notabilis]
            gi|587948407|gb|EXC34665.1| hypothetical protein
            L484_020433 [Morus notabilis]
          Length = 965

 Score =  235 bits (599), Expect = 5e-59
 Identities = 127/236 (53%), Positives = 159/236 (67%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M  +Q+ L+E+ALVDEPDMQRNA+LIQ+W+DKLS +G E+++SQLKNW     
Sbjct: 734  RKRKRTIMNDKQVELMERALVDEPDMQRNASLIQAWADKLSFHGSEVTSSQLKNWLNNRK 793

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAEST 641
                   KDVR   E +N+F +KQG   +    SP SP EDA V        Q      T
Sbjct: 794  ARLARTGKDVRPTLEAENSFLEKQGGPILRSNYSPESPGEDATVQPNVGRDPQA----MT 849

Query: 640  LRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSNL 461
             RT   E SE   AE     P+EFV+CEPGQ  V+VD  GEEI KGKV+QV GKWYG NL
Sbjct: 850  WRTNAAETSEVAPAEAA-FGPSEFVQCEPGQQVVIVDAAGEEIAKGKVFQVHGKWYGKNL 908

Query: 460  EESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQS 293
            +E   CVVDV +LK +R  RLPHP  ATG +F+EA TK+G+MRVLWDS+K+F+L+S
Sbjct: 909  DELRTCVVDVKDLKVKRGTRLPHPSVATGGSFEEAETKIGVMRVLWDSSKIFVLRS 964


>gb|KHG24791.1| hypothetical protein F383_07105 [Gossypium arboreum]
          Length = 924

 Score =  234 bits (598), Expect = 7e-59
 Identities = 120/236 (50%), Positives = 161/236 (68%), Gaps = 1/236 (0%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M  EQ+T++E+AL+DEP+MQRN  LIQSW+DKLS +G E++ SQL+NW     
Sbjct: 696  RKRKRTIMNDEQVTIMERALLDEPEMQRNTTLIQSWADKLSHHGSEVTCSQLRNWLNNRK 755

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHC-DSPASPVEDAYVPLTARGIHQTGIAES 644
                   KD R P E DN F  KQG     H   +P SP ++   P   RG         
Sbjct: 756  ARLARLSKDARPPPEPDNAFAGKQGGPQQGHSLRAPDSPGQET-TPSNTRGTRSM----- 809

Query: 643  TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464
               + +N     V  E ++   AEFV+C+PGQ  VLVDG+G+EIGKGKV+QVQGKW+G +
Sbjct: 810  ---SRMNTSENPVAPEFVDYGAAEFVQCKPGQFIVLVDGRGQEIGKGKVHQVQGKWWGKS 866

Query: 463  LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296
            LEES  CVVDV++LKA+R ++LP+P E+TGT+F++A  KLG+MRV+WDSNK+FML+
Sbjct: 867  LEESGSCVVDVVDLKADRWVKLPYPSESTGTSFEDAEKKLGVMRVMWDSNKIFMLR 922


>ref|XP_011658033.1| PREDICTED: uncharacterized protein LOC101207456 isoform X1 [Cucumis
            sativus]
          Length = 939

 Score =  234 bits (597), Expect = 9e-59
 Identities = 125/236 (52%), Positives = 153/236 (64%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M  +QI++IE+AL+DEP+MQRN A IQ W+D+L   G E+++SQLKNW     
Sbjct: 709  RKRKRTVMNEKQISVIERALLDEPEMQRNPASIQFWADELIRYGSEVASSQLKNWLNNRK 768

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAEST 641
                   +D RA  E DN  PDKQG      CDSP SP ED +VP T R         S 
Sbjct: 769  ARLARTARDSRATLEADNAIPDKQGGMTAGSCDSPDSPCEDKHVPNTGRD------RRSA 822

Query: 640  LRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSNL 461
             RT     S+    E  +  P EFV  +PGQ  +LVD  GEEI KGKV+QV GKWYG NL
Sbjct: 823  SRTNTANNSKNSTTEFNDSGPTEFVHFKPGQYVILVDVLGEEIAKGKVHQVHGKWYGRNL 882

Query: 460  EESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQS 293
            EE E  VVD+ ELKA++   LP+P EATGT+F EA TK+G+MRVLWD NK+FMLQS
Sbjct: 883  EELETLVVDIDELKADKNTVLPYPYEATGTSFHEAETKIGVMRVLWDFNKIFMLQS 938


>ref|XP_011658036.1| PREDICTED: uncharacterized protein LOC101207456 isoform X2 [Cucumis
            sativus] gi|778661408|ref|XP_011658040.1| PREDICTED:
            uncharacterized protein LOC101207456 isoform X2 [Cucumis
            sativus] gi|700210602|gb|KGN65698.1| hypothetical protein
            Csa_1G502860 [Cucumis sativus]
          Length = 932

 Score =  234 bits (597), Expect = 9e-59
 Identities = 125/236 (52%), Positives = 153/236 (64%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M  +QI++IE+AL+DEP+MQRN A IQ W+D+L   G E+++SQLKNW     
Sbjct: 702  RKRKRTVMNEKQISVIERALLDEPEMQRNPASIQFWADELIRYGSEVASSQLKNWLNNRK 761

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAHCDSPASPVEDAYVPLTARGIHQTGIAEST 641
                   +D RA  E DN  PDKQG      CDSP SP ED +VP T R         S 
Sbjct: 762  ARLARTARDSRATLEADNAIPDKQGGMTAGSCDSPDSPCEDKHVPNTGRD------RRSA 815

Query: 640  LRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSNL 461
             RT     S+    E  +  P EFV  +PGQ  +LVD  GEEI KGKV+QV GKWYG NL
Sbjct: 816  SRTNTANNSKNSTTEFNDSGPTEFVHFKPGQYVILVDVLGEEIAKGKVHQVHGKWYGRNL 875

Query: 460  EESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQS 293
            EE E  VVD+ ELKA++   LP+P EATGT+F EA TK+G+MRVLWD NK+FMLQS
Sbjct: 876  EELETLVVDIDELKADKNTVLPYPYEATGTSFHEAETKIGVMRVLWDFNKIFMLQS 931


>ref|XP_007020458.1| Sequence-specific DNA binding,sequence-specific DNA binding
            transcription factors, putative isoform 3 [Theobroma
            cacao] gi|508720086|gb|EOY11983.1| Sequence-specific DNA
            binding,sequence-specific DNA binding transcription
            factors, putative isoform 3 [Theobroma cacao]
          Length = 874

 Score =  234 bits (596), Expect = 1e-58
 Identities = 124/236 (52%), Positives = 160/236 (67%), Gaps = 1/236 (0%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M  EQ+T+IE+AL+DEP+MQRN A IQSW+DKL  +G E++ SQL+NW     
Sbjct: 646  RKRKRTIMNDEQVTIIERALLDEPEMQRNTASIQSWADKLCHHGSEVTCSQLRNWLNNRK 705

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAH-CDSPASPVEDAYVPLTARGIHQTGIAES 644
                   KD R P E DN F  KQG     H   +P S  E+A  P   RG        S
Sbjct: 706  ARLARASKDARPPPEPDNAFAGKQGGPQPGHPFKAPDSSGEEA-APSNTRG------TRS 758

Query: 643  TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464
              R   +E  EA   E ++   AEFV+C+PGQ  VLVDG+GEEIGKGKV+QVQGKW G +
Sbjct: 759  MSRISTSENPEA--PEFVDFGAAEFVQCKPGQFVVLVDGRGEEIGKGKVHQVQGKWCGKS 816

Query: 463  LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296
            LEES  CVVD ++LKA++ ++LP+P EATGT+F+EA TK G+MRV+WDSNK+F+L+
Sbjct: 817  LEESGTCVVDAVDLKADKWVKLPYPSEATGTSFEEAETKFGVMRVMWDSNKIFLLR 872


>ref|XP_007020457.1| NDX1 homeobox protein, putative isoform 2 [Theobroma cacao]
            gi|508720085|gb|EOY11982.1| NDX1 homeobox protein,
            putative isoform 2 [Theobroma cacao]
          Length = 926

 Score =  234 bits (596), Expect = 1e-58
 Identities = 124/236 (52%), Positives = 160/236 (67%), Gaps = 1/236 (0%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M  EQ+T+IE+AL+DEP+MQRN A IQSW+DKL  +G E++ SQL+NW     
Sbjct: 698  RKRKRTIMNDEQVTIIERALLDEPEMQRNTASIQSWADKLCHHGSEVTCSQLRNWLNNRK 757

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAH-CDSPASPVEDAYVPLTARGIHQTGIAES 644
                   KD R P E DN F  KQG     H   +P S  E+A  P   RG        S
Sbjct: 758  ARLARASKDARPPPEPDNAFAGKQGGPQPGHPFKAPDSSGEEA-APSNTRG------TRS 810

Query: 643  TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464
              R   +E  EA   E ++   AEFV+C+PGQ  VLVDG+GEEIGKGKV+QVQGKW G +
Sbjct: 811  MSRISTSENPEA--PEFVDFGAAEFVQCKPGQFVVLVDGRGEEIGKGKVHQVQGKWCGKS 868

Query: 463  LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296
            LEES  CVVD ++LKA++ ++LP+P EATGT+F+EA TK G+MRV+WDSNK+F+L+
Sbjct: 869  LEESGTCVVDAVDLKADKWVKLPYPSEATGTSFEEAETKFGVMRVMWDSNKIFLLR 924


>ref|XP_007020456.1| Sequence-specific DNA binding,sequence-specific DNA binding
            transcription factors, putative isoform 1 [Theobroma
            cacao] gi|508720084|gb|EOY11981.1| Sequence-specific DNA
            binding,sequence-specific DNA binding transcription
            factors, putative isoform 1 [Theobroma cacao]
          Length = 1035

 Score =  234 bits (596), Expect = 1e-58
 Identities = 124/236 (52%), Positives = 160/236 (67%), Gaps = 1/236 (0%)
 Frame = -1

Query: 1000 RKRKRSLMTAEQITLIEKALVDEPDMQRNAALIQSWSDKLSVNGPEISTSQLKNWXXXXX 821
            RKRKR++M  EQ+T+IE+AL+DEP+MQRN A IQSW+DKL  +G E++ SQL+NW     
Sbjct: 807  RKRKRTIMNDEQVTIIERALLDEPEMQRNTASIQSWADKLCHHGSEVTCSQLRNWLNNRK 866

Query: 820  XXXXXXXKDVRAPSEGDNTFPDKQGESGIAH-CDSPASPVEDAYVPLTARGIHQTGIAES 644
                   KD R P E DN F  KQG     H   +P S  E+A  P   RG        S
Sbjct: 867  ARLARASKDARPPPEPDNAFAGKQGGPQPGHPFKAPDSSGEEA-APSNTRG------TRS 919

Query: 643  TLRTVVNEKSEAVLAELIEITPAEFVRCEPGQCAVLVDGKGEEIGKGKVYQVQGKWYGSN 464
              R   +E  EA   E ++   AEFV+C+PGQ  VLVDG+GEEIGKGKV+QVQGKW G +
Sbjct: 920  MSRISTSENPEA--PEFVDFGAAEFVQCKPGQFVVLVDGRGEEIGKGKVHQVQGKWCGKS 977

Query: 463  LEESEVCVVDVLELKAERCMRLPHPCEATGTTFDEANTKLGLMRVLWDSNKLFMLQ 296
            LEES  CVVD ++LKA++ ++LP+P EATGT+F+EA TK G+MRV+WDSNK+F+L+
Sbjct: 978  LEESGTCVVDAVDLKADKWVKLPYPSEATGTSFEEAETKFGVMRVMWDSNKIFLLR 1033

