BLASTX nr result

ID: Zanthoxylum22_contig00007864 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zanthoxylum22_contig00007864
         (1581 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006479943.1| PREDICTED: uncharacterized protein LOC102624...   647   0.0  
ref|XP_006444311.1| hypothetical protein CICLE_v10020382mg [Citr...   640   0.0  
ref|XP_002523069.1| DNA binding protein, putative [Ricinus commu...   548   e-153
ref|XP_012082753.1| PREDICTED: uncharacterized protein LOC105642...   534   e-148
ref|XP_010652188.1| PREDICTED: uncharacterized protein LOC100267...   489   e-135
ref|XP_002268579.1| PREDICTED: uncharacterized protein LOC100267...   489   e-135
ref|XP_007050904.1| DNA binding protein, putative isoform 1 [The...   484   e-133
ref|XP_007050906.1| DNA binding protein, putative isoform 3 [The...   479   e-132
ref|XP_012474959.1| PREDICTED: uncharacterized protein LOC105791...   453   e-124
gb|KJB24366.1| hypothetical protein B456_004G142300 [Gossypium r...   448   e-123
ref|XP_012474958.1| PREDICTED: uncharacterized protein LOC105791...   448   e-123
ref|XP_010259580.1| PREDICTED: uncharacterized protein LOC104598...   425   e-116
ref|XP_010269553.1| PREDICTED: uncharacterized protein LOC104606...   418   e-114
ref|XP_010652189.1| PREDICTED: uncharacterized protein LOC100267...   417   e-113
gb|KJB24368.1| hypothetical protein B456_004G142300 [Gossypium r...   410   e-111
ref|XP_012474960.1| PREDICTED: uncharacterized protein LOC105791...   405   e-110
ref|XP_007031010.1| RING/U-box superfamily protein isoform 1 [Th...   390   e-105
ref|XP_004302386.1| PREDICTED: uncharacterized protein LOC101307...   386   e-104
ref|XP_008370422.1| PREDICTED: uncharacterized protein LOC103433...   385   e-104
ref|XP_007205224.1| hypothetical protein PRUPE_ppa005981mg [Prun...   385   e-104

>ref|XP_006479943.1| PREDICTED: uncharacterized protein LOC102624227 [Citrus sinensis]
          Length = 412

 Score =  647 bits (1668), Expect = 0.0
 Identities = 315/421 (74%), Positives = 336/421 (79%), Gaps = 1/421 (0%)
 Frame = -1

Query: 1500 IRTIVGSKATDHNPHQMXXXXXXXXXXDPPSTGAACSICLDLISDNGVRSRAKLQCGHEY 1321
            +RT+VGSKATDH PHQ             PS+G ACSICLDL+S+NG+RSRAKLQCGHE+
Sbjct: 1    MRTMVGSKATDHGPHQKDGHDDDDIE---PSSGLACSICLDLVSENGIRSRAKLQCGHEF 57

Query: 1320 HLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGSTSSLP-LSMEDWIPDEDLYDLSYSEM 1144
            HLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGST SLP LSMEDWIPDED YDLSYSEM
Sbjct: 58   HLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGSTRSLPELSMEDWIPDEDFYDLSYSEM 117

Query: 1143 PFRVHWCPFGEFARLGSSFEEVEPPSTTYHDLRGQNTVFSEHTGASSMAHSYVAYVGPVP 964
            PFRVHWCPFGEFA+LGSSFEEVEPPSTTYHDLRG N +FS     SSMAHSYVAYVGP P
Sbjct: 118  PFRVHWCPFGEFAQLGSSFEEVEPPSTTYHDLRGHNAIFS-----SSMAHSYVAYVGPAP 172

Query: 963  ATTSRSSNNVDDPHLNHHWNGLSGGNEIFTPHAFPYINIQYYHWGRHSPPFSISSSHMNG 784
             TTSRSSNNVDD H+N HWN LSG NEIFTPHAFP +N+QY  WGR  PPFSIS+  MN 
Sbjct: 173  LTTSRSSNNVDDRHINPHWNVLSGQNEIFTPHAFPAVNLQYTSWGRQPPPFSISTGQMNV 232

Query: 783  VEPNSSLHATLRSSHGESDATPIPRSFHHPLVFDHGSGPRAGSSFVSSVVLRRPGSSSLT 604
             EP S+ HATLRSSHGESDA PIPRSF HPLVFDHGSGPRAG+SFVSSV  RRPGS +LT
Sbjct: 233  AEPTSTPHATLRSSHGESDAAPIPRSFLHPLVFDHGSGPRAGNSFVSSVFPRRPGSGALT 292

Query: 603  PERIQVSHVLHHQPSSNSPGLPTPVFPGIRRFDGSRSLPAVVPAPLQHDQNGGFYVFPPS 424
             ERI   H  H Q SSNSPGLPT V PG+RRFD  RSLPA VPAP QHDQNGGFY+ PPS
Sbjct: 293  RERIHAFH--HRQSSSNSPGLPTTVVPGLRRFDSPRSLPAAVPAPPQHDQNGGFYILPPS 350

Query: 423  SSGQNIPEAENPMLNYFNVWERERSSRLQSVSRDSNWGSFHQXXXXXXXXXXXXSFWHRH 244
            S G  + EAENP  N+F+VWERERS    SVSRDSNWGSFHQ             FWHRH
Sbjct: 351  SPGHTVHEAENPSPNHFHVWERERSYPSPSVSRDSNWGSFHQTTSGSDMGNGLGGFWHRH 410

Query: 243  S 241
            S
Sbjct: 411  S 411


>ref|XP_006444311.1| hypothetical protein CICLE_v10020382mg [Citrus clementina]
            gi|557546573|gb|ESR57551.1| hypothetical protein
            CICLE_v10020382mg [Citrus clementina]
            gi|641868573|gb|KDO87257.1| hypothetical protein
            CISIN_1g015207mg [Citrus sinensis]
          Length = 411

 Score =  640 bits (1652), Expect = 0.0
 Identities = 314/421 (74%), Positives = 335/421 (79%), Gaps = 1/421 (0%)
 Frame = -1

Query: 1500 IRTIVGSKATDHNPHQMXXXXXXXXXXDPPSTGAACSICLDLISDNGVRSRAKLQCGHEY 1321
            +RT+VGSKATDH PHQ             PS+G ACSICLDL+S+NG+RSRAKLQCGHE+
Sbjct: 1    MRTMVGSKATDHGPHQKDGHDDDDIE---PSSGLACSICLDLVSENGIRSRAKLQCGHEF 57

Query: 1320 HLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGSTSSLP-LSMEDWIPDEDLYDLSYSEM 1144
            HLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGST SLP LSMEDWIPDED YDLSYSEM
Sbjct: 58   HLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGSTRSLPELSMEDWIPDEDFYDLSYSEM 117

Query: 1143 PFRVHWCPFGEFARLGSSFEEVEPPSTTYHDLRGQNTVFSEHTGASSMAHSYVAYVGPVP 964
            PFRVHWCPFGEFA+LGSSFEEVEPPSTTYHDLRG N +FS     SSMAHSYVAYVGP P
Sbjct: 118  PFRVHWCPFGEFAQLGSSFEEVEPPSTTYHDLRGHNAIFS-----SSMAHSYVAYVGPAP 172

Query: 963  ATTSRSSNNVDDPHLNHHWNGLSGGNEIFTPHAFPYINIQYYHWGRHSPPFSISSSHMNG 784
             TTSRSSNNVDD H+N HWN LSG NEIFTPHAFP +N+QY  WGR  PPFSIS+  MN 
Sbjct: 173  LTTSRSSNNVDDRHINPHWNVLSGQNEIFTPHAFPAVNLQYTSWGRQPPPFSISTGQMNV 232

Query: 783  VEPNSSLHATLRSSHGESDATPIPRSFHHPLVFDHGSGPRAGSSFVSSVVLRRPGSSSLT 604
             EP S+ HATLRSSHGESDA PIPRSF HPLVFDHGSGPRAG+SFV SV  RRPGS +LT
Sbjct: 233  AEPTSTPHATLRSSHGESDAAPIPRSFLHPLVFDHGSGPRAGNSFV-SVFPRRPGSGALT 291

Query: 603  PERIQVSHVLHHQPSSNSPGLPTPVFPGIRRFDGSRSLPAVVPAPLQHDQNGGFYVFPPS 424
             ERI   H  H Q SSNSPGLPT V PG+RRFD  RSLPA VPAP QHDQNGGFY+ PPS
Sbjct: 292  RERIHAFH--HRQSSSNSPGLPTTVVPGLRRFDSPRSLPAAVPAPPQHDQNGGFYILPPS 349

Query: 423  SSGQNIPEAENPMLNYFNVWERERSSRLQSVSRDSNWGSFHQXXXXXXXXXXXXSFWHRH 244
            S G  + EAENP  N+F+VWERERS    SVSRDSNWGSFHQ             FWHRH
Sbjct: 350  SPGHTVHEAENPSPNHFHVWERERSYPSPSVSRDSNWGSFHQTTSGSDMGNGLGGFWHRH 409

Query: 243  S 241
            S
Sbjct: 410  S 410


>ref|XP_002523069.1| DNA binding protein, putative [Ricinus communis]
            gi|223537631|gb|EEF39254.1| DNA binding protein, putative
            [Ricinus communis]
          Length = 430

 Score =  548 bits (1411), Expect = e-153
 Identities = 260/393 (66%), Positives = 303/393 (77%), Gaps = 1/393 (0%)
 Frame = -1

Query: 1416 PPSTGAACSICLDLISDNGVRSRAKLQCGHEYHLDCIGSAFNMKGAMQCPNCRRIEKGQW 1237
            P S+G +CSICLD + DNG RSRAKLQCGHE+HLDCIGSAFNMKGAMQCPNCR++EKGQW
Sbjct: 44   PSSSGISCSICLDTVLDNGGRSRAKLQCGHEFHLDCIGSAFNMKGAMQCPNCRKVEKGQW 103

Query: 1236 LYANGSTSSLP-LSMEDWIPDEDLYDLSYSEMPFRVHWCPFGEFARLGSSFEEVEPPSTT 1060
            LYANGS   LP +SM+DWIP+ED YDLSYSEMP+RVHWCPFGE AR+GSSF EVE PSTT
Sbjct: 104  LYANGSNRMLPEMSMDDWIPEEDFYDLSYSEMPYRVHWCPFGELARVGSSFGEVESPSTT 163

Query: 1059 YHDLRGQNTVFSEHTGASSMAHSYVAYVGPVPATTSRSSNNVDDPHLNHHWNGLSGGNEI 880
            YHDLRG ++V++EHT ASS+AHSYVAYVGP+P   SRS++ +DDP+ NHHWNGLSG +EI
Sbjct: 164  YHDLRGHHSVYAEHTAASSVAHSYVAYVGPIPPNPSRSNDGIDDPNFNHHWNGLSGRHEI 223

Query: 879  FTPHAFPYINIQYYHWGRHSPPFSISSSHMNGVEPNSSLHATLRSSHGESDATPIPRSFH 700
            F+ HAFP INIQY++WGR SPPFS+SSSH+NGV+P +S+  T RSS GESD      SF 
Sbjct: 224  FSTHAFPAINIQYHNWGRRSPPFSVSSSHINGVDP-ASVPMTFRSSVGESDTRTRSTSFP 282

Query: 699  HPLVFDHGSGPRAGSSFVSSVVLRRPGSSSLTPERIQVSHVLHHQPSSNSPGLPTPVFPG 520
            HP+VF HGSGP AGSSFVSS+  R PGS + T ERIQ+SH  H Q SS+ PG+P+P+  G
Sbjct: 283  HPIVFGHGSGPTAGSSFVSSIFPRHPGSGARTNERIQISHAFHRQQSSSPPGVPSPIIHG 342

Query: 519  IRRFDGSRSLPAVVPAPLQHDQNGGFYVFPPSSSGQNIPEAENPMLNYFNVWERERSSRL 340
            IRRFDG R LP VVPAP QHD +GGF + PPSSS QN  EAENP+ N+F+  ERER    
Sbjct: 343  IRRFDGPRGLPTVVPAPPQHDHSGGFLIIPPSSSSQNSQEAENPLPNHFHARERERLPHF 402

Query: 339  QSVSRDSNWGSFHQXXXXXXXXXXXXSFWHRHS 241
            Q  S   N G                SFWHRHS
Sbjct: 403  QHASFHQNTGG------GPNPGNRSSSFWHRHS 429


>ref|XP_012082753.1| PREDICTED: uncharacterized protein LOC105642518 [Jatropha curcas]
            gi|643716523|gb|KDP28149.1| hypothetical protein
            JCGZ_13920 [Jatropha curcas]
          Length = 438

 Score =  534 bits (1375), Expect = e-148
 Identities = 256/420 (60%), Positives = 309/420 (73%), Gaps = 4/420 (0%)
 Frame = -1

Query: 1491 IVGSKATDHNPHQMXXXXXXXXXXDPPSTGAACSICLDLISDNGVRSRAKLQCGHEYHLD 1312
            ++   A DH+              +P S+G +CSICLD +SDNG RSRAKLQCGHE+HLD
Sbjct: 25   LMDGDADDHHNQLHQQHDGGADEDEPSSSGVSCSICLDTVSDNGGRSRAKLQCGHEFHLD 84

Query: 1311 CIGSAFNMKGAMQCPNCRRIEKGQWLYANGSTSSL-PLSMEDWIPDEDLYDLSYSEMPFR 1135
            CIGSAFNMKGAMQCPNCR++EKGQWLYANGST +L  +SM+DW PD+D YDLSYS  P+R
Sbjct: 85   CIGSAFNMKGAMQCPNCRKVEKGQWLYANGSTRTLHDMSMDDWFPDDDFYDLSYSGTPYR 144

Query: 1134 VHWCPFGEFARLGSSFEEVEPPSTTYHDLRGQNTVFSEHTGASSMAHSYVAYVGPVPATT 955
            +HWCPFGE A++GSSFEEVE PS  YHDLRG +++++EHT ASS+AHSYVAYVGP+P  +
Sbjct: 145  IHWCPFGELAQVGSSFEEVESPSNNYHDLRGHHSIYAEHTAASSVAHSYVAYVGPIPPNS 204

Query: 954  SRSSNNVDDPHLNHHWNGLSGGNEIFTPHAFPYINIQYYHWGRHSPPFSISSSHMNGVEP 775
            SRS+ ++DDP+ NHHWNGLSG +EIF+ HAFP INIQY  WGR SPPFS+SSSH NGV+P
Sbjct: 205  SRSNGSIDDPNFNHHWNGLSGRHEIFSAHAFPAINIQYQSWGRRSPPFSVSSSHANGVDP 264

Query: 774  NSSLHATLRSSHGESDATPIPRSFHHPLVFDHGSGPRAGSSFVSSVVLRRPGSSSLTPER 595
             +S+  T R+S+GESD    P SF HP+ F HGSG  AGSSFVSS+  R PGS + T ER
Sbjct: 265  -ASVPLTFRASNGESDTRMRPTSFPHPIPFGHGSGSSAGSSFVSSIFPRHPGSGARTHER 323

Query: 594  IQVSHVL-HHQPSSNSPGLPTPVFPGIRRFDGSRSLPAVVPAPLQHDQNGGFYVFPPSSS 418
            IQ++H   H Q S + PG+P P+  G+RRFDG R L  VVPAP QHD + GF + PPSSS
Sbjct: 324  IQIAHAFQHRQHSGSPPGIPPPIVHGVRRFDGPRGLHTVVPAPPQHD-HSGFLIIPPSSS 382

Query: 417  GQNIPEAENPMLNYFNVWERERSSRLQSVS--RDSNWGSFHQXXXXXXXXXXXXSFWHRH 244
            GQN+ E E    N+F+ WERE+   LQ VS  RDS WGSFHQ            SFWHRH
Sbjct: 383  GQNLQEPE----NHFHAWEREQLPHLQHVSSDRDSGWGSFHQSNGGPNPGNRSSSFWHRH 438


>ref|XP_010652188.1| PREDICTED: uncharacterized protein LOC100267498 isoform X2 [Vitis
            vinifera] gi|297743887|emb|CBI36857.3| unnamed protein
            product [Vitis vinifera]
          Length = 397

 Score =  489 bits (1260), Expect = e-135
 Identities = 244/400 (61%), Positives = 286/400 (71%), Gaps = 8/400 (2%)
 Frame = -1

Query: 1416 PPSTG----AACSICLDLISDNGVRSRAKLQCGHEYHLDCIGSAFNMKGAMQCPNCRRIE 1249
            PPS G     +CSICLDL++DNG RSRAKLQCGHE+HLDCIGSAFNMKGAMQCPNCR+IE
Sbjct: 11   PPSHGEVSFVSCSICLDLVTDNGERSRAKLQCGHEFHLDCIGSAFNMKGAMQCPNCRKIE 70

Query: 1248 KGQWLYANGSTSSLP-LSMEDWIPDEDLYDLSYSEMPFRVHWCPFGEFARLGSSFEEVEP 1072
            +G+WL+ANGS  S P  SM+DW PDE+ YD +YSEMPFRV WCPF  F ++ SSFEEVE 
Sbjct: 71   RGRWLFANGSARSFPEFSMDDWTPDEETYDFNYSEMPFRVQWCPFSGFTQVRSSFEEVES 130

Query: 1071 PSTTYHDLRGQNTVFSEHTGASSMAHSYVAYVGPVPATTSRSSNNVDDPHLNHHWNGLSG 892
            PSTT+HDL+G + + SEH  ASS AHSYVAY GP+P T S SS +VDD + NHHWN LS 
Sbjct: 131  PSTTHHDLQGHHAILSEHAAASSAAHSYVAYFGPIPPTHSNSSESVDDLNFNHHWNSLSA 190

Query: 891  GNEIFTPHAFPYINIQYYHWGRHSPPFSISSSHMNGVEPNSSLHATLRSSHGESDATPIP 712
             +EIF+ HAFP I+IQY  WG HSPPFS +SSH+NG E   +L ATLRS  GESDA    
Sbjct: 191  HSEIFSSHAFPAIDIQYQSWGHHSPPFSPTSSHINGAEQAPALPATLRSMRGESDAMTRS 250

Query: 711  RSFHHPLVFDHGSGPRAGSSFVSSVVLRRPGSSSL-TPERIQVSHVLHHQ-PSSNSPGLP 538
             SF HPL+F  GSG RAGS+FVSS+V   PG+S L T ERI +SH L HQ P  NSPG+P
Sbjct: 251  GSFVHPLLFGPGSGHRAGSAFVSSIVPNHPGNSVLRTYERIHISHALPHQHPPPNSPGMP 310

Query: 537  TPVFPGIRRFDGSRSLPAVVPAPLQHDQNGGFYVFPPS-SSGQNIPEAENPMLNYFNVWE 361
            T + PG+RRF+G R+LP VVPA  Q D + GFY+FPPS +S +NI EAENP LN+F+ W 
Sbjct: 311  TSIVPGVRRFNGPRALPPVVPAASQSDHSAGFYIFPPSGASIRNIHEAENPSLNHFHAW- 369

Query: 360  RERSSRLQSVSRDSNWGSFHQXXXXXXXXXXXXSFWHRHS 241
                        DS WGSFHQ            SFW   S
Sbjct: 370  ------------DSGWGSFHQATGGSDSGSRSSSFWRHWS 397


>ref|XP_002268579.1| PREDICTED: uncharacterized protein LOC100267498 isoform X1 [Vitis
            vinifera]
          Length = 407

 Score =  489 bits (1260), Expect = e-135
 Identities = 244/400 (61%), Positives = 286/400 (71%), Gaps = 8/400 (2%)
 Frame = -1

Query: 1416 PPSTG----AACSICLDLISDNGVRSRAKLQCGHEYHLDCIGSAFNMKGAMQCPNCRRIE 1249
            PPS G     +CSICLDL++DNG RSRAKLQCGHE+HLDCIGSAFNMKGAMQCPNCR+IE
Sbjct: 21   PPSHGEVSFVSCSICLDLVTDNGERSRAKLQCGHEFHLDCIGSAFNMKGAMQCPNCRKIE 80

Query: 1248 KGQWLYANGSTSSLP-LSMEDWIPDEDLYDLSYSEMPFRVHWCPFGEFARLGSSFEEVEP 1072
            +G+WL+ANGS  S P  SM+DW PDE+ YD +YSEMPFRV WCPF  F ++ SSFEEVE 
Sbjct: 81   RGRWLFANGSARSFPEFSMDDWTPDEETYDFNYSEMPFRVQWCPFSGFTQVRSSFEEVES 140

Query: 1071 PSTTYHDLRGQNTVFSEHTGASSMAHSYVAYVGPVPATTSRSSNNVDDPHLNHHWNGLSG 892
            PSTT+HDL+G + + SEH  ASS AHSYVAY GP+P T S SS +VDD + NHHWN LS 
Sbjct: 141  PSTTHHDLQGHHAILSEHAAASSAAHSYVAYFGPIPPTHSNSSESVDDLNFNHHWNSLSA 200

Query: 891  GNEIFTPHAFPYINIQYYHWGRHSPPFSISSSHMNGVEPNSSLHATLRSSHGESDATPIP 712
             +EIF+ HAFP I+IQY  WG HSPPFS +SSH+NG E   +L ATLRS  GESDA    
Sbjct: 201  HSEIFSSHAFPAIDIQYQSWGHHSPPFSPTSSHINGAEQAPALPATLRSMRGESDAMTRS 260

Query: 711  RSFHHPLVFDHGSGPRAGSSFVSSVVLRRPGSSSL-TPERIQVSHVLHHQ-PSSNSPGLP 538
             SF HPL+F  GSG RAGS+FVSS+V   PG+S L T ERI +SH L HQ P  NSPG+P
Sbjct: 261  GSFVHPLLFGPGSGHRAGSAFVSSIVPNHPGNSVLRTYERIHISHALPHQHPPPNSPGMP 320

Query: 537  TPVFPGIRRFDGSRSLPAVVPAPLQHDQNGGFYVFPPS-SSGQNIPEAENPMLNYFNVWE 361
            T + PG+RRF+G R+LP VVPA  Q D + GFY+FPPS +S +NI EAENP LN+F+ W 
Sbjct: 321  TSIVPGVRRFNGPRALPPVVPAASQSDHSAGFYIFPPSGASIRNIHEAENPSLNHFHAW- 379

Query: 360  RERSSRLQSVSRDSNWGSFHQXXXXXXXXXXXXSFWHRHS 241
                        DS WGSFHQ            SFW   S
Sbjct: 380  ------------DSGWGSFHQATGGSDSGSRSSSFWRHWS 407


>ref|XP_007050904.1| DNA binding protein, putative isoform 1 [Theobroma cacao]
            gi|508703165|gb|EOX95061.1| DNA binding protein, putative
            isoform 1 [Theobroma cacao]
          Length = 429

 Score =  484 bits (1245), Expect = e-133
 Identities = 245/414 (59%), Positives = 294/414 (71%), Gaps = 17/414 (4%)
 Frame = -1

Query: 1491 IVGSKAT--------DHNPHQ-MXXXXXXXXXXDPPSTGAACSICLDLISDNGVRSRAKL 1339
            +VGSKA+        DH+ H              PPS+  +CSICLDL+SD+  RSRAKL
Sbjct: 1    MVGSKASQLGGDLDHDHDDHHHQLIDGHAGDAAVPPSSEVSCSICLDLVSDSSGRSRAKL 60

Query: 1338 QCGHEYHLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGSTSSLP-LSMEDWIPDEDLYD 1162
            QCGHE+HLDCIGSAFN+KGAMQCPNCR++EKGQWLYA+GS+ SLP LS EDW  D+D YD
Sbjct: 61   QCGHEFHLDCIGSAFNVKGAMQCPNCRKVEKGQWLYASGSSRSLPELSTEDWNLDDDYYD 120

Query: 1161 LSYSEMPFRVHWCPFGEFARLGSSFEEVEPPSTTYHDLRGQNTVFSEHTGASSMAHSYVA 982
              YSEMPFRV WCPFGEF+R+GSS EEVE PSTTYH++ G + +F+EH  ASS+AHSYVA
Sbjct: 121  PGYSEMPFRVQWCPFGEFSRIGSSSEEVESPSTTYHEIHGHHAIFAEHAAASSVAHSYVA 180

Query: 981  YVGPVPATTSRSSNNVDDPHLNHHWNGLSGGNEIFTPHAFPYINIQYYHWGRHSPPFSIS 802
            YVGP+P TT RSS++VDDP+ N HWN LSG NEIF PHA P I+IQY+ WG+H P FS+S
Sbjct: 181  YVGPLPPTTLRSSDSVDDPNFNRHWNSLSGHNEIFIPHALPTISIQYHSWGQHPPNFSVS 240

Query: 801  SSHMNGVEPNSSLHATLRSSHGESDATPIPRSFHHPLVFDHGSGPRAGSSFVSSVVLRRP 622
             SH++  +P S   A LRSS+GE DA   PRSF H   F+HGS  RAGSSFVSSV  R P
Sbjct: 241  DSHISHTDPASVPAAALRSSNGELDALSRPRSFPHHFPFEHGSSSRAGSSFVSSVFPRHP 300

Query: 621  GSSSLTPERIQVSHVLHHQ------PSSNSPGLPTPVFPGIRRFDGSRSLPAVVPAPLQH 460
            GSS+ T +RIQ S   + Q      P  N PG+PTPV PG+     +R L  V PA  Q 
Sbjct: 301  GSSAHTHDRIQASLAFYRQQHRFNHPRFNRPGVPTPVVPGM-----TRGLTPVAPAVPQP 355

Query: 459  DQNGGFYVFPP-SSSGQNIPEAENPMLNYFNVWERERSSRLQSVSRDSNWGSFH 301
            DQ G FY++PP SSSGQN+ EAE+   + +N  ERER S   +VSRDS WGS+H
Sbjct: 356  DQGGSFYIYPPSSSSGQNLHEAESFFPSNYNALERERLSHFPTVSRDSGWGSYH 409


>ref|XP_007050906.1| DNA binding protein, putative isoform 3 [Theobroma cacao]
            gi|508703167|gb|EOX95063.1| DNA binding protein, putative
            isoform 3 [Theobroma cacao]
          Length = 427

 Score =  479 bits (1233), Expect = e-132
 Identities = 245/414 (59%), Positives = 294/414 (71%), Gaps = 17/414 (4%)
 Frame = -1

Query: 1491 IVGSKAT--------DHNPHQ-MXXXXXXXXXXDPPSTGAACSICLDLISDNGVRSRAKL 1339
            +VGSKA+        DH+ H              PPS+  +CSICLDL+SD+  RSRAKL
Sbjct: 1    MVGSKASQLGGDLDHDHDDHHHQLIDGHAGDAAVPPSSEVSCSICLDLVSDSSGRSRAKL 60

Query: 1338 QCGHEYHLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGSTSSLP-LSMEDWIPDEDLYD 1162
            QCGHE+HLDCIGSAFN+KGAMQCPNCR++EKGQWLYA+GS+ SLP LS EDW  D+D YD
Sbjct: 61   QCGHEFHLDCIGSAFNVKGAMQCPNCRKVEKGQWLYASGSSRSLPELSTEDWNLDDDYYD 120

Query: 1161 LSYSEMPFRVHWCPFGEFARLGSSFEEVEPPSTTYHDLRGQNTVFSEHTGASSMAHSYVA 982
              YSEMPFRV WCPFGEF+R+GSS EEVE PSTTYH++ G + +F+EH  ASS+AHSYVA
Sbjct: 121  PGYSEMPFRVQWCPFGEFSRIGSSSEEVESPSTTYHEIHGHHAIFAEHAAASSVAHSYVA 180

Query: 981  YVGPVPATTSRSSNNVDDPHLNHHWNGLSGGNEIFTPHAFPYINIQYYHWGRHSPPFSIS 802
            YVGP+P TT RSS++VDDP+ N HWN LSG NEIF PHA P I+IQY+ WG+H P FS+S
Sbjct: 181  YVGPLPPTTLRSSDSVDDPNFNRHWNSLSGHNEIFIPHALPTISIQYHSWGQHPPNFSVS 240

Query: 801  SSHMNGVEPNSSLHATLRSSHGESDATPIPRSFHHPLVFDHGSGPRAGSSFVSSVVLRRP 622
             SH++  +P S   A LRSS+GE DA   PRSF H   F+HGS  RAGSSFVSSV  R P
Sbjct: 241  DSHISHTDPASVPAAALRSSNGELDALSRPRSFPHHFPFEHGS--RAGSSFVSSVFPRHP 298

Query: 621  GSSSLTPERIQVSHVLHHQ------PSSNSPGLPTPVFPGIRRFDGSRSLPAVVPAPLQH 460
            GSS+ T +RIQ S   + Q      P  N PG+PTPV PG+     +R L  V PA  Q 
Sbjct: 299  GSSAHTHDRIQASLAFYRQQHRFNHPRFNRPGVPTPVVPGM-----TRGLTPVAPAVPQP 353

Query: 459  DQNGGFYVFPP-SSSGQNIPEAENPMLNYFNVWERERSSRLQSVSRDSNWGSFH 301
            DQ G FY++PP SSSGQN+ EAE+   + +N  ERER S   +VSRDS WGS+H
Sbjct: 354  DQGGSFYIYPPSSSSGQNLHEAESFFPSNYNALERERLSHFPTVSRDSGWGSYH 407


>ref|XP_012474959.1| PREDICTED: uncharacterized protein LOC105791441 isoform X2 [Gossypium
            raimondii] gi|763757031|gb|KJB24362.1| hypothetical
            protein B456_004G142300 [Gossypium raimondii]
          Length = 421

 Score =  453 bits (1165), Expect = e-124
 Identities = 230/410 (56%), Positives = 283/410 (69%), Gaps = 14/410 (3%)
 Frame = -1

Query: 1491 IVGSKAT----DHNPHQMXXXXXXXXXXDPP--STGAACSICLDLISDNGVRSRAKLQCG 1330
            +V SKA+    DH+ H              P  S+  +CSICLDL+SD G RSRAKL CG
Sbjct: 1    MVRSKASLLDLDHDDHHQLMDGPDGDVSAVPLSSSDISCSICLDLVSDTGGRSRAKLLCG 60

Query: 1329 HEYHLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGSTSSLP-LSMEDWIPDEDLYDLSY 1153
            H++HLDCIGSAFNMKGAMQCPNCR++EKGQWLYANGS  SLP L+MEDW  D+D Y+  Y
Sbjct: 61   HQFHLDCIGSAFNMKGAMQCPNCRKVEKGQWLYANGSNRSLPELTMEDWNLDDDYYEPVY 120

Query: 1152 SEMPFRVHWCPFGEFARLGSSFEEVEPPSTTYHDLRGQNTVFSEHTGASSMAHSYVAYVG 973
            SEM FR  WCP+GEF R+GSS EEVE PSTTYH++ G + +F+EH  ASS+AHSYVAYVG
Sbjct: 121  SEMQFRAQWCPYGEFTRIGSSSEEVESPSTTYHEIHGHHAIFAEHAAASSVAHSYVAYVG 180

Query: 972  PVPATTSRSSNNVDDPHLNHHWNGLSGGNEIFTPHAFPYINIQYYHWGRHSPPFSISSSH 793
            P+P+TT R+S++VDDP+ N HWN LSG NEIF PHAFP I IQY+ WGRHSP FSIS+SH
Sbjct: 181  PLPSTTLRNSDSVDDPNFNRHWNILSGHNEIFIPHAFPTIRIQYHSWGRHSPNFSISNSH 240

Query: 792  MNGVEPNSSLHATLRSSHGESDATPIPRSFHHPLVFDHGSGPRAGSSFVSSVVLRRPGSS 613
            +   +P S   A LRSS+GE DA+ +PR F HP  F+HGS  R GSSFVSSV    PGS 
Sbjct: 241  IGNTDPASVPAAALRSSNGEPDASTVPRLFGHPFPFEHGSSSRGGSSFVSSVFHHHPGSG 300

Query: 612  SLTPERIQVSHVLH------HQPSSNSPGLPTPVFPGIRRFDGSRSLPAVVPAPLQHDQN 451
            + T +R   S   +      +Q   N PG+P  V PGI      R +  + PA  Q DQ 
Sbjct: 301  AHTHDRTWPSLAYYRQQHRFNQQRFNRPGVPALVVPGI------RGVAPMTPAVPQPDQT 354

Query: 450  GGFYVFP-PSSSGQNIPEAENPMLNYFNVWERERSSRLQSVSRDSNWGSF 304
            GGFY++P  SSSGQN+PEAE+   N +   ERER S  +++SR + WG++
Sbjct: 355  GGFYIYPRSSSSGQNLPEAESSYPNNYIALERERLSHFRTMSRVTGWGAY 404


>gb|KJB24366.1| hypothetical protein B456_004G142300 [Gossypium raimondii]
          Length = 419

 Score =  448 bits (1153), Expect = e-123
 Identities = 230/410 (56%), Positives = 283/410 (69%), Gaps = 14/410 (3%)
 Frame = -1

Query: 1491 IVGSKAT----DHNPHQMXXXXXXXXXXDPP--STGAACSICLDLISDNGVRSRAKLQCG 1330
            +V SKA+    DH+ H              P  S+  +CSICLDL+SD G RSRAKL CG
Sbjct: 1    MVRSKASLLDLDHDDHHQLMDGPDGDVSAVPLSSSDISCSICLDLVSDTGGRSRAKLLCG 60

Query: 1329 HEYHLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGSTSSLP-LSMEDWIPDEDLYDLSY 1153
            H++HLDCIGSAFNMKGAMQCPNCR++EKGQWLYANGS  SLP L+MEDW  D+D Y+  Y
Sbjct: 61   HQFHLDCIGSAFNMKGAMQCPNCRKVEKGQWLYANGSNRSLPELTMEDWNLDDDYYEPVY 120

Query: 1152 SEMPFRVHWCPFGEFARLGSSFEEVEPPSTTYHDLRGQNTVFSEHTGASSMAHSYVAYVG 973
            SEM FR  WCP+GEF R+GSS EEVE PSTTYH++ G + +F+EH  ASS+AHSYVAYVG
Sbjct: 121  SEMQFRAQWCPYGEFTRIGSSSEEVESPSTTYHEIHGHHAIFAEHAAASSVAHSYVAYVG 180

Query: 972  PVPATTSRSSNNVDDPHLNHHWNGLSGGNEIFTPHAFPYINIQYYHWGRHSPPFSISSSH 793
            P+P+TT R+S++VDDP+ N HWN LSG NEIF PHAFP I IQY+ WGRHSP FSIS+SH
Sbjct: 181  PLPSTTLRNSDSVDDPNFNRHWNILSGHNEIFIPHAFPTIRIQYHSWGRHSPNFSISNSH 240

Query: 792  MNGVEPNSSLHATLRSSHGESDATPIPRSFHHPLVFDHGSGPRAGSSFVSSVVLRRPGSS 613
            +   +P S   A LRSS+GE DA+ +PR F HP  F+HGS  R GSSFVSSV    PGS 
Sbjct: 241  IGNTDPASVPAAALRSSNGEPDASTVPRLFGHPFPFEHGS--RGGSSFVSSVFHHHPGSG 298

Query: 612  SLTPERIQVSHVLH------HQPSSNSPGLPTPVFPGIRRFDGSRSLPAVVPAPLQHDQN 451
            + T +R   S   +      +Q   N PG+P  V PGI      R +  + PA  Q DQ 
Sbjct: 299  AHTHDRTWPSLAYYRQQHRFNQQRFNRPGVPALVVPGI------RGVAPMTPAVPQPDQT 352

Query: 450  GGFYVFP-PSSSGQNIPEAENPMLNYFNVWERERSSRLQSVSRDSNWGSF 304
            GGFY++P  SSSGQN+PEAE+   N +   ERER S  +++SR + WG++
Sbjct: 353  GGFYIYPRSSSSGQNLPEAESSYPNNYIALERERLSHFRTMSRVTGWGAY 402


>ref|XP_012474958.1| PREDICTED: uncharacterized protein LOC105791441 isoform X1 [Gossypium
            raimondii] gi|763757032|gb|KJB24363.1| hypothetical
            protein B456_004G142300 [Gossypium raimondii]
          Length = 422

 Score =  448 bits (1153), Expect = e-123
 Identities = 230/411 (55%), Positives = 283/411 (68%), Gaps = 15/411 (3%)
 Frame = -1

Query: 1491 IVGSKAT----DHNPHQMXXXXXXXXXXDPP--STGAACSICLDLISDNGVRSRAKLQCG 1330
            +V SKA+    DH+ H              P  S+  +CSICLDL+SD G RSRAKL CG
Sbjct: 1    MVRSKASLLDLDHDDHHQLMDGPDGDVSAVPLSSSDISCSICLDLVSDTGGRSRAKLLCG 60

Query: 1329 HEYHLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGSTSSLP-LSMEDWIPDEDLYDLSY 1153
            H++HLDCIGSAFNMKGAMQCPNCR++EKGQWLYANGS  SLP L+MEDW  D+D Y+  Y
Sbjct: 61   HQFHLDCIGSAFNMKGAMQCPNCRKVEKGQWLYANGSNRSLPELTMEDWNLDDDYYEPVY 120

Query: 1152 SEMP-FRVHWCPFGEFARLGSSFEEVEPPSTTYHDLRGQNTVFSEHTGASSMAHSYVAYV 976
            SEM  FR  WCP+GEF R+GSS EEVE PSTTYH++ G + +F+EH  ASS+AHSYVAYV
Sbjct: 121  SEMQQFRAQWCPYGEFTRIGSSSEEVESPSTTYHEIHGHHAIFAEHAAASSVAHSYVAYV 180

Query: 975  GPVPATTSRSSNNVDDPHLNHHWNGLSGGNEIFTPHAFPYINIQYYHWGRHSPPFSISSS 796
            GP+P+TT R+S++VDDP+ N HWN LSG NEIF PHAFP I IQY+ WGRHSP FSIS+S
Sbjct: 181  GPLPSTTLRNSDSVDDPNFNRHWNILSGHNEIFIPHAFPTIRIQYHSWGRHSPNFSISNS 240

Query: 795  HMNGVEPNSSLHATLRSSHGESDATPIPRSFHHPLVFDHGSGPRAGSSFVSSVVLRRPGS 616
            H+   +P S   A LRSS+GE DA+ +PR F HP  F+HGS  R GSSFVSSV    PGS
Sbjct: 241  HIGNTDPASVPAAALRSSNGEPDASTVPRLFGHPFPFEHGSSSRGGSSFVSSVFHHHPGS 300

Query: 615  SSLTPERIQVSHVLH------HQPSSNSPGLPTPVFPGIRRFDGSRSLPAVVPAPLQHDQ 454
             + T +R   S   +      +Q   N PG+P  V PGI      R +  + PA  Q DQ
Sbjct: 301  GAHTHDRTWPSLAYYRQQHRFNQQRFNRPGVPALVVPGI------RGVAPMTPAVPQPDQ 354

Query: 453  NGGFYVFP-PSSSGQNIPEAENPMLNYFNVWERERSSRLQSVSRDSNWGSF 304
             GGFY++P  SSSGQN+PEAE+   N +   ERER S  +++SR + WG++
Sbjct: 355  TGGFYIYPRSSSSGQNLPEAESSYPNNYIALERERLSHFRTMSRVTGWGAY 405


>ref|XP_010259580.1| PREDICTED: uncharacterized protein LOC104598959 isoform X1 [Nelumbo
            nucifera]
          Length = 441

 Score =  425 bits (1092), Expect = e-116
 Identities = 213/396 (53%), Positives = 266/396 (67%), Gaps = 7/396 (1%)
 Frame = -1

Query: 1410 STGAACSICLDLISDNGVRSRAKLQCGHEYHLDCIGSAFNMKGAMQCPNCRRIEKGQWLY 1231
            ++  +CSICL+ ++DNG RSRAKLQCGHE+HLDCIGSAFN KG MQCPNCR+IEKGQWLY
Sbjct: 21   ASSVSCSICLEAVTDNGDRSRAKLQCGHEFHLDCIGSAFNAKGVMQCPNCRKIEKGQWLY 80

Query: 1230 ANGSTSSLPLSMEDWIPDEDLYDLSYSEMPFRVHWCPFGEFARLGSSFEEVEPPSTTYHD 1051
            ANGS S    SM+DW+ DEDLYDL YSEMPF VHWCPFG  AR+ SSFEE E PST YHD
Sbjct: 81   ANGSRSFPEFSMDDWVHDEDLYDLGYSEMPFGVHWCPFGGLARVTSSFEEGESPSTAYHD 140

Query: 1050 LRGQNTVFSEHTGASSMAHS--YVAYVGPV-PATTSRSSNNVDDPHLNHHWNGLSGGNEI 880
            L G + +F+EHT ASS AHS  YVAY  P+ P+ +S +    + P+ +HHWN LSG ++I
Sbjct: 141  LLGHHAIFAEHTAASSAAHSCPYVAYFQPLQPSASSSNEGMAEGPNFSHHWNSLSGPSDI 200

Query: 879  FTPHAFPYINIQYYHWGRHSPPFSISSSHMNGVEPNSSLHATLRSSHGESDATPIPRSFH 700
             T H FP +++ Y+ W  HSPPFS + S + G +  +   +TLRS+ G+++  P   SF 
Sbjct: 201  STSHGFPGLDLHYHSWDHHSPPFSPTGSRIAGADQATIPSSTLRSTRGDTEPLPRSGSFV 260

Query: 699  HPLVFDHGSGPRAGSSFVSSVVLRRPGSSSLTPERIQVSHVLHHQPSSNSPGLPTPVFPG 520
            HP +  HGSGPRAGSS VSS+V   P SS+ T +RIQ  H  H Q   NSPG+  P+F G
Sbjct: 261  HPYLLGHGSGPRAGSSVVSSMVPPYPSSSARTHDRIQGLHAYHQQQPGNSPGMRAPIFSG 320

Query: 519  IRRFDGSRSLPAVVPAPLQHDQNGGFYVFPPS-SSGQNIPEAENPMLNYFNVWERERSS- 346
            IRR   S +L  V P     D  G FY+FP S SSG+N+ +A+NP+ N F  WER+R + 
Sbjct: 321  IRR---SSALRPVGPVASSSDHTGTFYMFPSSGSSGRNLQDADNPLRNRFYAWERDRFAP 377

Query: 345  -RLQSVSRDSN-WGSFHQXXXXXXXXXXXXSFWHRH 244
              L  V R+S+ WG FHQ            SFW RH
Sbjct: 378  FPLIPVDRESSWWGPFHQAAGGSDSGGRTSSFWQRH 413


>ref|XP_010269553.1| PREDICTED: uncharacterized protein LOC104606172 [Nelumbo nucifera]
          Length = 445

 Score =  418 bits (1075), Expect = e-114
 Identities = 214/398 (53%), Positives = 266/398 (66%), Gaps = 9/398 (2%)
 Frame = -1

Query: 1410 STGAACSICLDLISDNGVRSRAKLQCGHEYHLDCIGSAFNMKGAMQCPNCRRIEKGQWLY 1231
            ++  +CSICL+ ++DNG RSRAKLQCGHE+HLDCIGSAFN KG MQCPNCR+IEKGQWLY
Sbjct: 21   ASSVSCSICLEAVTDNGDRSRAKLQCGHEFHLDCIGSAFNAKGMMQCPNCRKIEKGQWLY 80

Query: 1230 ANGSTSSLPLSMEDWIPDEDLYDLSYSEMPFRVHWCPFGEFARLGSSFEEVEPPSTTYHD 1051
            ANG  S    SM+DW  DEDLYDLSYSEMPF VHWCPF   AR+ SSFEE E PST YHD
Sbjct: 81   ANGCRSFAEFSMDDWTHDEDLYDLSYSEMPFGVHWCPFSGLARVPSSFEEGESPSTAYHD 140

Query: 1050 LRGQNTVFSEHTGASS-MAHS--YVAYVGPVPATTSRSSNNV-DDPHLNHHWNGLSGGNE 883
            L G + +F++HT ASS  AHS  YVAY  P+  +TS S+ ++ D P+ NHHWN LSG ++
Sbjct: 141  LLGHHAIFADHTAASSAAAHSCPYVAYFQPLQPSTSSSNESIGDGPNFNHHWNSLSGPSD 200

Query: 882  IFTPHAFPYINIQYYHWGRHSPPFSISSSHMNGVEPNSSLHATLRSSHGESDATPIPRSF 703
            +   H  P ++IQY+ W  HSPPFS +SS M G +  S   +TLRS+ G+++  P   SF
Sbjct: 201  MSVSHGLPTVDIQYHSWDHHSPPFSPTSSRMAGADQASVHSSTLRSTRGDTEGLPRSGSF 260

Query: 702  HHPLVFDHGSGPRAGSSFVSSVVLRRPGSSSLTPERIQVSHVLH-HQPSSNSPGLPTPVF 526
             HP +  HGSGPRAGSS V S++    GSS+   +R+Q  H  H  Q SS++PG+  P+F
Sbjct: 261  VHPFLLGHGSGPRAGSSVV-SLIPPYSGSSARAHDRVQGLHAYHQQQQSSSTPGMRGPIF 319

Query: 525  PGIRRFDGSRSLPAVVPAPLQHDQNGGFYVFPPS-SSGQNIPEAENPMLNYFNVWERERS 349
             G+RR  G R L  V P     D  G FYVFPPS SS +N+ EAENP+ + F  WER+R 
Sbjct: 320  SGVRRSSGPRGLAPVGPLASSSDHTGAFYVFPPSGSSSRNLQEAENPVRSRFYAWERDRF 379

Query: 348  S--RLQSVSRDSN-WGSFHQXXXXXXXXXXXXSFWHRH 244
            +   L  V R+S+ WG FHQ            SFW RH
Sbjct: 380  APFPLIPVDRESSWWGPFHQAAGGSDSGSRTSSFWQRH 417


>ref|XP_010652189.1| PREDICTED: uncharacterized protein LOC100267498 isoform X3 [Vitis
            vinifera]
          Length = 341

 Score =  417 bits (1072), Expect = e-113
 Identities = 209/354 (59%), Positives = 247/354 (69%), Gaps = 4/354 (1%)
 Frame = -1

Query: 1290 MKGAMQCPNCRRIEKGQWLYANGSTSSLP-LSMEDWIPDEDLYDLSYSEMPFRVHWCPFG 1114
            MKGAMQCPNCR+IE+G+WL+ANGS  S P  SM+DW PDE+ YD +YSEMPFRV WCPF 
Sbjct: 1    MKGAMQCPNCRKIERGRWLFANGSARSFPEFSMDDWTPDEETYDFNYSEMPFRVQWCPFS 60

Query: 1113 EFARLGSSFEEVEPPSTTYHDLRGQNTVFSEHTGASSMAHSYVAYVGPVPATTSRSSNNV 934
             F ++ SSFEEVE PSTT+HDL+G + + SEH  ASS AHSYVAY GP+P T S SS +V
Sbjct: 61   GFTQVRSSFEEVESPSTTHHDLQGHHAILSEHAAASSAAHSYVAYFGPIPPTHSNSSESV 120

Query: 933  DDPHLNHHWNGLSGGNEIFTPHAFPYINIQYYHWGRHSPPFSISSSHMNGVEPNSSLHAT 754
            DD + NHHWN LS  +EIF+ HAFP I+IQY  WG HSPPFS +SSH+NG E   +L AT
Sbjct: 121  DDLNFNHHWNSLSAHSEIFSSHAFPAIDIQYQSWGHHSPPFSPTSSHINGAEQAPALPAT 180

Query: 753  LRSSHGESDATPIPRSFHHPLVFDHGSGPRAGSSFVSSVVLRRPGSSSL-TPERIQVSHV 577
            LRS  GESDA     SF HPL+F  GSG RAGS+FVSS+V   PG+S L T ERI +SH 
Sbjct: 181  LRSMRGESDAMTRSGSFVHPLLFGPGSGHRAGSAFVSSIVPNHPGNSVLRTYERIHISHA 240

Query: 576  LHHQ-PSSNSPGLPTPVFPGIRRFDGSRSLPAVVPAPLQHDQNGGFYVFPPS-SSGQNIP 403
            L HQ P  NSPG+PT + PG+RRF+G R+LP VVPA  Q D + GFY+FPPS +S +NI 
Sbjct: 241  LPHQHPPPNSPGMPTSIVPGVRRFNGPRALPPVVPAASQSDHSAGFYIFPPSGASIRNIH 300

Query: 402  EAENPMLNYFNVWERERSSRLQSVSRDSNWGSFHQXXXXXXXXXXXXSFWHRHS 241
            EAENP LN+F+ W             DS WGSFHQ            SFW   S
Sbjct: 301  EAENPSLNHFHAW-------------DSGWGSFHQATGGSDSGSRSSSFWRHWS 341


>gb|KJB24368.1| hypothetical protein B456_004G142300 [Gossypium raimondii]
          Length = 396

 Score =  410 bits (1053), Expect = e-111
 Identities = 216/410 (52%), Positives = 266/410 (64%), Gaps = 14/410 (3%)
 Frame = -1

Query: 1491 IVGSKAT----DHNPHQMXXXXXXXXXXDPP--STGAACSICLDLISDNGVRSRAKLQCG 1330
            +V SKA+    DH+ H              P  S+  +CSICLDL+SD G RSRAKL CG
Sbjct: 1    MVRSKASLLDLDHDDHHQLMDGPDGDVSAVPLSSSDISCSICLDLVSDTGGRSRAKLLCG 60

Query: 1329 HEYHLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGSTSSLP-LSMEDWIPDEDLYDLSY 1153
            H++HLDCIGSAFNMKGAMQCPNCR++EKGQWLYANGS  SLP L+MEDW  D+D Y+  Y
Sbjct: 61   HQFHLDCIGSAFNMKGAMQCPNCRKVEKGQWLYANGSNRSLPELTMEDWNLDDDYYEPVY 120

Query: 1152 SEMPFRVHWCPFGEFARLGSSFEEVEPPSTTYHDLRGQNTVFSEHTGASSMAHSYVAYVG 973
            SEM FR  WCP+GEF R+GSS EEVE PSTTYH++ G + +F+EH  ASS+AHSYVAYVG
Sbjct: 121  SEMQFRAQWCPYGEFTRIGSSSEEVESPSTTYHEIHGHHAIFAEHAAASSVAHSYVAYVG 180

Query: 972  PVPATTSRSSNNVDDPHLNHHWNGLSGGNEIFTPHAFPYINIQYYHWGRHSPPFSISSSH 793
            P+P+TT R+S++VDDP+ N HWN LSG NEIF PHAFP I IQY+               
Sbjct: 181  PLPSTTLRNSDSVDDPNFNRHWNILSGHNEIFIPHAFPTIRIQYH--------------- 225

Query: 792  MNGVEPNSSLHATLRSSHGESDATPIPRSFHHPLVFDHGSGPRAGSSFVSSVVLRRPGSS 613
                       A LRSS+GE DA+ +PR F HP  F+HGS  R GSSFVSSV    PGS 
Sbjct: 226  ----------TAALRSSNGEPDASTVPRLFGHPFPFEHGSSSRGGSSFVSSVFHHHPGSG 275

Query: 612  SLTPERIQVSHVLH------HQPSSNSPGLPTPVFPGIRRFDGSRSLPAVVPAPLQHDQN 451
            + T +R   S   +      +Q   N PG+P  V PGI      R +  + PA  Q DQ 
Sbjct: 276  AHTHDRTWPSLAYYRQQHRFNQQRFNRPGVPALVVPGI------RGVAPMTPAVPQPDQT 329

Query: 450  GGFYVFP-PSSSGQNIPEAENPMLNYFNVWERERSSRLQSVSRDSNWGSF 304
            GGFY++P  SSSGQN+PEAE+   N +   ERER S  +++SR + WG++
Sbjct: 330  GGFYIYPRSSSSGQNLPEAESSYPNNYIALERERLSHFRTMSRVTGWGAY 379


>ref|XP_012474960.1| PREDICTED: uncharacterized protein LOC105791441 isoform X3 [Gossypium
            raimondii]
          Length = 397

 Score =  405 bits (1041), Expect = e-110
 Identities = 216/411 (52%), Positives = 266/411 (64%), Gaps = 15/411 (3%)
 Frame = -1

Query: 1491 IVGSKAT----DHNPHQMXXXXXXXXXXDPP--STGAACSICLDLISDNGVRSRAKLQCG 1330
            +V SKA+    DH+ H              P  S+  +CSICLDL+SD G RSRAKL CG
Sbjct: 1    MVRSKASLLDLDHDDHHQLMDGPDGDVSAVPLSSSDISCSICLDLVSDTGGRSRAKLLCG 60

Query: 1329 HEYHLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGSTSSLP-LSMEDWIPDEDLYDLSY 1153
            H++HLDCIGSAFNMKGAMQCPNCR++EKGQWLYANGS  SLP L+MEDW  D+D Y+  Y
Sbjct: 61   HQFHLDCIGSAFNMKGAMQCPNCRKVEKGQWLYANGSNRSLPELTMEDWNLDDDYYEPVY 120

Query: 1152 SEMP-FRVHWCPFGEFARLGSSFEEVEPPSTTYHDLRGQNTVFSEHTGASSMAHSYVAYV 976
            SEM  FR  WCP+GEF R+GSS EEVE PSTTYH++ G + +F+EH  ASS+AHSYVAYV
Sbjct: 121  SEMQQFRAQWCPYGEFTRIGSSSEEVESPSTTYHEIHGHHAIFAEHAAASSVAHSYVAYV 180

Query: 975  GPVPATTSRSSNNVDDPHLNHHWNGLSGGNEIFTPHAFPYINIQYYHWGRHSPPFSISSS 796
            GP+P+TT R+S++VDDP+ N HWN LSG NEIF PHAFP I IQY+              
Sbjct: 181  GPLPSTTLRNSDSVDDPNFNRHWNILSGHNEIFIPHAFPTIRIQYH-------------- 226

Query: 795  HMNGVEPNSSLHATLRSSHGESDATPIPRSFHHPLVFDHGSGPRAGSSFVSSVVLRRPGS 616
                        A LRSS+GE DA+ +PR F HP  F+HGS  R GSSFVSSV    PGS
Sbjct: 227  -----------TAALRSSNGEPDASTVPRLFGHPFPFEHGSSSRGGSSFVSSVFHHHPGS 275

Query: 615  SSLTPERIQVSHVLH------HQPSSNSPGLPTPVFPGIRRFDGSRSLPAVVPAPLQHDQ 454
             + T +R   S   +      +Q   N PG+P  V PGI      R +  + PA  Q DQ
Sbjct: 276  GAHTHDRTWPSLAYYRQQHRFNQQRFNRPGVPALVVPGI------RGVAPMTPAVPQPDQ 329

Query: 453  NGGFYVFP-PSSSGQNIPEAENPMLNYFNVWERERSSRLQSVSRDSNWGSF 304
             GGFY++P  SSSGQN+PEAE+   N +   ERER S  +++SR + WG++
Sbjct: 330  TGGFYIYPRSSSSGQNLPEAESSYPNNYIALERERLSHFRTMSRVTGWGAY 380


>ref|XP_007031010.1| RING/U-box superfamily protein isoform 1 [Theobroma cacao]
            gi|508719615|gb|EOY11512.1| RING/U-box superfamily
            protein isoform 1 [Theobroma cacao]
          Length = 437

 Score =  390 bits (1003), Expect = e-105
 Identities = 202/392 (51%), Positives = 249/392 (63%), Gaps = 7/392 (1%)
 Frame = -1

Query: 1398 ACSICLDLISDNGVRSRAKLQCGHEYHLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGS 1219
            +CSICL+ ++DNG RS AKLQCGH++HLDCIGSAFN+KGAMQCPNCR+IEKGQWLYANG 
Sbjct: 36   SCSICLETVADNGDRSWAKLQCGHQFHLDCIGSAFNIKGAMQCPNCRKIEKGQWLYANGC 95

Query: 1218 TSSLPLSMEDWIPDEDLYDLSYSEMPFRVHWCPFGEFARLGSSFEEVEPPSTTYHDLRGQ 1039
             S    S++DW  DEDLYDLSYSEM F VHWCPFG  ARL SSFEE E  STTYH+L GQ
Sbjct: 96   RSYPEFSVDDWTHDEDLYDLSYSEMSFGVHWCPFGSVARLPSSFEEGEFSSTTYHELLGQ 155

Query: 1038 NTVFSEHTGASSMAH--SYVAYVGPV--PATTSRSSNNVDDPHLNHHWNGLSGGNEIFTP 871
            + +F+EH+  SS +H   YVAY GP   P++++ S +  D  + N HWNG S  +EI T 
Sbjct: 156  HAIFAEHSAVSSASHPCPYVAYFGPTIHPSSSNSSGSVSDSSNFNSHWNGPSVPSEIPTS 215

Query: 870  HAFPYINIQYYHWGRHSPPFSISSSHMNGVEPNSSLHATLRSSHGESDATPIPRSFHHPL 691
            +AFP +++ Y+ W  HSPPFS SSS +   +  S    + RSS   +D  P   SF HP 
Sbjct: 216  YAFPAVDLHYHGWEHHSPPFSTSSSRIGSSDQPSIPPVSQRSSRSSTD-MPRSGSFMHPF 274

Query: 690  VFDHGSGPRAGSSFVSSVVLRRPGSSSLTPERIQVSHVLHHQPS-SNSPGLPTPVFPGIR 514
            V  H SG RAGSS  SS++   PGS++   +R+Q     + Q   S SP + TP+ PG R
Sbjct: 275  VVGHSSGARAGSSVASSLIPPYPGSNARARDRVQALQAYYQQQQPSTSPAIRTPIIPGSR 334

Query: 513  RFDGSRSLPAVVPAPLQHDQNGGFYVFPPSSSGQNIPEAENPMLNYFNVWERER--SSRL 340
            R    RSL  V P     DQ GGFY  P  +SG+N  EAENP+   F+ WER+   S  L
Sbjct: 335  RSSSHRSLAQVGPVASSSDQVGGFYFVPSGTSGRNFQEAENPLSTRFHAWERDHLPSFSL 394

Query: 339  QSVSRDSNWGSFHQXXXXXXXXXXXXSFWHRH 244
              V RDS WG+FHQ            SF  RH
Sbjct: 395  NQVDRDSGWGAFHQAAGGSDPGIRSSSFRQRH 426


>ref|XP_004302386.1| PREDICTED: uncharacterized protein LOC101307805 [Fragaria vesca
            subsp. vesca]
          Length = 436

 Score =  386 bits (992), Expect = e-104
 Identities = 193/374 (51%), Positives = 246/374 (65%), Gaps = 8/374 (2%)
 Frame = -1

Query: 1398 ACSICLDLISDNGVRSRAKLQCGHEYHLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGS 1219
            +CSICL++++DNG RS AKLQCGH++HLDCIGSAFN+KGAMQCPNCR+IEKGQWLYANG 
Sbjct: 36   SCSICLEVVADNGDRSWAKLQCGHQFHLDCIGSAFNIKGAMQCPNCRKIEKGQWLYANGC 95

Query: 1218 TSSLPLSMEDWIPDEDLYDLSYSEMPFRVHWCPFGEFARLGSSFEEVEPPSTTYHDLRGQ 1039
             S    SM+DW  DEDLYDLSYSEM F VHWCPFG  ARL SSFEE E   T+YHDL GQ
Sbjct: 96   RSIPEFSMDDWNHDEDLYDLSYSEMSFGVHWCPFGSLARLPSSFEEGEFSPTSYHDLLGQ 155

Query: 1038 NTVFSEHTGASSMAH--SYVAYVGPVPATTSRSSNNVDD-PHLNHHWNGLSGGNEIFTPH 868
            + +F+EHT  SS  H   Y+AY GPV  ++S SS ++ +  + NHHW G +  +E+ + +
Sbjct: 156  HAIFAEHTAVSSAGHPCPYIAYFGPVHPSSSNSSGSISEASNFNHHWTGTTLPSELPSSY 215

Query: 867  AFPYINIQYYHWGRHSPPFSISSSHMNGVEPNSSLHATLRSSHGESDATPIPR--SFHHP 694
            AFP +++ Y  W  HSPPFS +S+H+ GV+       T RS+   S+   IPR  SF HP
Sbjct: 216  AFPAMDLHYPSWDHHSPPFSTNSNHIGGVDQTPIPSVTQRSARPSSE---IPRSGSFMHP 272

Query: 693  LVFDHGSGPRAGSSFVSSVVLRRPGSSSLTPERIQVSHVLHHQPS-SNSPGLPTPVFPGI 517
             +  H S  RAGSS +SS++   PGS++   +R+Q     + Q   SNSP + TP+  G 
Sbjct: 273  FLVGHSSSARAGSSVMSSMIPPYPGSNARARDRVQALQAYYQQQQPSNSPTMRTPMISGA 332

Query: 516  RRFDGSRSLPAVVPAPLQHDQNGGFYVFPPSSSGQNIPEAENPMLNYFNVWERER--SSR 343
            RR    R LP V P     DQNGGFY +P  SSG+N  EAENP+   F+ WER+   S  
Sbjct: 333  RRTSNQRGLPQVGPVASSSDQNGGFYFYPSGSSGRNYQEAENPLSTRFHAWERDHLPSFS 392

Query: 342  LQSVSRDSNWGSFH 301
            +  V RD  W + H
Sbjct: 393  MNQVDRDPGWAALH 406


>ref|XP_008370422.1| PREDICTED: uncharacterized protein LOC103433908 [Malus domestica]
          Length = 434

 Score =  385 bits (990), Expect = e-104
 Identities = 197/394 (50%), Positives = 253/394 (64%), Gaps = 9/394 (2%)
 Frame = -1

Query: 1398 ACSICLDLISDNGVRSRAKLQCGHEYHLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGS 1219
            +CSICL++++D G RS AKLQCGH++HLDCIGSAFN+KGAMQCPNCR+IEKGQWLY+NG 
Sbjct: 33   SCSICLEVVADKGDRSWAKLQCGHQFHLDCIGSAFNVKGAMQCPNCRKIEKGQWLYSNGC 92

Query: 1218 TSSLPLSMEDWIPDEDLYDLSYSEMPFRVHWCPFGEFARLGSSFEEVEPPSTTYHDLRGQ 1039
             S    SM+DW  DEDLYDLSYSEM F VHWCPFG  ARL SSFEE E   T+YH+L GQ
Sbjct: 93   RSFPEFSMDDWTHDEDLYDLSYSEMSFGVHWCPFGSLARLPSSFEEGEFSPTSYHELLGQ 152

Query: 1038 NTVFSEHTGASSMAH--SYVAYVGPVPATTSRSSNNVDD-PHLNHHWNGLSGGNEIFTPH 868
            + +F+EHT  SS AH   Y+AY GP+  ++S SS NV +  + NHHW+G S  +E+   +
Sbjct: 153  HAIFAEHTAVSSAAHPCPYIAYFGPIHPSSSNSSGNVSEASNFNHHWSGTSVPSEMPNSY 212

Query: 867  AFPYINIQYYHWGRHS-PPFSISSSHMNGVEPNSSLHATLRSSHGESDATPIPR--SFHH 697
            AFP +++ Y+ W  HS PPFS +++H+ G +  S    T RS+   +D   IPR  SF H
Sbjct: 213  AFPAMDLHYHSWEHHSPPPFSTTNNHIGGADQASVPSVTQRSARPSAD---IPRSGSFMH 269

Query: 696  PLVFDHGSGPRAGSSFVSSVVLRRPGSSSLTPERIQVSHVLHHQPS-SNSPGLPTPVFPG 520
            P +  H S  RAGSS  SS++   PGS++   +R+Q     + Q   +NSP + TP+ PG
Sbjct: 270  PFLVGHSSSARAGSSVTSSMIPPYPGSNARARDRVQALQAYYQQQQPNNSPTMRTPIVPG 329

Query: 519  IRRFDGSRSLPAVVPAPLQHDQNGGFYVFPPSSSGQNIPEAENPMLNYFNVWERER--SS 346
             RR    R +  V P     DQNGGFY FP  SSG+N  EAENP+ N F+ WER+   S 
Sbjct: 330  ARRSSSQRGVAQVGPVASSSDQNGGFYFFPSGSSGRNYQEAENPLPNRFHPWERDHMPSF 389

Query: 345  RLQSVSRDSNWGSFHQXXXXXXXXXXXXSFWHRH 244
             +  V RD  W +F+Q            SF  RH
Sbjct: 390  SMNQVDRDQGWSAFNQGGSGSDSAIRGSSFRQRH 423


>ref|XP_007205224.1| hypothetical protein PRUPE_ppa005981mg [Prunus persica]
            gi|462400866|gb|EMJ06423.1| hypothetical protein
            PRUPE_ppa005981mg [Prunus persica]
          Length = 434

 Score =  385 bits (990), Expect = e-104
 Identities = 196/392 (50%), Positives = 247/392 (63%), Gaps = 7/392 (1%)
 Frame = -1

Query: 1398 ACSICLDLISDNGVRSRAKLQCGHEYHLDCIGSAFNMKGAMQCPNCRRIEKGQWLYANGS 1219
            +CSICL++++DNG RS AKLQCGH++HLDCIGSAFN+KGAMQCPNCR+IEKGQWLYANG 
Sbjct: 33   SCSICLEVVADNGDRSWAKLQCGHQFHLDCIGSAFNIKGAMQCPNCRKIEKGQWLYANGY 92

Query: 1218 TSSLPLSMEDWIPDEDLYDLSYSEMPFRVHWCPFGEFARLGSSFEEVEPPSTTYHDLRGQ 1039
             S     ++DW  DEDLYDL YSEM F VHWCPFG  ARL SSFEE E   T+YHDL GQ
Sbjct: 93   RSIQEFGVDDWTHDEDLYDLGYSEMSFGVHWCPFGSLARLPSSFEEGEFSPTSYHDLLGQ 152

Query: 1038 NTVFSEHTGASSMAH--SYVAYVGPVPATTSRSSNNVDD-PHLNHHWNGLSGGNEIFTPH 868
            + +F+EHT  SS  H   Y+AY GP+  ++S SS +V +  + NHHW+G S  +EI   +
Sbjct: 153  HAIFAEHTAVSSAGHPCPYIAYFGPIHPSSSNSSGSVSEASNFNHHWSGTSVPSEIPNSY 212

Query: 867  AFPYINIQYYHWGRHS-PPFSISSSHMNGVEPNSSLHATLRSSHGESDATPIPRSFHHPL 691
            AFP +++ Y+ W  HS PPFS +++H+ G E  S    T RS+   SD  P   SF HP 
Sbjct: 213  AFPAMDLHYHSWEHHSPPPFSTTNNHIGGAEQGSIPSVTQRSARPSSD-LPRSGSFMHPF 271

Query: 690  VFDHGSGPRAGSSFVSSVVLRRPGSSSLTPERIQVSHVLHHQPS-SNSPGLPTPVFPGIR 514
            +  H S  RAGSS  SS++   PGS++   +R+Q     + Q   SNSP + TP+  G R
Sbjct: 272  LVGHSSSARAGSSVTSSMIPPYPGSNARARDRVQALQAYYQQQQPSNSPTMRTPIIQGAR 331

Query: 513  RFDGSRSLPAVVPAPLQHDQNGGFYVFPPSSSGQNIPEAENPMLNYFNVWERER--SSRL 340
            R    R +  V P     DQNGGFY FP  SSG+N  EAENP+ N F+ WER+   S  +
Sbjct: 332  RSSSQRGVAQVGPVASSSDQNGGFYFFPSGSSGRNYQEAENPLPNRFHAWERDHLPSFSM 391

Query: 339  QSVSRDSNWGSFHQXXXXXXXXXXXXSFWHRH 244
              V RD  W + HQ            SF  RH
Sbjct: 392  NQVDRDQGWAAVHQGGSGSDSAMRGNSFRQRH 423


Top