BLASTX nr result
ID: Cocculus23_contig00041362
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00041362
         (541 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_004494047.1| PREDICTED: uncharacterized protein LOC101494...   219   3e-55
ref|XP_007203877.1| hypothetical protein PRUPE_ppa005088mg [Prun...   219   3e-55
ref|XP_002530334.1| conserved hypothetical protein [Ricinus comm...   216   3e-54
ref|XP_004303543.1| PREDICTED: uncharacterized protein LOC101300...   216   4e-54
ref|XP_006481747.1| PREDICTED: uncharacterized protein LOC102616...   214   8e-54
ref|XP_006430166.1| hypothetical protein CICLE_v10011628mg [Citr...   213   2e-53
ref|XP_002308465.2| hypothetical protein POPTR_0006s22750g [Popu...   213   2e-53
ref|XP_007027791.1| GATA zinc finger domain-containing protein C...   213   3e-53
ref|XP_004246498.1| PREDICTED: uncharacterized protein LOC101249...   210   2e-52
ref|XP_006341098.1| PREDICTED: uncharacterized protein LOC102599...   208   6e-52
ref|XP_006403785.1| hypothetical protein EUTSA_v10010329mg [Eutr...   207   1e-51
ref|XP_006291061.1| hypothetical protein CARUB_v10017176mg [Caps...   207   1e-51
ref|XP_007145293.1| hypothetical protein PHAVU_007G226900g [Phas...   204   1e-50
ref|XP_004164825.1| PREDICTED: uncharacterized LOC101208906 [Cuc...   203   2e-50
ref|XP_004149221.1| PREDICTED: uncharacterized protein LOC101208...   203   2e-50
gb|EXB54583.1| hypothetical protein L484_019153 [Morus notabilis]     202   3e-50
ref|XP_007027792.1| GATA zinc finger domain-containing protein C...   199   5e-49
ref|XP_007027793.1| Uncharacterized protein TCM_022632 [Theobrom...
197 2e-48 gb|AAK59422.1| unknown protein [Arabidopsis thaliana] 197 2e-48 ref|NP_566970.1| uncharacterized protein [Arabidopsis thaliana] ... 197 2e-48 >ref|XP_004494047.1| PREDICTED: uncharacterized protein LOC101494882 [Cicer arietinum] Length = 464 Score = 219 bits (559), Expect = 3e-55 Identities = 105/175 (60%), Positives = 134/175 (76%) Frame = -3 Query: 527 NLPFVELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDT 348 NL +L+P GKT K W RGLD++GYSLN+ + NL FV++ S RSS VRLQL++D+T Sbjct: 192 NLAIEDLIPEGKTNKPFWARGLDVLGYSLNAFRFSNLSFVNVDSPRSSRMVRLQLNADET 251 Query: 347 LRLVSGCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHN 168 + L++GCK+RGIKLCGA++AAGM+AA+TSK LP+HQ EKY+V+TLID R LDP L +N Sbjct: 252 MSLLAGCKSRGIKLCGALAAAGMIAAWTSKHLPDHQTEKYAVVTLIDCRPILDPVLSSNN 311 Query: 167 FGFYHTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 GFYH+AIL+THDV G W+LA R YTSF A NKHF D++D+NFLMCK Sbjct: 312 CGFYHSAILNTHDVCGE---TLWELAKRSYTSFANAVKCNKHFSDMSDLNFLMCK 363 >ref|XP_007203877.1| hypothetical protein PRUPE_ppa005088mg [Prunus persica] gi|462399408|gb|EMJ05076.1| hypothetical protein PRUPE_ppa005088mg [Prunus persica] Length = 477 Score = 219 bits (559), Expect = 3e-55 Identities = 99/175 (56%), Positives = 136/175 (77%) Frame = -3 Query: 527 NLPFVELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDT 348 +L +L+P+GK K W RG+D++GYSLNSL+ NL F D SAR S V+LQL+ D Sbjct: 202 SLGIEDLIPNGKANKPFWARGVDMLGYSLNSLRLSNLDFKDASSARRSRVVKLQLNPHDC 261 Query: 347 LRLVSGCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHN 168 RL++GCK+R IKL GA++AAG++A + SK LP+HQWEKY+V+TL+D RS L+PPL +N Sbjct: 262 QRLLAGCKSREIKLSGALAAAGLIAVHASKHLPDHQWEKYAVVTLLDCRSILEPPLSSNN 321 Query: 167 FGFYHTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 GFYH+AI++THD++GG ++ W+LA RC+ +F AKNSNKHF D++D+NFLMCK Sbjct: 322 LGFYHSAIMNTHDINGGNTL--WELAKRCHIAFANAKNSNKHFTDMSDLNFLMCK 374 >ref|XP_002530334.1| conserved hypothetical protein [Ricinus communis] gi|223530138|gb|EEF32050.1| conserved hypothetical protein [Ricinus communis] 
Length = 474 Score = 216 bits (550), Expect = 3e-54 Identities = 98/174 (56%), Positives = 131/174 (75%) Frame = -3 Query: 524 LPFVELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDTL 345 +P + +P GK+ K W RG+D+VGYSLNS + NL F+D SAR S +RLQ++SD T Sbjct: 197 VPIEDCIPDGKSSKWFWARGMDVVGYSLNSFRLANLNFIDASSARRSQVIRLQINSDQTF 256 Query: 344 RLVSGCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHNF 165 +LV GCK+RGIKLCGA++AAG++AA+++K LP Q KY+V+TL+D RS LDP L HN Sbjct: 257 KLVEGCKSRGIKLCGALAAAGLIAAHSTKDLPHDQSHKYAVVTLVDCRSILDPVLSSHNL 316 Query: 164 GFYHTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 GFYH+AIL+THD++GG + W++A RCY SF AK +NKHF D+ D+NFLM K Sbjct: 317 GFYHSAILNTHDINGGDKL--WEVAQRCYMSFANAKKNNKHFTDMGDLNFLMGK 368 >ref|XP_004303543.1| PREDICTED: uncharacterized protein LOC101300265 [Fragaria vesca subsp. vesca] Length = 471 Score = 216 bits (549), Expect = 4e-54 Identities = 91/170 (53%), Positives = 137/170 (80%) Frame = -3 Query: 512 ELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDTLRLVS 333 +L+P+GK K W RG+D++GYSLNSL+ NL F D+ +++ V+L+++ + T +L++ Sbjct: 202 DLIPNGKASKPFWARGVDMLGYSLNSLRLSNLEFKDVSLEKTTQMVKLRINPEHTDKLLA 261 Query: 332 GCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHNFGFYH 153 GCK++GIKLCG ++AAG++AA+ SK LP+HQWEKY+V+TL+D R LDPPL ++ GFYH Sbjct: 262 GCKSKGIKLCGVLAAAGLIAAHASKHLPDHQWEKYAVVTLLDCRPLLDPPLSANDLGFYH 321 Query: 152 TAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 +AI+++HD++G ++ W+LA RCYT+F +AKNSNKHF D++D+NFLMCK Sbjct: 322 SAIVNSHDINGENTL--WELAKRCYTAFADAKNSNKHFSDMSDLNFLMCK 369 >ref|XP_006481747.1| PREDICTED: uncharacterized protein LOC102616692 isoform X1 [Citrus sinensis] Length = 478 Score = 214 bits (546), Expect = 8e-54 Identities = 103/175 (58%), Positives = 131/175 (74%) Frame = -3 Query: 527 NLPFVELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDT 348 +L E +PSGK K W RG+D++GYSLNSL+ N+ FVD S R S +RLQL+ D+T Sbjct: 201 SLGIEEFIPSGKANKPFWARGVDMLGYSLNSLRLSNISFVDADSPRFSQVLRLQLNRDET 260 Query: 347 
LRLVSGCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHN 168 RLV GCK+RGIKLCGA++AAG++AA ++K P HQ EKY+V+TL+D RS L+P L Sbjct: 261 ARLVEGCKSRGIKLCGALAAAGLIAARSTKYFPSHQREKYAVVTLVDCRSILEPVLSDDY 320 Query: 167 FGFYHTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 GFYH+AIL+THDV+ G E W+LA R YTSF AKNS+KHF D+ND+NFLMCK Sbjct: 321 LGFYHSAILNTHDVN--GEEELWELATRSYTSFANAKNSDKHFTDMNDLNFLMCK 373 >ref|XP_006430166.1| hypothetical protein CICLE_v10011628mg [Citrus clementina] gi|557532223|gb|ESR43406.1| hypothetical protein CICLE_v10011628mg [Citrus clementina] Length = 478 Score = 213 bits (543), Expect = 2e-53 Identities = 103/175 (58%), Positives = 131/175 (74%) Frame = -3 Query: 527 NLPFVELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDT 348 +L E +PSGK K W RG+D++GYSLNSL+ N+ FVD S R S +RLQL+ D+T Sbjct: 201 SLGIEEFIPSGKANKPFWARGVDMLGYSLNSLRLSNISFVDADSPRFSQVLRLQLNRDET 260 Query: 347 LRLVSGCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHN 168 RLV GCK+RGIKLCGA++AAG++AA ++K P HQ EKY+V+TL+D RS L+P L Sbjct: 261 GRLVEGCKSRGIKLCGALAAAGLIAARSTKYFPSHQREKYAVVTLVDCRSILEPVLSDDY 320 Query: 167 FGFYHTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 GFYH+AIL+THDV+ G E W+LA R YTSF AKNS+KHF D+ND+NFLMCK Sbjct: 321 LGFYHSAILNTHDVN--GEEELWELATRSYTSFANAKNSDKHFTDMNDLNFLMCK 373 >ref|XP_002308465.2| hypothetical protein POPTR_0006s22750g [Populus trichocarpa] gi|550336884|gb|EEE91988.2| hypothetical protein POPTR_0006s22750g [Populus trichocarpa] Length = 478 Score = 213 bits (543), Expect = 2e-53 Identities = 100/175 (57%), Positives = 131/175 (74%) Frame = -3 Query: 527 NLPFVELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDT 348 +L + +PSGK K W RG+D++GYSLNS + NL FVD S R S VRLQ++SDDT Sbjct: 203 SLGIEDYIPSGKGNKPFWARGIDMLGYSLNSFRLSNLDFVDADSPRGSQVVRLQMNSDDT 262 Query: 347 LRLVSGCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHN 168 +L+ GC +RGIKL GA++AAG++AA ++K LP+HQ EKY+V+TLID RS LDP L + Sbjct: 263 QKLLDGCMSRGIKLSGALAAAGLIAAQSTKDLPDHQMEKYAVVTLIDCRSILDPVLSGDH 322 Query: 167 
FGFYHTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 GFYH+A+L+THDV GG + WDLA RCY ++ AKN+NKHF D+ D+NFLMCK Sbjct: 323 IGFYHSAMLNTHDVSGG--VMLWDLAKRCYMAYTNAKNNNKHFTDMGDLNFLMCK 375 >ref|XP_007027791.1| GATA zinc finger domain-containing protein C1393.08 isoform 1 [Theobroma cacao] gi|508716396|gb|EOY08293.1| GATA zinc finger domain-containing protein C1393.08 isoform 1 [Theobroma cacao] Length = 478 Score = 213 bits (541), Expect = 3e-53 Identities = 102/175 (58%), Positives = 132/175 (75%) Frame = -3 Query: 527 NLPFVELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDT 348 +L +L+PSGK K W RG+D++GYSLNS + NL FVD SAR S VRLQ++ D+T Sbjct: 203 SLGIEDLIPSGKANKPFWARGVDMLGYSLNSFRLANLNFVDANSARRSQVVRLQMNPDET 262 Query: 347 LRLVSGCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHN 168 LV+GCK+RGIKLCGA++AAG++AA ++K PEHQ EKY+V+TL D RS LDP L ++ Sbjct: 263 DGLVAGCKSRGIKLCGALAAAGLIAARSTKAYPEHQREKYAVVTLTDCRSILDPVLGSNH 322 Query: 167 FGFYHTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 GFYH+AIL+THDV + W+LA RCY SF AKN++KHF D+ND+NFLMCK Sbjct: 323 LGFYHSAILNTHDVTAHEQV--WELARRCYMSFSNAKNNDKHFTDMNDLNFLMCK 375 >ref|XP_004246498.1| PREDICTED: uncharacterized protein LOC101249753 [Solanum lycopersicum] Length = 472 Score = 210 bits (534), Expect = 2e-52 Identities = 98/170 (57%), Positives = 127/170 (74%) Frame = -3 Query: 512 ELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDTLRLVS 333 E +P GK K W RG+D+VGY LNSL+ CNL F+D S R S V+LQL+ +T ++ Sbjct: 210 EYIPDGKASKPFWARGIDMVGYGLNSLRFCNLKFMDSESTRGSQVVKLQLNKQETDHILD 269 Query: 332 GCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHNFGFYH 153 GCKTRGIKLCG ++AAG++AA++ K L E+QWEKY+++TLI+ RS LDP L P GFYH Sbjct: 270 GCKTRGIKLCGLLAAAGLIAAHSLKGLKENQWEKYAIVTLINCRSILDPVLSPDFPGFYH 329 Query: 152 TAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 +AIL+THDV GG + W+LA R YTSF AKN+NKHF D+ D+NFLMC+ Sbjct: 330 SAILNTHDVKGGD--DLWELAKRSYTSFINAKNNNKHFTDMGDLNFLMCR 377 >ref|XP_006341098.1| PREDICTED: uncharacterized protein LOC102599218 [Solanum 
tuberosum] Length = 470 Score = 208 bits (530), Expect = 6e-52 Identities = 98/170 (57%), Positives = 129/170 (75%) Frame = -3 Query: 512 ELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDTLRLVS 333 E +P+GK K W RG+D+VGY LNSL+ NL F+D S R S V+LQL+ ++T R++ Sbjct: 208 EYIPAGKASKPFWARGIDMVGYGLNSLRFSNLKFMDSESTRGSQVVKLQLNKEETDRILD 267 Query: 332 GCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHNFGFYH 153 GCKTR IKLCG ++AAG++AA++SK L E+QWEKY+++TLI+ RS LDP L P GFYH Sbjct: 268 GCKTRDIKLCGLLAAAGLIAAHSSKGLNENQWEKYAIVTLINCRSILDPVLSPDFPGFYH 327 Query: 152 TAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 +AIL+THDV GG + W+LA R YTSF AKN+NKHF D+ D+NFLMC+ Sbjct: 328 SAILNTHDVKGGD--DLWELAKRSYTSFINAKNNNKHFTDMGDLNFLMCR 375 >ref|XP_006403785.1| hypothetical protein EUTSA_v10010329mg [Eutrema salsugineum] gi|557104904|gb|ESQ45238.1| hypothetical protein EUTSA_v10010329mg [Eutrema salsugineum] Length = 477 Score = 207 bits (527), Expect = 1e-51 Identities = 100/171 (58%), Positives = 131/171 (76%), Gaps = 1/171 (0%) Frame = -3 Query: 512 ELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDI-CSARSSTFVRLQLSSDDTLRLV 336 E+VPSGK K W RG+D++GYSLN+ + NL FVD S R S VR++L D+TL+LV Sbjct: 208 EMVPSGKGNKPFWARGIDVLGYSLNAFRFSNLSFVDAEDSNRRSQVVRMKLERDETLKLV 267 Query: 335 SGCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHNFGFY 156 +GCK RGIKL AI+A+G+++AY+SK+LP++Q EKY+V+TL D RS L+PPL P++FGFY Sbjct: 268 AGCKARGIKLWAAIAASGLISAYSSKKLPQNQGEKYAVVTLSDCRSILEPPLTPNDFGFY 327 Query: 155 HTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 H+ IL THD+ G + WDLA RCY SF AKNSNK F D++D+NFLMCK Sbjct: 328 HSGILHTHDI--TGEEKLWDLAKRCYDSFTSAKNSNKQFTDMSDLNFLMCK 376 >ref|XP_006291061.1| hypothetical protein CARUB_v10017176mg [Capsella rubella] gi|482559768|gb|EOA23959.1| hypothetical protein CARUB_v10017176mg [Capsella rubella] Length = 469 Score = 207 bits (527), Expect = 1e-51 Identities = 101/171 (59%), Positives = 129/171 (75%), Gaps = 1/171 (0%) Frame = -3 Query: 512 
ELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDIC-SARSSTFVRLQLSSDDTLRLV 336 EL+PSGK K W RG+D++GYSLN+ + NL FVD S R S VRL+L D TL+LV Sbjct: 200 ELIPSGKGNKPFWARGIDVLGYSLNAFRFSNLNFVDADDSNRRSQVVRLRLERDQTLKLV 259 Query: 335 SGCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHNFGFY 156 +GCK RGIKL A++++G++AAY+SK+LP +Q EKY+V+TL D RS LDPPL ++FGFY Sbjct: 260 AGCKARGIKLWAALASSGLIAAYSSKKLPPYQGEKYAVVTLSDCRSILDPPLTSNDFGFY 319 Query: 155 HTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 H IL THD+ G ++ WDLA RCY SF AKNSNKHF D++D+NFLMCK Sbjct: 320 HAGILHTHDITGEETL--WDLAKRCYDSFTSAKNSNKHFTDMSDLNFLMCK 368 >ref|XP_007145293.1| hypothetical protein PHAVU_007G226900g [Phaseolus vulgaris] gi|593689350|ref|XP_007145294.1| hypothetical protein PHAVU_007G226900g [Phaseolus vulgaris] gi|561018483|gb|ESW17287.1| hypothetical protein PHAVU_007G226900g [Phaseolus vulgaris] gi|561018484|gb|ESW17288.1| hypothetical protein PHAVU_007G226900g [Phaseolus vulgaris] Length = 471 Score = 204 bits (518), Expect = 1e-50 Identities = 99/175 (56%), Positives = 129/175 (73%) Frame = -3 Query: 527 NLPFVELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDT 348 +L +L+P GK K W RGLD++GYS N+L+ NL FVD S R S VRLQL++++T Sbjct: 195 SLAIEDLIPEGKMHKPFWARGLDVLGYSFNALRFSNLNFVDAASLRRSRIVRLQLNAEET 254 Query: 347 LRLVSGCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHN 168 L++GCK+RGIKLCGA++AAGMMAA+TSK LP +Q EKY+V+TL+D R LDP L ++ Sbjct: 255 KNLLAGCKSRGIKLCGALAAAGMMAAWTSKCLPNYQREKYAVVTLVDCRPLLDPVLPSNH 314 Query: 167 FGFYHTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 GFYH+AIL+THDV W+LA R YTSF A N NKHF D++D+N+LMCK Sbjct: 315 AGFYHSAILNTHDV---CEEALWELAKRSYTSFINAMNCNKHFSDMSDLNYLMCK 366 >ref|XP_004164825.1| PREDICTED: uncharacterized LOC101208906 [Cucumis sativus] Length = 526 Score = 203 bits (516), Expect = 2e-50 Identities = 94/171 (54%), Positives = 131/171 (76%), Gaps = 1/171 (0%) Frame = -3 Query: 512 ELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDTLRLVS 333 +L+P+GK K+LW RG D++GYSLNS + NL F D + R S +RL+++SD+T 
+L++ Sbjct: 256 DLIPNGKANKSLWARGFDMLGYSLNSFRLANLEFKDPNTERFSQMIRLRMNSDETQKLLA 315 Query: 332 GCKTRGIKLCGAISAAGMMAAYTSK-QLPEHQWEKYSVITLIDSRSSLDPPLHPHNFGFY 156 GCK RGIKLCGA++AAG++A SK LP +Q EKY+V+TL D RS LDPPL H+ GFY Sbjct: 316 GCKLRGIKLCGALAAAGLIATRCSKDHLPPYQKEKYAVVTLNDCRSLLDPPLTSHHLGFY 375 Query: 155 HTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 H+AIL+THD+ ++ W++A+RCY SF AK++NKHF D++D+NFLMCK Sbjct: 376 HSAILNTHDISAEDTV--WEVASRCYFSFSNAKDNNKHFSDMSDLNFLMCK 424 >ref|XP_004149221.1| PREDICTED: uncharacterized protein LOC101208906 [Cucumis sativus] Length = 526 Score = 203 bits (516), Expect = 2e-50 Identities = 94/171 (54%), Positives = 131/171 (76%), Gaps = 1/171 (0%) Frame = -3 Query: 512 ELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDTLRLVS 333 +L+P+GK K+LW RG D++GYSLNS + NL F D + R S +RL+++SD+T +L++ Sbjct: 256 DLIPNGKANKSLWARGFDMLGYSLNSFRLANLEFKDPNTERFSQMIRLRMNSDETQKLLA 315 Query: 332 GCKTRGIKLCGAISAAGMMAAYTSK-QLPEHQWEKYSVITLIDSRSSLDPPLHPHNFGFY 156 GCK RGIKLCGA++AAG++A SK LP +Q EKY+V+TL D RS LDPPL H+ GFY Sbjct: 316 GCKLRGIKLCGALAAAGLIATRCSKDHLPPYQKEKYAVVTLNDCRSLLDPPLTSHHLGFY 375 Query: 155 HTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 H+AIL+THD+ ++ W++A+RCY SF AK++NKHF D++D+NFLMCK Sbjct: 376 HSAILNTHDISAEDTV--WEVASRCYFSFSNAKDNNKHFSDMSDLNFLMCK 424 >gb|EXB54583.1| hypothetical protein L484_019153 [Morus notabilis] Length = 442 Score = 202 bits (515), Expect = 3e-50 Identities = 95/175 (54%), Positives = 129/175 (73%) Frame = -3 Query: 527 NLPFVELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDT 348 +L E++P GK K W RG+D++GYSLN+ + NL F D S RSS VRLQ++ DT Sbjct: 167 SLGIEEIIPKGKADKPFWARGVDVLGYSLNAFRLSNLEFRDAVSPRSSRVVRLQINRRDT 226 Query: 347 LRLVSGCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHN 168 RL+ GCK+R IKLCGA++AAG++AA +SK LP+HQ EKY V+TL + RS L+PPL + Sbjct: 227 ERLLEGCKSREIKLCGALAAAGLIAARSSKNLPDHQREKYGVVTLTNCRSILEPPLSSQH 286 Query: 167 FGFYHTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 GFYH+AIL+THD+ GG ++ W+LA R Y +F AK +NKHF 
D++D+N+LMCK Sbjct: 287 LGFYHSAILNTHDITGGETL--WELATRTYMAFANAKENNKHFTDMSDLNYLMCK 339 >ref|XP_007027792.1| GATA zinc finger domain-containing protein C1393.08 isoform 2 [Theobroma cacao] gi|508716397|gb|EOY08294.1| GATA zinc finger domain-containing protein C1393.08 isoform 2 [Theobroma cacao] Length = 481 Score = 199 bits (505), Expect = 5e-49 Identities = 99/178 (55%), Positives = 130/178 (73%), Gaps = 3/178 (1%) Frame = -3 Query: 527 NLPFVELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDT 348 +L +L+PSGK K W RG+D++GYSLNS + NL FVD SAR S VRLQ++ D+T Sbjct: 203 SLGIEDLIPSGKANKPFWARGVDMLGYSLNSFRLANLNFVDANSARRSQVVRLQMNPDET 262 Query: 347 LRLVS---GCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLH 177 LV+ ++RGIKLCGA++AAG++AA ++K PEHQ EKY+V+TL D RS LDP L Sbjct: 263 DGLVAVSDELQSRGIKLCGALAAAGLIAARSTKAYPEHQREKYAVVTLTDCRSILDPVLG 322 Query: 176 PHNFGFYHTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 ++ GFYH+AIL+THDV + W+LA RCY SF AKN++KHF D+ND+NFLMCK Sbjct: 323 SNHLGFYHSAILNTHDVTAHEQV--WELARRCYMSFSNAKNNDKHFTDMNDLNFLMCK 378 >ref|XP_007027793.1| Uncharacterized protein TCM_022632 [Theobroma cacao] gi|508716398|gb|EOY08295.1| Uncharacterized protein TCM_022632 [Theobroma cacao] Length = 485 Score = 197 bits (500), Expect = 2e-48 Identities = 93/173 (53%), Positives = 126/173 (72%) Frame = -3 Query: 527 NLPFVELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDICSARSSTFVRLQLSSDDT 348 +L +L+P GK KK LW RG+D++GYS+NSL+ NL F D S RSS VRL +S DDT Sbjct: 209 SLAIEDLIPKGKAKKTLWARGVDMLGYSVNSLRLTNLKFKDAKSPRSSQVVRLLISPDDT 268 Query: 347 LRLVSGCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHN 168 R+++GCK RGIKLCGA+ AAG++AA+TS +HQ +KY ++TL D RS L+PPL H+ Sbjct: 269 ERILAGCKARGIKLCGALGAAGLIAAHTSNCRSDHQRKKYGIVTLTDCRSILEPPLSNHH 328 Query: 167 FGFYHTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLM 9 FGFYH+AIL+TH + G + W+LA + YT+F K+ N+HF D+ D+NFLM Sbjct: 329 FGFYHSAILNTHVIKGVEKL--WELAKKMYTAFTNYKSCNRHFSDMADLNFLM 379 >gb|AAK59422.1| unknown protein [Arabidopsis thaliana] Length = 475 Score = 197 bits (500), 
Expect = 2e-48 Identities = 96/171 (56%), Positives = 125/171 (73%), Gaps = 1/171 (0%) Frame = -3 Query: 512 ELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDI-CSARSSTFVRLQLSSDDTLRLV 336 EL+PSGK K W RG+D++GYSLN+ + NL FVD S R S VRL+L D TL+LV Sbjct: 206 ELIPSGKGDKPFWARGIDVLGYSLNAFRFSNLNFVDAENSNRRSQLVRLKLDRDQTLKLV 265 Query: 335 SGCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHNFGFY 156 +GCK RG+KL A++++ ++AAY+SK LP +Q EKY+V+TL D RS L+PPL ++FGFY Sbjct: 266 AGCKARGLKLWAALASSALIAAYSSKNLPPYQGEKYAVVTLSDCRSILEPPLTSNDFGFY 325 Query: 155 HTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 H IL THD+ G + WDLA RCY SF +KNSNK F D++D+NFLMCK Sbjct: 326 HAGILHTHDL--TGEEKLWDLAKRCYDSFTSSKNSNKQFTDMSDLNFLMCK 374 >ref|NP_566970.1| uncharacterized protein [Arabidopsis thaliana] gi|23296330|gb|AAN13043.1| unknown protein [Arabidopsis thaliana] gi|332645448|gb|AEE78969.1| uncharacterized protein AT3G52610 [Arabidopsis thaliana] Length = 475 Score = 197 bits (500), Expect = 2e-48 Identities = 96/171 (56%), Positives = 125/171 (73%), Gaps = 1/171 (0%) Frame = -3 Query: 512 ELVPSGKTKKALWTRGLDLVGYSLNSLKCCNLPFVDI-CSARSSTFVRLQLSSDDTLRLV 336 EL+PSGK K W RG+D++GYSLN+ + NL FVD S R S VRL+L D TL+LV Sbjct: 206 ELIPSGKGDKPFWARGIDVLGYSLNAFRFSNLNFVDAENSNRRSQLVRLKLDRDQTLKLV 265 Query: 335 SGCKTRGIKLCGAISAAGMMAAYTSKQLPEHQWEKYSVITLIDSRSSLDPPLHPHNFGFY 156 +GCK RG+KL A++++ ++AAY+SK LP +Q EKY+V+TL D RS L+PPL ++FGFY Sbjct: 266 AGCKARGLKLWAALASSALIAAYSSKNLPPYQGEKYAVVTLSDCRSILEPPLTSNDFGFY 325 Query: 155 HTAILDTHDVHGGGSIEFWDLANRCYTSFKEAKNSNKHFKDLNDMNFLMCK 3 H IL THD+ G + WDLA RCY SF +KNSNK F D++D+NFLMCK Sbjct: 326 HAGILHTHDL--TGEEKLWDLAKRCYDSFTSSKNSNKQFTDMSDLNFLMCK 374