BLASTX nr result

ID: Forsythia23_contig00008069

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00008069
         (1123 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011077851.1| PREDICTED: uncharacterized protein LOC105161...   341   e-114
ref|XP_009627514.1| PREDICTED: uncharacterized protein LOC104118...   294   e-101
ref|XP_006341098.1| PREDICTED: uncharacterized protein LOC102599...   293   e-100
ref|XP_009791810.1| PREDICTED: uncharacterized protein LOC104238...   292   e-100
emb|CDP15441.1| unnamed protein product [Coffea canephora]            294   8e-99
ref|XP_004246498.1| PREDICTED: uncharacterized protein LOC101249...   287   1e-98
ref|XP_009336970.1| PREDICTED: uncharacterized protein LOC103929...   301   5e-98
ref|XP_008387017.1| PREDICTED: uncharacterized protein LOC103449...   299   5e-98
ref|XP_007027791.1| GATA zinc finger domain-containing protein C...   300   9e-98
ref|XP_007203877.1| hypothetical protein PRUPE_ppa005088mg [Prun...   297   9e-98
ref|XP_009342999.1| PREDICTED: uncharacterized protein LOC103934...   300   1e-97
ref|XP_008243529.1| PREDICTED: uncharacterized protein LOC103341...   291   9e-97
ref|XP_002308465.2| hypothetical protein POPTR_0006s22750g [Popu...   294   5e-96
ref|XP_011004215.1| PREDICTED: uncharacterized protein LOC105110...   292   6e-96
ref|XP_002530334.1| conserved hypothetical protein [Ricinus comm...   287   1e-95
ref|XP_004303543.2| PREDICTED: uncharacterized protein LOC101300...   289   5e-95
ref|XP_007027792.1| GATA zinc finger domain-containing protein C...   286   1e-93
ref|XP_012485792.1| PREDICTED: uncharacterized protein LOC105799...   278   2e-92
emb|CAN71367.1| hypothetical protein VITISV_014691 [Vitis vinifera]   275   3e-92
ref|XP_006430166.1| hypothetical protein CICLE_v10011628mg [Citr...   272   4e-92

>ref|XP_011077851.1| PREDICTED: uncharacterized protein LOC105161749 [Sesamum indicum]
          Length = 468

 Score =  341 bits (875), Expect(2) = e-114
 Identities = 181/293 (61%), Positives = 207/293 (70%), Gaps = 8/293 (2%)
 Frame = -1

Query: 1048 VEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIHKLQIAHPILNS 869
            V  AEPKIRP G TE SWCKAVP GTGITV             LQNA+ KLQI+HPI+NS
Sbjct: 8    VHDAEPKIRPAGTTELSWCKAVPGGTGITVLALLLCKPLDIPFLQNALRKLQISHPIINS 67

Query: 868  QLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLCP----SVSPYQLLLEHELKTN 701
            +L FD  S+ FSY+TPPTP +++Q FDLPST+QIL++  P    SVSP+ L+LEHEL TN
Sbjct: 68   KLRFDSVSNTFSYVTPPTPHIRVQPFDLPSTSQILEAQFPEESNSVSPFHLILEHELNTN 127

Query: 700  SWQNLDSSLDTAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRRGLLETIG-GEAAA 524
            SWQN D S DT VFFASLYTLE  +WVLA RLHTSACDR +  AL R L+  +G GE   
Sbjct: 128  SWQNPDPSSDTDVFFASLYTLEKSKWVLALRLHTSACDRAAALALLRELMGLVGEGEGEG 187

Query: 523  VERV---EMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANLSFKDTESPKASQ 353
            VE     EME  LGIE YIPSG+ NKPFWARGVDMLGYSLNS RLANLSFKDTESP+ S+
Sbjct: 188  VETEQGKEMEVSLGIEEYIPSGQGNKPFWARGVDMLGYSLNSLRLANLSFKDTESPRMSR 247

Query: 352  VERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQWEKYAV 194
              RL +N+EDT  ILS C+ + IK            + S KGIPDD WEKYAV
Sbjct: 248  FIRLQMNTEDTCRILSSCERQEIKLCALLAAAALIASHSSKGIPDDYWEKYAV 300



 Score =  100 bits (248), Expect(2) = e-114
 Identities = 44/52 (84%), Positives = 47/52 (90%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAILNTHDIKGGEDLWELA R +TSF NAK N KHFSDMAD+NFLMC+
Sbjct: 320 GFYHSAILNTHDIKGGEDLWELAKRVHTSFTNAKNNKKHFSDMADINFLMCK 371


>ref|XP_009627514.1| PREDICTED: uncharacterized protein LOC104118047 [Nicotiana
            tomentosiformis]
          Length = 480

 Score =  294 bits (753), Expect(2) = e-101
 Identities = 165/317 (52%), Positives = 204/317 (64%), Gaps = 21/317 (6%)
 Frame = -1

Query: 1081 MSDQESSN--APPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNA 908
            M++QESS   APP E    + RP G TE+SWCKAVP GTGITV             LQNA
Sbjct: 1    MAEQESSTTAAPPCEF---QTRPAGNTEHSWCKAVPSGTGITVLALLLSKSPDTSLLQNA 57

Query: 907  IHKLQIAHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLC----PSVS 740
            +HKLQ ++PIL S+LH++ T++++SYI P T  LQIQ FDL STAQILQ L      S+S
Sbjct: 58   LHKLQNSNPILKSKLHYEPTTNSYSYIIPSTSHLQIQPFDLSSTAQILQQLQNPNHTSIS 117

Query: 739  PYQLLLEHELKTNSWQNLDSSL---DTAVFFASLYTLENERWVLAFRLHTSACDRTSMTA 569
             + L+LEHE+  NSW N  +S    DT VFFAS+Y L +E+W +  R+HTS CDR +  A
Sbjct: 118  DFHLILEHEINKNSWVNTATSSEISDTNVFFASIYRLSDEKWAVTLRIHTSMCDRAAALA 177

Query: 568  LRRGLLETIGGEAAAVER------VEMEFG------LGIENYIPSGKSNKPFWARGVDML 425
            L R LLE +  E            VE+E G      LGIE YIP+GK+NKPFWARG+DM+
Sbjct: 178  LLRKLLELMSSEKIQGMNENEGGGVELELGEKMEVVLGIEEYIPTGKANKPFWARGIDMV 237

Query: 424  GYSLNSFRLANLSFKDTESPKASQVERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXX 245
            GY LN+ R ANL F D+ES + SQV +L LN EDTD IL GCK+R+IK            
Sbjct: 238  GYGLNALRFANLKFIDSESTRGSQVVKLQLNKEDTDRILDGCKTRDIKLCGLLAAAGLMA 297

Query: 244  ARSFKGIPDDQWEKYAV 194
            A S KG+P+DQWEKYAV
Sbjct: 298  AHSSKGLPEDQWEKYAV 314



 Score =  102 bits (254), Expect(2) = e-101
 Identities = 44/52 (84%), Positives = 48/52 (92%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAILNTHD+KGGED+WELA R+YTSF NAK NNKHFSDM DLNFLMC+
Sbjct: 334 GFYHSAILNTHDVKGGEDIWELAMRSYTSFINAKNNNKHFSDMGDLNFLMCK 385


>ref|XP_006341098.1| PREDICTED: uncharacterized protein LOC102599218 [Solanum tuberosum]
          Length = 470

 Score =  293 bits (750), Expect(2) = e-100
 Identities = 160/306 (52%), Positives = 205/306 (66%), Gaps = 10/306 (3%)
 Frame = -1

Query: 1081 MSDQESSNAPPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIH 902
            M++Q S+ A P   +E KIR  G TE+SWCKAVP GTGITV             LQNA+H
Sbjct: 1    MAEQGSAAAAP--PSELKIRLAGSTEHSWCKAVPGGTGITVLALLLSKPPDISLLQNALH 58

Query: 901  KLQIAHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSL----CPSVSPY 734
            KLQ ++PIL SQLH++ +++++SYI P T  LQIQ FDL ST QIL+ L      SVS +
Sbjct: 59   KLQNSNPILKSQLHYESSTNSYSYIIPSTSHLQIQPFDLSSTVQILRRLKTSDLTSVSDF 118

Query: 733  QLLLEHELKTNSWQNLDSSLD--TAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRR 560
             L+LEHE+  NSW N  +S D  T +FFAS+Y LENE+ V+A R+HTSACDR +  AL +
Sbjct: 119  HLILEHEINQNSWINTGTSSDSDTDIFFASIYQLENEKSVVALRIHTSACDRAAALALLK 178

Query: 559  GLLETIGGEAAAVERVE----MEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLAN 392
             LLE + GE      +E    ME GLGIE YIP+GK++KPFWARG+DM+GY LNS R +N
Sbjct: 179  KLLELVSGEDGEGTELELLKKMEVGLGIEEYIPAGKASKPFWARGIDMVGYGLNSLRFSN 238

Query: 391  LSFKDTESPKASQVERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQ 212
            L F D+ES + SQV +L LN E+TD IL GCK+R+IK            A S KG+ ++Q
Sbjct: 239  LKFMDSESTRGSQVVKLQLNKEETDRILDGCKTRDIKLCGLLAAAGLIAAHSSKGLNENQ 298

Query: 211  WEKYAV 194
            WEKYA+
Sbjct: 299  WEKYAI 304



 Score =  102 bits (253), Expect(2) = e-100
 Identities = 44/52 (84%), Positives = 48/52 (92%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAILNTHD+KGG+DLWELA R+YTSF NAK NNKHF+DM DLNFLMCR
Sbjct: 324 GFYHSAILNTHDVKGGDDLWELAKRSYTSFINAKNNNKHFTDMGDLNFLMCR 375


>ref|XP_009791810.1| PREDICTED: uncharacterized protein LOC104238976 [Nicotiana
            sylvestris]
          Length = 480

 Score =  292 bits (747), Expect(2) = e-100
 Identities = 161/317 (50%), Positives = 203/317 (64%), Gaps = 21/317 (6%)
 Frame = -1

Query: 1081 MSDQESSN--APPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNA 908
            M+++ESS   APP E    + RP G TE+SWCKAVP GTGITV             LQNA
Sbjct: 1    MAEEESSTTAAPPCEF---ETRPAGNTEHSWCKAVPSGTGITVLALLLSKSPDISVLQNA 57

Query: 907  IHKLQIAHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLC----PSVS 740
            +HKLQ ++PIL S+LH++ T++++SYI P T  LQIQ FDL STAQILQ L      S+S
Sbjct: 58   LHKLQNSNPILKSKLHYEPTTNSYSYIIPSTSHLQIQPFDLSSTAQILQQLKNPNHTSIS 117

Query: 739  PYQLLLEHELKTNSWQNLDSSLDTA---VFFASLYTLENERWVLAFRLHTSACDRTSMTA 569
             + L+LEHE+  NSW N+ +S D +   V F S+Y L +E+W +  R+HTS CDR +  A
Sbjct: 118  DFHLILEHEINNNSWANVATSSDHSDSDVLFVSIYRLSDEKWAVTLRIHTSTCDRAAALA 177

Query: 568  LRRGLLETI------------GGEAAAVERVEMEFGLGIENYIPSGKSNKPFWARGVDML 425
            L R LLE +            GG A      +ME GLGIE YIP+GK+NKPFWARG+DM+
Sbjct: 178  LLRKLLELMSSKKGKGMNENEGGRAELELGEKMEVGLGIEEYIPAGKANKPFWARGIDMV 237

Query: 424  GYSLNSFRLANLSFKDTESPKASQVERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXX 245
            GY LN+ R ANL+F D+ES + SQV +L LN EDTD IL GCK+R IK            
Sbjct: 238  GYGLNALRFANLNFIDSESTRGSQVVKLQLNKEDTDRILDGCKTRGIKLCGVLAAAGLIA 297

Query: 244  ARSFKGIPDDQWEKYAV 194
            A S K +P+DQWEKYAV
Sbjct: 298  AHSSKDLPEDQWEKYAV 314



 Score =  100 bits (250), Expect(2) = e-100
 Identities = 44/52 (84%), Positives = 47/52 (90%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAILN HD+KGGEDLWELA R+YTSF NAK NNKHFSDM DLNFLMC+
Sbjct: 334 GFYHSAILNMHDVKGGEDLWELAMRSYTSFINAKNNNKHFSDMGDLNFLMCK 385


>emb|CDP15441.1| unnamed protein product [Coffea canephora]
          Length = 483

 Score =  294 bits (753), Expect(2) = 8e-99
 Identities = 160/310 (51%), Positives = 199/310 (64%), Gaps = 14/310 (4%)
 Frame = -1

Query: 1081 MSDQESSNAPPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIH 902
            MS+QES ++P     EPK RP GGTE SWCKAVP GTGITV             LQ  + 
Sbjct: 1    MSEQESLSSPMPRVLEPKSRPAGGTEQSWCKAVPGGTGITVLALLLSKAPDVPFLQTTLR 60

Query: 901  KLQIAHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLCPSVSP----- 737
             LQ  HPIL S+LH+D TS+ +SYI P TPQLQIQ FDL S ++IL+ L  S S      
Sbjct: 61   NLQNTHPILKSKLHYDSTSTTYSYIIPATPQLQIQPFDLASASEILRGLIRSNSTTFSTT 120

Query: 736  -YQLLLEHELKTNSWQNLDSSL---DTAVFFASLYTLENERWVLAFRLHTSACDRTSMTA 569
             + L+LEHEL    W N D S    D  +FFAS+YTL + +WV+A RLHTS CDRT+  +
Sbjct: 121  DFHLILEHELNRIVWPNPDPSSEADDVDLFFASVYTLSDAKWVVALRLHTSVCDRTTAVS 180

Query: 568  LRRGLLETIG-----GEAAAVERVEMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSF 404
            L R LL+ +      G    ++   +E  LGIE+Y+PSGK NKPFWARGVDMLGYSLNSF
Sbjct: 181  LLRELLKLMSADNGEGTQKEIDEELLEVRLGIEDYVPSGKGNKPFWARGVDMLGYSLNSF 240

Query: 403  RLANLSFKDTESPKASQVERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGI 224
            RL+NL+F+DT  P++SQV RL +N++ T  ++SGC++RNIK            A S K  
Sbjct: 241  RLSNLTFQDTGLPRSSQVIRLQINADVTQKLISGCQARNIKLCGLLASAALIAAHSAKCF 300

Query: 223  PDDQWEKYAV 194
            PDD WEKYAV
Sbjct: 301  PDDHWEKYAV 310



 Score = 95.5 bits (236), Expect(2) = 8e-99
 Identities = 41/52 (78%), Positives = 48/52 (92%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAILN+HDIKGGE+LWELA RT+ S+ NAK NNKHFSDM+D+NFLMC+
Sbjct: 330 GFYHSAILNSHDIKGGENLWELAERTHASYNNAKNNNKHFSDMSDVNFLMCK 381


>ref|XP_004246498.1| PREDICTED: uncharacterized protein LOC101249753 [Solanum
            lycopersicum]
          Length = 472

 Score =  287 bits (734), Expect(2) = 1e-98
 Identities = 156/308 (50%), Positives = 200/308 (64%), Gaps = 12/308 (3%)
 Frame = -1

Query: 1081 MSDQESSNAPPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIH 902
            M++Q S+ A P  ++E K RP G TE+SWCKAVP GTGITV             LQNA+H
Sbjct: 1    MAEQGSAAAAP--SSELKSRPAGSTEHSWCKAVPGGTGITVLALLLSKPPDISLLQNALH 58

Query: 901  KLQIAHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSL----CPSVSPY 734
            KLQ ++PIL SQLH++ +++++SYI P T  LQIQ FDL ST QIL+ L      SVS +
Sbjct: 59   KLQNSNPILKSQLHYESSTNSYSYIIPSTSHLQIQPFDLSSTVQILRRLKTSDLTSVSDF 118

Query: 733  QLLLEHELKTNSWQNLDSSLD--TAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRR 560
             L+LEHE+  NSW N  +S D  T +FFAS+Y LENE+ V A R+HTS CDR +  A+ +
Sbjct: 119  HLILEHEINHNSWMNTGTSSDSDTDIFFASIYQLENEKSVFALRIHTSVCDRAAALAVLK 178

Query: 559  GLLETIGGEAAAVERVE------MEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRL 398
             LL+ +  E    E  E      ME GLGIE YIP GK++KPFWARG+DM+GY LNS R 
Sbjct: 179  KLLKLVSCEKEDEEGTELEILKKMEVGLGIEEYIPDGKASKPFWARGIDMVGYGLNSLRF 238

Query: 397  ANLSFKDTESPKASQVERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPD 218
             NL F D+ES + SQV +L LN ++TD IL GCK+R IK            A S KG+ +
Sbjct: 239  CNLKFMDSESTRGSQVVKLQLNKQETDHILDGCKTRGIKLCGLLAAAGLIAAHSLKGLKE 298

Query: 217  DQWEKYAV 194
            +QWEKYA+
Sbjct: 299  NQWEKYAI 306



 Score =  102 bits (253), Expect(2) = 1e-98
 Identities = 44/52 (84%), Positives = 48/52 (92%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAILNTHD+KGG+DLWELA R+YTSF NAK NNKHF+DM DLNFLMCR
Sbjct: 326 GFYHSAILNTHDVKGGDDLWELAKRSYTSFINAKNNNKHFTDMGDLNFLMCR 377


>ref|XP_009336970.1| PREDICTED: uncharacterized protein LOC103929485 [Pyrus x
            bretschneideri]
          Length = 477

 Score =  301 bits (772), Expect(2) = 5e-98
 Identities = 159/302 (52%), Positives = 197/302 (65%), Gaps = 6/302 (1%)
 Frame = -1

Query: 1081 MSDQESSNAPPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIH 902
            MS+ +  N  P    EPK RPVGGTE SWCKAVP GTGITV             LQ A+H
Sbjct: 1    MSESDHQNPNPPAMPEPKTRPVGGTELSWCKAVPSGTGITVLALLLSRPPNFSNLQTALH 60

Query: 901  KLQIAHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLCPSVSPYQLLL 722
             LQ +HPIL S+  FD  +  +S++TPPTP LQIQ FDLPSTA ILQ    +++P   +L
Sbjct: 61   NLQNSHPILRSKHQFDPATGTYSFLTPPTPHLQIQPFDLPSTAPILQINPDNIAPLHQIL 120

Query: 721  EHELKTNSWQNLDSSL--DTAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRRGLL- 551
            EHE+ +N+WQNLD SL  DT V +AS Y +   RWVL  RLHT+ACDR +  A+ + LL 
Sbjct: 121  EHEMNSNTWQNLDPSLESDTDVMYASTYAISESRWVLVLRLHTAACDRAAAVAVLKELLG 180

Query: 550  ---ETIGGEAAAVERVEMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANLSFK 380
                T GG A    + + E  LGIE+ IP+GK+NKPFWARGVDMLGYSLNS RL+NL F+
Sbjct: 181  ELKSTGGGGAERELKGDGEVSLGIEDLIPNGKANKPFWARGVDMLGYSLNSLRLSNLEFQ 240

Query: 379  DTESPKASQVERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQWEKY 200
            D  + + SQV +L L+ EDTD +L+GCKSR+IK            AR+ K +PD QWEKY
Sbjct: 241  DASAERRSQVVKLQLSPEDTDRLLAGCKSRDIKLCGALAAAGMIAARASKQLPDHQWEKY 300

Query: 199  AV 194
             V
Sbjct: 301  GV 302



 Score = 85.5 bits (210), Expect(2) = 5e-98
 Identities = 39/53 (73%), Positives = 44/53 (83%), Gaps = 1/53 (1%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKG-GEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAI+NTHDI G G  LW+LA R Y SFA AK +NKHF+DMADLNFLMC+
Sbjct: 322 GFYHSAIMNTHDINGEGNTLWDLAKRCYMSFAGAKNSNKHFTDMADLNFLMCK 374


>ref|XP_008387017.1| PREDICTED: uncharacterized protein LOC103449472 [Malus domestica]
          Length = 477

 Score =  299 bits (765), Expect(2) = 5e-98
 Identities = 158/302 (52%), Positives = 195/302 (64%), Gaps = 6/302 (1%)
 Frame = -1

Query: 1081 MSDQESSNAPPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIH 902
            MS+ +  N  P    EPK R VGGTE+SWCKAVPCGTGITV             LQ A+H
Sbjct: 1    MSESDHQNPNPPAMPEPKTRXVGGTEHSWCKAVPCGTGITVLALLLTRPPNFSNLQTALH 60

Query: 901  KLQIAHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLCPSVSPYQLLL 722
             LQ +HPIL S+ HFD  +  +S++T PTP LQIQ FDLPSTA ILQ    +++P   +L
Sbjct: 61   NLQNSHPILRSKHHFDPATGTYSFLTLPTPHLQIQPFDLPSTAPILQINPDNIAPLHQIL 120

Query: 721  EHELKTNSWQNLDSSL--DTAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRRGLL- 551
            EHE+  N WQNLD S+  DT V +AS Y +   RWVL  RLHTSACDR +  A+ + LL 
Sbjct: 121  EHEMNLNPWQNLDPSVESDTDVIYASTYAISESRWVLVLRLHTSACDRAAAVAVLKELLG 180

Query: 550  ---ETIGGEAAAVERVEMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANLSFK 380
                T GG A    + + E  LGIE+ IP+GK+NKPFWARGVDM+GYSLNS RL+NL F+
Sbjct: 181  ELKSTGGGGAERELKGDGEVSLGIEDLIPNGKANKPFWARGVDMMGYSLNSLRLSNLEFQ 240

Query: 379  DTESPKASQVERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQWEKY 200
            D  S + SQV +L L+ EDT  +L+GCKSR+IK            AR+ K +PD QWEKY
Sbjct: 241  DASSERRSQVVKLQLSPEDTHRLLAGCKSRDIKLCGALAAAGMIAARASKQLPDHQWEKY 300

Query: 199  AV 194
             V
Sbjct: 301  GV 302



 Score = 88.2 bits (217), Expect(2) = 5e-98
 Identities = 40/53 (75%), Positives = 46/53 (86%), Gaps = 1/53 (1%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKG-GEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAI+NTHDI G G  LW+LA R YTSFA+AK +NKHF+DMADLNFLMC+
Sbjct: 322 GFYHSAIMNTHDINGEGNTLWDLAKRCYTSFASAKNSNKHFTDMADLNFLMCK 374


>ref|XP_007027791.1| GATA zinc finger domain-containing protein C1393.08 isoform 1
            [Theobroma cacao] gi|508716396|gb|EOY08293.1| GATA zinc
            finger domain-containing protein C1393.08 isoform 1
            [Theobroma cacao]
          Length = 478

 Score =  300 bits (769), Expect(2) = 9e-98
 Identities = 162/302 (53%), Positives = 196/302 (64%), Gaps = 10/302 (3%)
 Frame = -1

Query: 1069 ESSNAPPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIHKLQI 890
            E    P  +  EPK+RP GGTEYSWC+AVP GTGITV             L+ A+ +LQ+
Sbjct: 3    EFDQNPTPKTPEPKVRPAGGTEYSWCRAVPGGTGITVLSLLLSNPPDISLLEAALCRLQV 62

Query: 889  AHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLC--PSVSPYQLLLEH 716
            +HPIL S+LHFD   + FS+IT   P  +IQ FDLPST+ ILQSL   P++  +Q LLEH
Sbjct: 63   SHPILRSRLHFDTCRNTFSFITHRNPHAKIQSFDLPSTSHILQSLSGDPNIDSHQFLLEH 122

Query: 715  ELKTNSWQNLDSS-LDTAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRRGLLETIG 539
            EL  NSW   D    D  VFF SLYTL   RWV+ FRLHTSACDR +  AL + LLE +G
Sbjct: 123  ELNRNSWNLPDGDQADRDVFFVSLYTLSETRWVVVFRLHTSACDRAAAVALLKELLELVG 182

Query: 538  GEAAAVERV-------EMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANLSFK 380
            G  + VE +       E+E  LGIE+ IPSGK+NKPFWARGVDMLGYSLNSFRLANL+F 
Sbjct: 183  GGRSKVEEIAKGNDEKEVELSLGIEDLIPSGKANKPFWARGVDMLGYSLNSFRLANLNFV 242

Query: 379  DTESPKASQVERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQWEKY 200
            D  S + SQV RL +N ++TD +++GCKSR IK            ARS K  P+ Q EKY
Sbjct: 243  DANSARRSQVVRLQMNPDETDGLVAGCKSRGIKLCGALAAAGLIAARSTKAYPEHQREKY 302

Query: 199  AV 194
            AV
Sbjct: 303  AV 304



 Score = 85.9 bits (211), Expect(2) = 9e-98
 Identities = 37/52 (71%), Positives = 43/52 (82%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAILNTHD+   E +WELA R Y SF+NAK N+KHF+DM DLNFLMC+
Sbjct: 324 GFYHSAILNTHDVTAHEQVWELARRCYMSFSNAKNNDKHFTDMNDLNFLMCK 375


>ref|XP_007203877.1| hypothetical protein PRUPE_ppa005088mg [Prunus persica]
            gi|462399408|gb|EMJ05076.1| hypothetical protein
            PRUPE_ppa005088mg [Prunus persica]
          Length = 477

 Score =  297 bits (760), Expect(2) = 9e-98
 Identities = 160/303 (52%), Positives = 193/303 (63%), Gaps = 7/303 (2%)
 Frame = -1

Query: 1081 MSDQESSNAPPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIH 902
            MS+    N  P    EPK RPVGGTEYSWCKAVP GTGITV             LQ A+H
Sbjct: 1    MSEWGDQNLNPEAMPEPKTRPVGGTEYSWCKAVPSGTGITVLALLLSKPPNFSILQTALH 60

Query: 901  KLQIAHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLC-PSVSPYQLL 725
             LQ +HP+L S+  FD T++ FS++TPPTP LQIQ FDLPSTA ILQ+   P++  + L+
Sbjct: 61   NLQYSHPVLRSKHLFDPTTNTFSFLTPPTPHLQIQPFDLPSTALILQNQSQPNIPAFHLI 120

Query: 724  LEHELKTNSWQNLD--SSLDTAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRRGLL 551
            LEHEL  N+W+N +  S  DT V FAS+YT+   RW LA R+HTSACDR +  AL R LL
Sbjct: 121  LEHELNINTWRNPNPSSDADTDVLFASVYTISESRWALALRVHTSACDRAAAVALLRALL 180

Query: 550  ----ETIGGEAAAVERVEMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANLSF 383
                 T  G A    +   E  LGIE+ IP+GK+NKPFWARGVDMLGYSLNS RL+NL F
Sbjct: 181  GEMKSTGRGGAERELKGNGEVSLGIEDLIPNGKANKPFWARGVDMLGYSLNSLRLSNLDF 240

Query: 382  KDTESPKASQVERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQWEK 203
            KD  S + S+V +L LN  D   +L+GCKSR IK              + K +PD QWEK
Sbjct: 241  KDASSARRSRVVKLQLNPHDCQRLLAGCKSREIKLSGALAAAGLIAVHASKHLPDHQWEK 300

Query: 202  YAV 194
            YAV
Sbjct: 301  YAV 303



 Score = 89.4 bits (220), Expect(2) = 9e-98
 Identities = 38/52 (73%), Positives = 45/52 (86%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAI+NTHDI GG  LWELA R + +FANAK +NKHF+DM+DLNFLMC+
Sbjct: 323 GFYHSAIMNTHDINGGNTLWELAKRCHIAFANAKNSNKHFTDMSDLNFLMCK 374


>ref|XP_009342999.1| PREDICTED: uncharacterized protein LOC103934958 [Pyrus x
            bretschneideri]
          Length = 477

 Score =  300 bits (768), Expect(2) = 1e-97
 Identities = 159/302 (52%), Positives = 196/302 (64%), Gaps = 6/302 (1%)
 Frame = -1

Query: 1081 MSDQESSNAPPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIH 902
            MS+ +  N  P    EPK RPVGGTE SWCKAVP GTGITV             LQ A+H
Sbjct: 1    MSESDHQNPNPPAMPEPKTRPVGGTELSWCKAVPSGTGITVLALLLSRPPNFSNLQTALH 60

Query: 901  KLQIAHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLCPSVSPYQLLL 722
             LQ +HPIL S+  FD  +  +S++TPPTP LQIQ FDLPSTA ILQ    +++P   +L
Sbjct: 61   NLQNSHPILRSKHQFDPATGNYSFLTPPTPHLQIQPFDLPSTAPILQINPDNIAPLHQIL 120

Query: 721  EHELKTNSWQNLDSSL--DTAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRRGLL- 551
            EHE+  N+WQNLD SL  DT V +AS Y +   RWVL  RLHT+ACDR +  A+ + LL 
Sbjct: 121  EHEMNLNTWQNLDPSLESDTDVMYASTYAISESRWVLVLRLHTAACDRAAAVAVLKELLG 180

Query: 550  ---ETIGGEAAAVERVEMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANLSFK 380
                T GG A    + + E  LGIE+ IP+GK+NKPFWARGVDMLGYSLNS RL+NL F+
Sbjct: 181  ELKSTGGGGAERELKGDGEVSLGIEDLIPNGKANKPFWARGVDMLGYSLNSLRLSNLEFQ 240

Query: 379  DTESPKASQVERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQWEKY 200
            D  + + SQV +L L+ EDTD +L+GCKSR+IK            AR+ K +PD QWEKY
Sbjct: 241  DASAERRSQVVKLQLSPEDTDRLLAGCKSRDIKLCGALAAAGMIAARASKQLPDHQWEKY 300

Query: 199  AV 194
             V
Sbjct: 301  GV 302



 Score = 85.5 bits (210), Expect(2) = 1e-97
 Identities = 39/53 (73%), Positives = 44/53 (83%), Gaps = 1/53 (1%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKG-GEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAI+NTHDI G G  LW+LA R Y SFA AK +NKHF+DMADLNFLMC+
Sbjct: 322 GFYHSAIMNTHDINGEGNTLWDLAKRCYMSFAGAKNSNKHFTDMADLNFLMCK 374


>ref|XP_008243529.1| PREDICTED: uncharacterized protein LOC103341759 [Prunus mume]
          Length = 464

 Score =  291 bits (746), Expect(2) = 9e-97
 Identities = 156/288 (54%), Positives = 188/288 (65%), Gaps = 7/288 (2%)
 Frame = -1

Query: 1036 EPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIHKLQIAHPILNSQLHF 857
            EPK RPVGGTEYSWCKAVP GTGITV             LQ A+H +Q +HP+L S+  F
Sbjct: 3    EPKTRPVGGTEYSWCKAVPSGTGITVLALLLSKPPNFSILQTALHNIQNSHPVLRSKHLF 62

Query: 856  DLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLC-PSVSPYQLLLEHELKTNSWQNLD- 683
            D T++ FS++TPPTP LQIQ FDLPST  ILQ+   P++  + L+LEHEL  N+W+N + 
Sbjct: 63   DPTTNTFSFLTPPTPHLQIQRFDLPSTVLILQNQNQPNIPAFHLILEHELNINTWRNPNP 122

Query: 682  -SSLDTAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRRGLL----ETIGGEAAAVE 518
             S  D  V FAS+YT+   RW LA R+HTSACDR +  AL R LL     T GGEA    
Sbjct: 123  SSDADADVLFASVYTISESRWALALRVHTSACDRAAAVALLRALLAEMKSTGGGEAEREL 182

Query: 517  RVEMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANLSFKDTESPKASQVERLC 338
            +   E  LGIE+ IP+GK+NKPFWARGVDMLGYSLNS RL+NL FKD  S + SQV +L 
Sbjct: 183  KGNGEISLGIEDLIPNGKANKPFWARGVDMLGYSLNSLRLSNLDFKDASSARRSQVVKLQ 242

Query: 337  LNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQWEKYAV 194
            LN  D   +L+GCKSR IK            A + K + D QWEKYAV
Sbjct: 243  LNPHDCQRLLAGCKSREIKLSGALAAAGLIAAHASKHLSDHQWEKYAV 290



 Score = 91.3 bits (225), Expect(2) = 9e-97
 Identities = 39/52 (75%), Positives = 45/52 (86%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAI+NTHDI GG  LWELA R Y +FANAK +NKHF+DM+DLNFLMC+
Sbjct: 310 GFYHSAIMNTHDINGGNTLWELAKRCYIAFANAKNSNKHFTDMSDLNFLMCK 361


>ref|XP_002308465.2| hypothetical protein POPTR_0006s22750g [Populus trichocarpa]
            gi|550336884|gb|EEE91988.2| hypothetical protein
            POPTR_0006s22750g [Populus trichocarpa]
          Length = 478

 Score =  294 bits (752), Expect(2) = 5e-96
 Identities = 158/305 (51%), Positives = 193/305 (63%), Gaps = 7/305 (2%)
 Frame = -1

Query: 1087 HHMSDQESSNAPPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNA 908
            +H  DQ   N    +A EPK RPVG TEYSWC++VP GTGITV             LQ  
Sbjct: 3    NHGQDQLDPNP---QAPEPKARPVGATEYSWCRSVPLGTGITVLALLLSKQPDIHLLQTT 59

Query: 907  IHKLQIAHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLCPSVSPYQL 728
            + KLQ + P+L ++L F+ T++ FS+ITPP P +QIQ FDLPSTA I+ +   ++ PY +
Sbjct: 60   LDKLQNSRPLLRTKLRFNSTTNTFSFITPPAPHVQIQPFDLPSTADIISNSDQNIDPYHI 119

Query: 727  LLEHELKTNSWQN-LDSSLD--TAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRRG 557
            +LEHEL  NSW   LD S D  T VFF +LYTL   RW +  RLHTS CDR +   L R 
Sbjct: 120  ILEHELNKNSWSAYLDQSSDAETNVFFITLYTLSENRWAVVLRLHTSTCDRAAAVGLLRE 179

Query: 556  LLETIGGE----AAAVERVEMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANL 389
            LL  +GGE           E+E  LGIE+YIPSGK NKPFWARG+DMLGYSLNSFRL+NL
Sbjct: 180  LLVLMGGENQGGITKEYENEVEVSLGIEDYIPSGKGNKPFWARGIDMLGYSLNSFRLSNL 239

Query: 388  SFKDTESPKASQVERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQW 209
             F D +SP+ SQV RL +NS+DT  +L GC SR IK            A+S K +PD Q 
Sbjct: 240  DFVDADSPRGSQVVRLQMNSDDTQKLLDGCMSRGIKLSGALAAAGLIAAQSTKDLPDHQM 299

Query: 208  EKYAV 194
            EKYAV
Sbjct: 300  EKYAV 304



 Score = 86.7 bits (213), Expect(2) = 5e-96
 Identities = 36/52 (69%), Positives = 43/52 (82%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSA+LNTHD+ GG  LW+LA R Y ++ NAK NNKHF+DM DLNFLMC+
Sbjct: 324 GFYHSAMLNTHDVSGGVMLWDLAKRCYMAYTNAKNNNKHFTDMGDLNFLMCK 375


>ref|XP_011004215.1| PREDICTED: uncharacterized protein LOC105110757 [Populus euphratica]
          Length = 478

 Score =  292 bits (747), Expect(2) = 6e-96
 Identities = 160/305 (52%), Positives = 193/305 (63%), Gaps = 7/305 (2%)
 Frame = -1

Query: 1087 HHMSDQESSNAPPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNA 908
            +H  DQ   N     A EPK RPVG TEYSWC++VP GTGITV             LQ  
Sbjct: 3    NHGQDQLDPNP---HAPEPKARPVGATEYSWCRSVPLGTGITVLALLLSKQPNIHLLQAT 59

Query: 907  IHKLQIAHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLCPSVSPYQL 728
            + KLQ + P+L ++L F+ T+++FS+ITPP P +QIQ FDL STA I+ +   ++ PY +
Sbjct: 60   LDKLQNSRPLLRTKLRFNSTTNSFSFITPPAPHVQIQPFDLSSTADIISNGDQNIDPYHI 119

Query: 727  LLEHELKTNSWQN-LDSSLD--TAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRRG 557
            +LEHEL  NSW   LD S D  T VFF SLYTL   RWV+ FRLHTS CDR +   L R 
Sbjct: 120  ILEHELNKNSWSAYLDQSSDAETNVFFISLYTLSENRWVVVFRLHTSTCDRAAAVGLLRE 179

Query: 556  LLETIGGE----AAAVERVEMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANL 389
            LL  +GGE     A     E+E  LGIE+YIP GK NKPFWARGVDMLGYSLNSFR++NL
Sbjct: 180  LLALMGGENQGGIAKEYENEVEVSLGIEDYIPRGKGNKPFWARGVDMLGYSLNSFRISNL 239

Query: 388  SFKDTESPKASQVERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQW 209
             F D  SP+ SQV RL +NS+DT  +L GC SR IK            A+S K +PD Q 
Sbjct: 240  DFVDAGSPRGSQVVRLQMNSDDTQKLLDGCMSRGIKLSGALSAAGLIAAQSTKDLPDHQM 299

Query: 208  EKYAV 194
            EKYAV
Sbjct: 300  EKYAV 304



 Score = 88.2 bits (217), Expect(2) = 6e-96
 Identities = 37/52 (71%), Positives = 43/52 (82%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAILNTHD+ GG  LW+LA R Y ++ NAK NNKHF+DM DLNFLMC+
Sbjct: 324 GFYHSAILNTHDVSGGVMLWDLAKRCYVAYTNAKNNNKHFTDMGDLNFLMCK 375


>ref|XP_002530334.1| conserved hypothetical protein [Ricinus communis]
            gi|223530138|gb|EEF32050.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 474

 Score =  287 bits (735), Expect(2) = 1e-95
 Identities = 154/291 (52%), Positives = 189/291 (64%), Gaps = 7/291 (2%)
 Frame = -1

Query: 1045 EAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIHKLQIAHPILNSQ 866
            +  EP  R VGGTE+SWCKA+P GTGITV              Q A+H+LQ +HPIL S+
Sbjct: 10   QTLEPIARAVGGTEHSWCKAIPAGTGITVLGLLLSKAPNIPFFQAALHQLQSSHPILRSK 69

Query: 865  LHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLC-------PSVSPYQLLLEHELK 707
            LHFD  +++FSYITPP+P LQIQ FDLPST  I  S+         +++PY LLLEHE+ 
Sbjct: 70   LHFDTPTASFSYITPPSPHLQIQFFDLPSTTAIHNSITTTTTDNNDNITPYHLLLEHEMN 129

Query: 706  TNSWQNLDSSLDTAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRRGLLETIGGEAA 527
             NSW +  SS D  +FFAS+YTL   RWVL  RLHTSACDR S  AL R LLE +GG   
Sbjct: 130  KNSWSS--SSSDNDLFFASVYTLSETRWVLVLRLHTSACDRASAAALLRELLEQMGG-GG 186

Query: 526  AVERVEMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANLSFKDTESPKASQVE 347
             +E  + E G+ IE+ IP GKS+K FWARG+D++GYSLNSFRLANL+F D  S + SQV 
Sbjct: 187  EIENYKEELGVPIEDCIPDGKSSKWFWARGMDVVGYSLNSFRLANLNFIDASSARRSQVI 246

Query: 346  RLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQWEKYAV 194
            RL +NS+ T  ++ GCKSR IK            A S K +P DQ  KYAV
Sbjct: 247  RLQINSDQTFKLVEGCKSRGIKLCGALAAAGLIAAHSTKDLPHDQSHKYAV 297



 Score = 92.0 bits (227), Expect(2) = 1e-95
 Identities = 41/50 (82%), Positives = 44/50 (88%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLM 9
           GFYHSAILNTHDI GG+ LWE+A R Y SFANAKKNNKHF+DM DLNFLM
Sbjct: 317 GFYHSAILNTHDINGGDKLWEVAQRCYMSFANAKKNNKHFTDMGDLNFLM 366


>ref|XP_004303543.2| PREDICTED: uncharacterized protein LOC101300265 [Fragaria vesca
            subsp. vesca]
          Length = 525

 Score =  289 bits (740), Expect(2) = 5e-95
 Identities = 155/303 (51%), Positives = 195/303 (64%), Gaps = 5/303 (1%)
 Frame = -1

Query: 1087 HHMSDQESSNAPPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNA 908
            H+MSD+    AP   + +   RPVGGTEYSWCKAVP GTGITV             LQNA
Sbjct: 53   HNMSDRNPIPAPT--SPDSIARPVGGTEYSWCKAVPVGTGITVLALLLTKPPNIPLLQNA 110

Query: 907  IHKLQIAHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLCPSVSPYQL 728
            +H LQ +HPIL S LHFD  +++FS++TPP+P LQIQ FDLPSTA IL+    +VSP+  
Sbjct: 111  LHNLQNSHPILRSNLHFDSATNSFSFLTPPSPHLQIQPFDLPSTASILRP-SSAVSPFHQ 169

Query: 727  LLEHELKTNSWQN---LDSSLDTAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRRG 557
            +LEHEL  N+W+N     +  DT VFFAS Y L   RW L  RLHTSACDR +  AL + 
Sbjct: 170  ILEHELNLNTWRNPHPPSADADTDVFFASTYGLSESRWALVLRLHTSACDRAAAVALLKE 229

Query: 556  LLETIGGEAAAVERV--EMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANLSF 383
            LL ++GG       +  + E    +E+ IP+GK++KPFWARGVDMLGYSLNS RL+NL F
Sbjct: 230  LLGSVGGGGGTEMEIKGKGEVLSALEDLIPNGKASKPFWARGVDMLGYSLNSLRLSNLEF 289

Query: 382  KDTESPKASQVERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQWEK 203
            KD    K +Q+ +L +N E TD +L+GCKS+ IK            A + K +PD QWEK
Sbjct: 290  KDVSLEKTTQMVKLRINPEHTDKLLAGCKSKGIKLCGVLAAAGLIAAHASKHLPDHQWEK 349

Query: 202  YAV 194
            YAV
Sbjct: 350  YAV 352



 Score = 87.8 bits (216), Expect(2) = 5e-95
 Identities = 38/52 (73%), Positives = 45/52 (86%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAI+N+HDI G   LWELA R YT+FA+AK +NKHFSDM+DLNFLMC+
Sbjct: 372 GFYHSAIVNSHDINGENTLWELAKRCYTAFADAKNSNKHFSDMSDLNFLMCK 423


>ref|XP_007027792.1| GATA zinc finger domain-containing protein C1393.08 isoform 2
            [Theobroma cacao] gi|508716397|gb|EOY08294.1| GATA zinc
            finger domain-containing protein C1393.08 isoform 2
            [Theobroma cacao]
          Length = 481

 Score =  286 bits (733), Expect(2) = 1e-93
 Identities = 159/305 (52%), Positives = 194/305 (63%), Gaps = 13/305 (4%)
 Frame = -1

Query: 1069 ESSNAPPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIHKLQI 890
            E    P  +  EPK+RP GGTEYSWC+AVP GTGITV             L+ A+ +LQ+
Sbjct: 3    EFDQNPTPKTPEPKVRPAGGTEYSWCRAVPGGTGITVLSLLLSNPPDISLLEAALCRLQV 62

Query: 889  AHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLC--PSVSPYQLLLEH 716
            +HPIL S+LHFD   + FS+IT   P  +IQ FDLPST+ ILQSL   P++  +Q LLEH
Sbjct: 63   SHPILRSRLHFDTCRNTFSFITHRNPHAKIQSFDLPSTSHILQSLSGDPNIDSHQFLLEH 122

Query: 715  ELKTNSWQNLD-SSLDTAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRRGLLETIG 539
            EL  NSW   D    D  VFF SLYTL   RWV+ FRLHTSACDR +  AL + LLE +G
Sbjct: 123  ELNRNSWNLPDGDQADRDVFFVSLYTLSETRWVVVFRLHTSACDRAAAVALLKELLELVG 182

Query: 538  GEAAAVERV-------EMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANLSFK 380
            G  + VE +       E+E  LGIE+ IPSGK+NKPFWARGVDMLGYSLNSFRLANL+F 
Sbjct: 183  GGRSKVEEIAKGNDEKEVELSLGIEDLIPSGKANKPFWARGVDMLGYSLNSFRLANLNFV 242

Query: 379  DTESPKASQVERLCLNSEDTDMILS---GCKSRNIKXXXXXXXXXXXXARSFKGIPDDQW 209
            D  S + SQV RL +N ++TD +++     +SR IK            ARS K  P+ Q 
Sbjct: 243  DANSARRSQVVRLQMNPDETDGLVAVSDELQSRGIKLCGALAAAGLIAARSTKAYPEHQR 302

Query: 208  EKYAV 194
            EKYAV
Sbjct: 303  EKYAV 307



 Score = 85.9 bits (211), Expect(2) = 1e-93
 Identities = 37/52 (71%), Positives = 43/52 (82%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAILNTHD+   E +WELA R Y SF+NAK N+KHF+DM DLNFLMC+
Sbjct: 327 GFYHSAILNTHDVTAHEQVWELARRCYMSFSNAKNNDKHFTDMNDLNFLMCK 378


>ref|XP_012485792.1| PREDICTED: uncharacterized protein LOC105799668 [Gossypium raimondii]
            gi|763769134|gb|KJB36349.1| hypothetical protein
            B456_006G154200 [Gossypium raimondii]
          Length = 478

 Score =  278 bits (711), Expect(2) = 2e-92
 Identities = 151/302 (50%), Positives = 190/302 (62%), Gaps = 7/302 (2%)
 Frame = -1

Query: 1078 SDQESSNAPPVEAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIHK 899
            S+Q  +  P  +  EPK+R VG TEYSWC+AVP GTGITV             L+  + +
Sbjct: 4    SEQTQTQNPDPKTPEPKVRAVGVTEYSWCRAVPGGTGITVLSLLLSNVPDISFLEALLCR 63

Query: 898  LQIAHPILNSQLHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLCPS--VSPYQLL 725
            LQ++HPIL S++ FD + + F ++TP  P +QIQ FDL ST+ ILQS      +  + +L
Sbjct: 64   LQVSHPILRSRVRFDASCNTFYFVTPSNPHVQIQSFDLQSTSHILQSSLGDSHIDSHHVL 123

Query: 724  LEHELKTNSWQNLDSS-----LDTAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRR 560
            LEHEL  NSW   D +      D  VFF S+YT+ + RW L FRLHTSACDR +   L R
Sbjct: 124  LEHELNRNSWNRTDGAGDGDQADWDVFFVSIYTISDTRWFLVFRLHTSACDRAAAVGLLR 183

Query: 559  GLLETIGGEAAAVERVEMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANLSFK 380
             LLE +GG  A  E  E+   +GIE+ IPSGK+NKP WARGVD+LGYSLNSFRLANL+F 
Sbjct: 184  ELLEMVGGGRAKAEE-EIVQEVGIEDLIPSGKANKPLWARGVDLLGYSLNSFRLANLNFI 242

Query: 379  DTESPKASQVERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQWEKY 200
            D  S + SQV RL +N +DTD +++GCKSR IK            ARS K  PD Q EKY
Sbjct: 243  DANSARHSQVVRLKMNPDDTDRLVAGCKSRGIKLCGALAAAGMIAARSTKPFPDHQKEKY 302

Query: 199  AV 194
            AV
Sbjct: 303  AV 304



 Score = 90.5 bits (223), Expect(2) = 2e-92
 Identities = 39/52 (75%), Positives = 45/52 (86%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAILNTHD+   +++WELA R YTSF+NAK NNKHFSDM DLNFLMC+
Sbjct: 324 GFYHSAILNTHDVTALDEVWELANRCYTSFSNAKNNNKHFSDMNDLNFLMCK 375


>emb|CAN71367.1| hypothetical protein VITISV_014691 [Vitis vinifera]
          Length = 465

 Score =  275 bits (702), Expect(2) = 3e-92
 Identities = 147/290 (50%), Positives = 187/290 (64%), Gaps = 6/290 (2%)
 Frame = -1

Query: 1045 EAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIHKLQIAHPILNSQ 866
            E  + K R  GGTEYSWCKAVP GTGIT              LQ A+HKLQ A+PIL S+
Sbjct: 5    ETTQLKGRAAGGTEYSWCKAVPGGTGITALAILLSKAPDFSLLQAALHKLQNAYPILRSK 64

Query: 865  LHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSL----CPSVSPYQLLLEHELKTNS 698
            LHFD  ++AFS+ T   P LQ++ FDL ST+ ILQ+L      SVSP+  + EH+L  N+
Sbjct: 65   LHFDPKTNAFSFFTTQNPYLQLETFDLSSTSGILQTLPDPETDSVSPFHRIFEHQLNLNT 124

Query: 697  WQNLD--SSLDTAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRRGLLETIGGEAAA 524
            W N D  S+ +T +FFAS+YTL N+ WV+  RLHT+ACDRTS  AL R LL  +GG    
Sbjct: 125  WHNPDPSSNTETDLFFASVYTLSNDEWVVTLRLHTAACDRTSAVALLRKLLALMGGGREK 184

Query: 523  VERVEMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANLSFKDTESPKASQVER 344
                E E  LGIE+ IP+GK+NKPFWARGV+MLGYSLNSFRL+NL+F D  SP++S+V R
Sbjct: 185  ELEKETELSLGIEDMIPNGKANKPFWARGVNMLGYSLNSFRLSNLNFIDANSPRSSEVVR 244

Query: 343  LCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQWEKYAV 194
            L L ++ T +I + C+SR IK            A +   +PD +W KY V
Sbjct: 245  LHLPADHTALITAACESREIKLCGALAAAALIAAHASNHLPDGRWAKYGV 294



 Score = 93.2 bits (230), Expect(2) = 3e-92
 Identities = 42/52 (80%), Positives = 44/52 (84%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAILNTHD+ G E LWELA RTY S+AN K  NKHFSDMADLNFLMCR
Sbjct: 314 GFYHSAILNTHDVNGSETLWELARRTYGSYANDKNYNKHFSDMADLNFLMCR 365


>ref|XP_006430166.1| hypothetical protein CICLE_v10011628mg [Citrus clementina]
            gi|557532223|gb|ESR43406.1| hypothetical protein
            CICLE_v10011628mg [Citrus clementina]
          Length = 478

 Score =  272 bits (696), Expect(2) = 4e-92
 Identities = 148/292 (50%), Positives = 185/292 (63%), Gaps = 8/292 (2%)
 Frame = -1

Query: 1045 EAAEPKIRPVGGTEYSWCKAVPCGTGITVXXXXXXXXXXXXXLQNAIHKLQIAHPILNSQ 866
            EA EP  RPVGGTEYSWCKAVP GTGITV             LQ A++ LQ  HPIL S+
Sbjct: 11   EAPEPMARPVGGTEYSWCKAVPTGTGITVLALLLSKHPNIQRLQTALNNLQNNHPILRSK 70

Query: 865  LHFDLTSSAFSYITPPTPQLQIQLFDLPSTAQILQSLCPSVSPYQLLLEHELKTNSWQN- 689
            LH    +  FS+ITPP P +QIQ  D+ ST+Q +     +VSP+QL+LEHEL  N+W N 
Sbjct: 71   LHSGADAKTFSFITPPEPHIQIQHLDISSTSQTISDKTGAVSPFQLILEHELNRNTWTNP 130

Query: 688  ---LDSSLDTAVFFASLYTLENERWVLAFRLHTSACDRTSMTALRRGLLETI-GGEAAAV 521
                +++ D+ +F  S+YT    +WV+  RLHTS CDR S  A+ + LL  + G E   +
Sbjct: 131  SHQSNTNSDSNLFNVSIYTPSETQWVVTLRLHTSICDRASAVAVLKELLRLMTGREEGGI 190

Query: 520  ERV---EMEFGLGIENYIPSGKSNKPFWARGVDMLGYSLNSFRLANLSFKDTESPKASQV 350
            E+    + E  LGIE +IPSGK+NKPFWARGVDMLGYSLNS RL+N+SF D +SP+ SQV
Sbjct: 191  EKEYDRKGEVSLGIEEFIPSGKANKPFWARGVDMLGYSLNSLRLSNISFVDADSPRFSQV 250

Query: 349  ERLCLNSEDTDMILSGCKSRNIKXXXXXXXXXXXXARSFKGIPDDQWEKYAV 194
             RL LN ++T  ++ GCKSR IK            ARS K  P  Q EKYAV
Sbjct: 251  LRLQLNRDETGRLVEGCKSRGIKLCGALAAAGLIAARSTKYFPSHQREKYAV 302



 Score = 95.1 bits (235), Expect(2) = 4e-92
 Identities = 41/52 (78%), Positives = 48/52 (92%)
 Frame = -3

Query: 158 GFYHSAILNTHDIKGGEDLWELATRTYTSFANAKKNNKHFSDMADLNFLMCR 3
           GFYHSAILNTHD+ G E+LWELATR+YTSFANAK ++KHF+DM DLNFLMC+
Sbjct: 322 GFYHSAILNTHDVNGEEELWELATRSYTSFANAKNSDKHFTDMNDLNFLMCK 373

