BLASTX nr result

ID: Catharanthus22_contig00029248 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00029248
         (1700 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002525443.1| transcription factor, putative [Ricinus comm...   540   e-151
ref|XP_004249601.1| PREDICTED: uncharacterized protein LOC101256...   539   e-150
ref|XP_006338933.1| PREDICTED: uncharacterized protein LOC102592...   533   e-148
gb|EOY11198.1| Homeodomain-like superfamily protein isoform 1 [T...   531   e-148
gb|EOY11199.1| Homeodomain-like superfamily protein isoform 2 [T...   530   e-148
ref|XP_006339130.1| PREDICTED: uncharacterized protein LOC102602...   526   e-146
ref|XP_006339131.1| PREDICTED: uncharacterized protein LOC102602...   523   e-146
ref|XP_002325408.2| myb family transcription factor family prote...   520   e-144
ref|XP_006338935.1| PREDICTED: uncharacterized protein LOC102592...   514   e-143
ref|XP_002319702.2| myb family transcription factor family prote...   512   e-142
ref|XP_006339132.1| PREDICTED: uncharacterized protein LOC102602...   507   e-141
ref|XP_004249439.1| PREDICTED: uncharacterized protein LOC101257...   506   e-140
ref|XP_002282324.1| PREDICTED: uncharacterized protein LOC100248...   496   e-137
gb|EMJ04158.1| hypothetical protein PRUPE_ppa015076mg [Prunus pe...   486   e-134
ref|XP_006443380.1| hypothetical protein CICLE_v10020171mg [Citr...   483   e-133
ref|XP_004249440.1| PREDICTED: uncharacterized protein LOC101257...   481   e-133
ref|XP_002282336.1| PREDICTED: uncharacterized protein LOC100248...   478   e-132
ref|XP_003540247.1| PREDICTED: uncharacterized protein LOC100810...   471   e-130
ref|XP_006443379.1| hypothetical protein CICLE_v10020171mg [Citr...   468   e-129
gb|ESW22064.1| hypothetical protein PHAVU_005G123900g [Phaseolus...   462   e-127

>ref|XP_002525443.1| transcription factor, putative [Ricinus communis]
            gi|223535256|gb|EEF36933.1| transcription factor,
            putative [Ricinus communis]
          Length = 419

 Score =  540 bits (1390), Expect = e-151
 Identities = 281/419 (67%), Positives = 325/419 (77%), Gaps = 5/419 (1%)
 Frame = +3

Query: 165  MYPHH-HQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 341
            MY HH HQGK++H+SSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWT DLHE FI
Sbjct: 1    MYHHHQHQGKSVHSSSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTSDLHEHFI 60

Query: 342  EAVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNK--SPAA 515
            EAVNQLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHGQANSG NK  + A 
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANSGSNKIGTGAV 120

Query: 516  AEERISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQG 695
              +RISE + T  +N S+  QTNK LHIGEA+QMQIEVQRRLHEQLEVQRHLQLRIEAQG
Sbjct: 121  VGDRISETNVTHINNLSMGTQTNKGLHIGEALQMQIEVQRRLHEQLEVQRHLQLRIEAQG 180

Query: 696  KYLQAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGLCXXX 875
            KYLQ+VLEKAQETLGRQNLGS+GLEAAKVQ+SELVSKVSTQCLNSAFS++KEL GLC   
Sbjct: 181  KYLQSVLEKAQETLGRQNLGSIGLEAAKVQLSELVSKVSTQCLNSAFSELKELQGLCHQQ 240

Query: 876  XXXXXXXDCSMDSCLTSCEGPLRDQDMHNNKMAIGTLNFKTVMESKNISNEPSLRQSEQR 1055
                   DCSMDSCLTSCEG  ++Q++HN  M +   N   ++ESK+I+    L Q+E +
Sbjct: 241  TQTAPPTDCSMDSCLTSCEGSQKEQEIHNTGMGLRPYNGNALLESKDITEGHVLHQTELK 300

Query: 1056 WCEDLNDKRRFLLSM-NEEAEKECAMEKSCSNLSMSIGLQG-GWNTNIYSEKGITETNRD 1229
            W EDL D + FL  + N  A +  A E+S S+LSM++GLQG   N + +SE    + N  
Sbjct: 301  WSEDLKDNKMFLSPLGNNAARRNFAAERSTSDLSMTVGLQGENGNASSFSEGRYKDRNDG 360

Query: 1230 TKFFDQPCSRHDAVPTVKQRSSQEYKMPFFAPKLDLNTDDETDAASKCKQFDLNGFSWS 1406
              F DQ     D+V   K   SQ Y++P+FA KLDLN+ +E DAAS CKQ DLNGFSW+
Sbjct: 361  DSFPDQTNKSLDSVKLPKGDVSQGYRLPYFATKLDLNSHEEIDAASSCKQLDLNGFSWN 419


>ref|XP_004249601.1| PREDICTED: uncharacterized protein LOC101256236 [Solanum
            lycopersicum]
          Length = 414

 Score =  539 bits (1388), Expect = e-150
 Identities = 280/419 (66%), Positives = 319/419 (76%), Gaps = 5/419 (1%)
 Frame = +3

Query: 165  MYPHHHQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIE 344
            MY HHHQ K++H S+RMS+P ERHLFLQGGNG GDSGLVLSTDAKPRLKWTPDLHERFIE
Sbjct: 1    MYHHHHQDKSMHPSTRMSVP-ERHLFLQGGNGNGDSGLVLSTDAKPRLKWTPDLHERFIE 59

Query: 345  AVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNKSPAAAEE 524
            AVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQAN+ G    AA  E
Sbjct: 60   AVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANASGANKAAAGVE 119

Query: 525  RISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYL 704
            RISE S T  SN S+ PQ NKN+ I EAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYL
Sbjct: 120  RISENSATCMSNPSMVPQPNKNIQISEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYL 179

Query: 705  QAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGLCXXXXXX 884
            Q+VLEKAQETLGRQN+ ++GLEA KVQ+SE VSK S QCLNS F+D+KELSG        
Sbjct: 180  QSVLEKAQETLGRQNMETVGLEAVKVQLSEFVSKASNQCLNSPFTDIKELSGFHSQQTQA 239

Query: 885  XXXXDCSMDSCLTSCEGPLRDQDMHNNKMAIGTLNFKTVMESKNISNEPSLRQSEQRWCE 1064
                D S+DSCLTS +G LRD  MH+N++ +    F   +E K+I N+  L+Q+E RWC+
Sbjct: 240  TQPTDRSIDSCLTSRDGSLRDNTMHDNQIGLRPFGFTPSIECKDIENDTRLQQTELRWCD 299

Query: 1065 DLNDKRRFLLSMNEEAEKECAMEKSCSNLSMSIGLQ-----GGWNTNIYSEKGITETNRD 1229
            +L + RR    MNE  EK    E +C+NLSMSIGLQ     G  N   +S+     T RD
Sbjct: 300  NLKENRRLFSPMNEGREKTFTRETNCNNLSMSIGLQDEKLNGSMN---HSDGNFNGTERD 356

Query: 1230 TKFFDQPCSRHDAVPTVKQRSSQEYKMPFFAPKLDLNTDDETDAASKCKQFDLNGFSWS 1406
             K F Q  +R ++VP  + +SSQEYK+ +F PKLDLN  DETDAAS CKQFDLNGFSWS
Sbjct: 357  VKLFHQVTNRSESVPQ-RHKSSQEYKLSYFEPKLDLNMHDETDAASSCKQFDLNGFSWS 414


>ref|XP_006338933.1| PREDICTED: uncharacterized protein LOC102592272 isoform X1 [Solanum
            tuberosum] gi|565343634|ref|XP_006338934.1| PREDICTED:
            uncharacterized protein LOC102592272 isoform X2 [Solanum
            tuberosum]
          Length = 416

 Score =  533 bits (1372), Expect = e-148
 Identities = 282/420 (67%), Positives = 321/420 (76%), Gaps = 7/420 (1%)
 Frame = +3

Query: 165  MYPHHHQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIE 344
            MY HHHQ K++H S+RMS+P ERHLFLQGGNG GDSGLVLSTDAKPRLKWTPDLHERFIE
Sbjct: 1    MYHHHHQEKSMHPSTRMSVP-ERHLFLQGGNGNGDSGLVLSTDAKPRLKWTPDLHERFIE 59

Query: 345  AVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQAN-SGGNKSPAAAE 521
            AVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQAN SG NK+ A A 
Sbjct: 60   AVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANASGTNKAVAVAG 119

Query: 522  -ERISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGK 698
             ERISE S T  SN S+ PQ NKN+ I EAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGK
Sbjct: 120  VERISENSATCMSNPSMVPQPNKNIQISEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGK 179

Query: 699  YLQAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGLCXXXX 878
            YLQ+VLEKAQETLGRQN+ ++GLEA KVQ+SE VSK S QCLNS F D+KELSG      
Sbjct: 180  YLQSVLEKAQETLGRQNMETVGLEAVKVQLSEFVSKASNQCLNSPFPDIKELSGFHSQHT 239

Query: 879  XXXXXXDCSMDSCLTSCEGPLRDQDMHNNKMAIGTLNFKTVMESKNISNEPSLRQSEQRW 1058
                  D S+DSCLTS +G LRD  MH+N++ +   +F   +E K+I N+  L+Q+E RW
Sbjct: 240  QATQPTDRSIDSCLTSRDGSLRDNTMHDNQIGLRPFDFTPSIECKDIENDARLQQTELRW 299

Query: 1059 CEDLNDKRRFLLSMNEEAEKECAMEKSCSNLSMSIGLQ-----GGWNTNIYSEKGITETN 1223
            C++L + RR    MNE  EK    E +C+NLSMSIGLQ     G  N   +S+     T 
Sbjct: 300  CDNLKENRRLFSPMNEGREKTFTRETNCNNLSMSIGLQDEKLNGSMN---HSDGSFNGTE 356

Query: 1224 RDTKFFDQPCSRHDAVPTVKQRSSQEYKMPFFAPKLDLNTDDETDAASKCKQFDLNGFSW 1403
            RD K F Q  +R ++VP  + +SSQEYK+ +F PKLDLN  DETDAAS CKQFDLNGFSW
Sbjct: 357  RDVKLFHQVTNRSESVPQ-RHKSSQEYKLSYFQPKLDLNMHDETDAASSCKQFDLNGFSW 415


>gb|EOY11198.1| Homeodomain-like superfamily protein isoform 1 [Theobroma cacao]
          Length = 478

 Score =  531 bits (1367), Expect = e-148
 Identities = 278/425 (65%), Positives = 324/425 (76%), Gaps = 10/425 (2%)
 Frame = +3

Query: 162  EMYPHHHQ--GKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHER 335
            +MY HHHQ  GKNIH SSRM IPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHER
Sbjct: 64   KMYHHHHQHQGKNIHPSSRMPIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHER 123

Query: 336  FIEAVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNK--SP 509
            FIEAVNQLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHGQAN+G NK  + 
Sbjct: 124  FIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANNGSNKIGAV 183

Query: 510  AAAEERISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEA 689
            A A +R+SE +GT  +N SI PQ N  L IGEA+QMQIEVQRRLHEQLEVQRHLQLRIEA
Sbjct: 184  AMAGDRMSEANGTHVNNLSIGPQANNGLQIGEALQMQIEVQRRLHEQLEVQRHLQLRIEA 243

Query: 690  QGKYLQAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGLCX 869
            QGKYLQAVLEKAQETLGRQNLGS+GLEAAKVQ+SELVSKVS QCLNSAFSD+K+L GLC 
Sbjct: 244  QGKYLQAVLEKAQETLGRQNLGSVGLEAAKVQLSELVSKVSNQCLNSAFSDLKDLQGLCP 303

Query: 870  XXXXXXXXXDCSMDSCLTSCEGPLRDQDMHNNKMAIGTLNFK-TVMESKNISNEPSLRQS 1046
                     DCSMDSCLTSCEG  ++Q++HNN M +   N    ++E + I+ +P L Q+
Sbjct: 304  QQTQATPPTDCSMDSCLTSCEGSQKEQEIHNNGMCLRPYNTSGALLEQREIAEDPLLPQT 363

Query: 1047 EQRWCEDLNDKRRFLLSMNEEAEKECAM-EKSCSNLSMSIGLQG----GWNTNIYSEKGI 1211
            E +  ED+ + + FL S+ ++AE+     ++S S+LSMS+GLQG    G N++ +SE   
Sbjct: 364  ELKSFEDIKENKMFLSSLGKDAERRMFFADRSSSDLSMSVGLQGEKGNGGNSSSFSEAKF 423

Query: 1212 TETNRDTKFFDQPCSRHDAVPTVKQRSSQEYKMPFFAPKLDLNTDDETDAASKCKQFDLN 1391
               N D  F D+   R D V           ++P+FA KLDLN  +E DAAS CKQFDLN
Sbjct: 424  KGRNEDDSFLDRGNKRADEV----------NRLPYFATKLDLNVHEENDAASSCKQFDLN 473

Query: 1392 GFSWS 1406
            G SW+
Sbjct: 474  GLSWN 478


>gb|EOY11199.1| Homeodomain-like superfamily protein isoform 2 [Theobroma cacao]
            gi|508719303|gb|EOY11200.1| Homeodomain-like superfamily
            protein isoform 2 [Theobroma cacao]
          Length = 414

 Score =  530 bits (1366), Expect = e-148
 Identities = 278/424 (65%), Positives = 323/424 (76%), Gaps = 10/424 (2%)
 Frame = +3

Query: 165  MYPHHHQ--GKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERF 338
            MY HHHQ  GKNIH SSRM IPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERF
Sbjct: 1    MYHHHHQHQGKNIHPSSRMPIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERF 60

Query: 339  IEAVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNK--SPA 512
            IEAVNQLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHGQAN+G NK  + A
Sbjct: 61   IEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANNGSNKIGAVA 120

Query: 513  AAEERISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQ 692
             A +R+SE +GT  +N SI PQ N  L IGEA+QMQIEVQRRLHEQLEVQRHLQLRIEAQ
Sbjct: 121  MAGDRMSEANGTHVNNLSIGPQANNGLQIGEALQMQIEVQRRLHEQLEVQRHLQLRIEAQ 180

Query: 693  GKYLQAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGLCXX 872
            GKYLQAVLEKAQETLGRQNLGS+GLEAAKVQ+SELVSKVS QCLNSAFSD+K+L GLC  
Sbjct: 181  GKYLQAVLEKAQETLGRQNLGSVGLEAAKVQLSELVSKVSNQCLNSAFSDLKDLQGLCPQ 240

Query: 873  XXXXXXXXDCSMDSCLTSCEGPLRDQDMHNNKMAIGTLNFK-TVMESKNISNEPSLRQSE 1049
                    DCSMDSCLTSCEG  ++Q++HNN M +   N    ++E + I+ +P L Q+E
Sbjct: 241  QTQATPPTDCSMDSCLTSCEGSQKEQEIHNNGMCLRPYNTSGALLEQREIAEDPLLPQTE 300

Query: 1050 QRWCEDLNDKRRFLLSMNEEAEKECAM-EKSCSNLSMSIGLQG----GWNTNIYSEKGIT 1214
             +  ED+ + + FL S+ ++AE+     ++S S+LSMS+GLQG    G N++ +SE    
Sbjct: 301  LKSFEDIKENKMFLSSLGKDAERRMFFADRSSSDLSMSVGLQGEKGNGGNSSSFSEAKFK 360

Query: 1215 ETNRDTKFFDQPCSRHDAVPTVKQRSSQEYKMPFFAPKLDLNTDDETDAASKCKQFDLNG 1394
              N D  F D+   R D V           ++P+FA KLDLN  +E DAAS CKQFDLNG
Sbjct: 361  GRNEDDSFLDRGNKRADEV----------NRLPYFATKLDLNVHEENDAASSCKQFDLNG 410

Query: 1395 FSWS 1406
             SW+
Sbjct: 411  LSWN 414


>ref|XP_006339130.1| PREDICTED: uncharacterized protein LOC102602766 isoform X1 [Solanum
            tuberosum]
          Length = 415

 Score =  526 bits (1354), Expect = e-146
 Identities = 280/419 (66%), Positives = 320/419 (76%), Gaps = 5/419 (1%)
 Frame = +3

Query: 165  MYPHHHQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIE 344
            MY HHHQ  N+H S+RMS P ERHLFLQGGN  GDSGLVLSTDAKPRLKWTPDLHERFIE
Sbjct: 1    MYHHHHQASNMHPSTRMSFP-ERHLFLQGGNANGDSGLVLSTDAKPRLKWTPDLHERFIE 59

Query: 345  AVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNKSPAAAEE 524
            AV QLGGADKATPKSVLKLMGI GLTLYHLKSHLQKYRLSKN HGQAN  G    AA+ E
Sbjct: 60   AVTQLGGADKATPKSVLKLMGIPGLTLYHLKSHLQKYRLSKNHHGQANLSGVNKAAASME 119

Query: 525  RISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYL 704
            +I E +G+ TSN SI PQ N N+ I EAIQMQI+VQRRLHEQLEVQRHLQLRIEAQGKYL
Sbjct: 120  KICESTGSPTSNPSIGPQPNNNIPISEAIQMQIDVQRRLHEQLEVQRHLQLRIEAQGKYL 179

Query: 705  QAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGL-CXXXXX 881
            QAVLEKAQETLG QNLG++G EAAKVQ+S+LVSKVS QCLNSAFS+++ELSG        
Sbjct: 180  QAVLEKAQETLGTQNLGTIGFEAAKVQLSDLVSKVSNQCLNSAFSEIQELSGFHTPQTQA 239

Query: 882  XXXXXDCSMDSCLTSCEGPLRD-QDMHNNKMAIGTLNFKTVMESKNISNEPSLRQSEQRW 1058
                 DCSMDSCLTS EGPLRD Q+MHNN++ + TLNF    E   I N+  L+Q+  RW
Sbjct: 240  TQRLADCSMDSCLTSSEGPLRDLQEMHNNQLGLRTLNFGPCTE--EIENQTRLQQTALRW 297

Query: 1059 CEDLNDKRRFLLSMNEEAEKECAMEKSCSNLSMSIGLQGGWN--TNIYSEKGITETNRDT 1232
             +DL + R F   M+E+ EKE A E + SNLSM++G+QGG     + Y +  +   + D 
Sbjct: 298  RDDLKENRLF-PKMDEDTEKEFAKETNWSNLSMNVGIQGGKRNVNSSYVDGRLNGIDADI 356

Query: 1233 KFFDQPCS-RHDAVPTVKQRSSQEYKMPFFAPKLDLNTDDETDAASKCKQFDLNGFSWS 1406
            K F Q  + R D+    KQ S QEYK+P+FAPKLDLNTDD+TDAAS CKQ DLNGFSW+
Sbjct: 357  KLFHQAATDRSDSTKPEKQVSPQEYKLPYFAPKLDLNTDDQTDAASNCKQLDLNGFSWN 415


>ref|XP_006339131.1| PREDICTED: uncharacterized protein LOC102602766 isoform X2 [Solanum
            tuberosum]
          Length = 414

 Score =  523 bits (1347), Expect = e-146
 Identities = 283/420 (67%), Positives = 323/420 (76%), Gaps = 6/420 (1%)
 Frame = +3

Query: 165  MYPHHHQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIE 344
            MY HHHQ  N+H S+RMS P ERHLFLQGGN  GDSGLVLSTDAKPRLKWTPDLHERFIE
Sbjct: 1    MYHHHHQASNMHPSTRMSFP-ERHLFLQGGNANGDSGLVLSTDAKPRLKWTPDLHERFIE 59

Query: 345  AVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQAN-SGGNKSPAAAE 521
            AV QLGGADKATPKSVLKLMGI GLTLYHLKSHLQKYRLSKN HGQAN SG NK  AA+ 
Sbjct: 60   AVTQLGGADKATPKSVLKLMGIPGLTLYHLKSHLQKYRLSKNHHGQANLSGVNK--AASM 117

Query: 522  ERISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKY 701
            E+I E +G+ TSN SI PQ N N+ I EAIQMQI+VQRRLHEQLEVQRHLQLRIEAQGKY
Sbjct: 118  EKICESTGSPTSNPSIGPQPNNNIPISEAIQMQIDVQRRLHEQLEVQRHLQLRIEAQGKY 177

Query: 702  LQAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGL-CXXXX 878
            LQAVLEKAQETLG QNLG++G EAAKVQ+S+LVSKVS QCLNSAFS+++ELSG       
Sbjct: 178  LQAVLEKAQETLGTQNLGTIGFEAAKVQLSDLVSKVSNQCLNSAFSEIQELSGFHTPQTQ 237

Query: 879  XXXXXXDCSMDSCLTSCEGPLRD-QDMHNNKMAIGTLNFKTVMESKNISNEPSLRQSEQR 1055
                  DCSMDSCLTS EGPLRD Q+MHNN++ + TLNF    E   I N+  L+Q+  R
Sbjct: 238  ATQRLADCSMDSCLTSSEGPLRDLQEMHNNQLGLRTLNFGPCTE--EIENQTRLQQTALR 295

Query: 1056 WCEDLNDKRRFLLSMNEEAEKECAMEKSCSNLSMSIGLQGGWN--TNIYSEKGITETNRD 1229
            W +DL + R F   M+E+ EKE A E + SNLSM++G+QGG     + Y +  +   + D
Sbjct: 296  WRDDLKENRLF-PKMDEDTEKEFAKETNWSNLSMNVGIQGGKRNVNSSYVDGRLNGIDAD 354

Query: 1230 TKFFDQPCS-RHDAVPTVKQRSSQEYKMPFFAPKLDLNTDDETDAASKCKQFDLNGFSWS 1406
             K F Q  + R D+    KQ S QEYK+P+FAPKLDLNTDD+TDAAS CKQ DLNGFSW+
Sbjct: 355  IKLFHQAATDRSDSTKPEKQVSPQEYKLPYFAPKLDLNTDDQTDAASNCKQLDLNGFSWN 414


>ref|XP_002325408.2| myb family transcription factor family protein [Populus trichocarpa]
            gi|550316805|gb|EEE99789.2| myb family transcription
            factor family protein [Populus trichocarpa]
          Length = 420

 Score =  520 bits (1338), Expect = e-144
 Identities = 268/420 (63%), Positives = 323/420 (76%), Gaps = 6/420 (1%)
 Frame = +3

Query: 165  MYPHH-HQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 341
            MY HH HQGKNIH+SSR SIPPERHLFLQ GNGPGDSGLVLSTDAKPRLKWTPDLHERFI
Sbjct: 1    MYQHHQHQGKNIHSSSRNSIPPERHLFLQVGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 60

Query: 342  EAVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNKSPAAAE 521
            EAVNQLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHGQANSG NKS   A 
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANSGSNKSGTVAV 120

Query: 522  --ERISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQG 695
              +R+ E + T  +N SI  QTNK+LH  EA+Q+QIEVQRRLHEQLEVQRHLQLRIEAQG
Sbjct: 121  VGDRMPEVNATHINNLSIGSQTNKSLHFSEALQVQIEVQRRLHEQLEVQRHLQLRIEAQG 180

Query: 696  KYLQAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGLCXXX 875
            KYLQ+VLEKAQETLGRQNLG++GLEAAKVQ+SELVSKVS++CLNSAFS++K+L GLC   
Sbjct: 181  KYLQSVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSSKCLNSAFSELKDLQGLCPPL 240

Query: 876  XXXXXXXDCSMDSCLTSCEGPLRDQDMHNNKMAIGTLNFKTVMESKNISNEPSLRQSEQR 1055
                   DCSMDSCLTS EG  ++Q++HN  M +   N   ++E K I+ E +L+Q+E +
Sbjct: 241  TQPTHPNDCSMDSCLTSIEGSQKEQEIHNTGMGLRPYNGNALLEPKVIAGEHALQQTELK 300

Query: 1056 WCEDLNDKRRFLLSMNEEAEKEC-AMEKSCSNLSMSIGLQG--GWNTNIYSEKGITETNR 1226
            W ED  D + FL SM  + ++   + E+SCSNLS+ +GLQG  G  ++ ++E      + 
Sbjct: 301  WGEDQRDNKMFLSSMRNDTDRRTFSAERSCSNLSIGVGLQGERGNVSSSFAEARFKGRSE 360

Query: 1227 DTKFFDQPCSRHDAVPTVKQRSSQEYKMPFFAPKLDLNTDDETDAASKCKQFDLNGFSWS 1406
            D  F D+   R DA+    ++ S  Y++ ++A KLDLN+  E DAAS C+Q DLNGFSW+
Sbjct: 361  DDSFQDKTNRRIDAIKLENEKLSPGYRLSYYATKLDLNSHGEIDAASGCRQLDLNGFSWN 420


>ref|XP_006338935.1| PREDICTED: uncharacterized protein LOC102592272 isoform X3 [Solanum
            tuberosum]
          Length = 410

 Score =  514 bits (1324), Expect = e-143
 Identities = 276/420 (65%), Positives = 315/420 (75%), Gaps = 7/420 (1%)
 Frame = +3

Query: 165  MYPHHHQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIE 344
            MY HHHQ K++H S+RMS+P ERHLFLQGGNG GDSGLVLSTDAKPRLKWTPDLHERFIE
Sbjct: 1    MYHHHHQEKSMHPSTRMSVP-ERHLFLQGGNGNGDSGLVLSTDAKPRLKWTPDLHERFIE 59

Query: 345  AVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQAN-SGGNKSPAAAE 521
            AVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQAN SG NK+ A A 
Sbjct: 60   AVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANASGTNKAVAVAG 119

Query: 522  -ERISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGK 698
             ERISE S T  SN S+ PQ NKN+ I EAIQMQIEVQRRLHEQLE      LRIEAQGK
Sbjct: 120  VERISENSATCMSNPSMVPQPNKNIQISEAIQMQIEVQRRLHEQLE------LRIEAQGK 173

Query: 699  YLQAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGLCXXXX 878
            YLQ+VLEKAQETLGRQN+ ++GLEA KVQ+SE VSK S QCLNS F D+KELSG      
Sbjct: 174  YLQSVLEKAQETLGRQNMETVGLEAVKVQLSEFVSKASNQCLNSPFPDIKELSGFHSQHT 233

Query: 879  XXXXXXDCSMDSCLTSCEGPLRDQDMHNNKMAIGTLNFKTVMESKNISNEPSLRQSEQRW 1058
                  D S+DSCLTS +G LRD  MH+N++ +   +F   +E K+I N+  L+Q+E RW
Sbjct: 234  QATQPTDRSIDSCLTSRDGSLRDNTMHDNQIGLRPFDFTPSIECKDIENDARLQQTELRW 293

Query: 1059 CEDLNDKRRFLLSMNEEAEKECAMEKSCSNLSMSIGLQ-----GGWNTNIYSEKGITETN 1223
            C++L + RR    MNE  EK    E +C+NLSMSIGLQ     G  N   +S+     T 
Sbjct: 294  CDNLKENRRLFSPMNEGREKTFTRETNCNNLSMSIGLQDEKLNGSMN---HSDGSFNGTE 350

Query: 1224 RDTKFFDQPCSRHDAVPTVKQRSSQEYKMPFFAPKLDLNTDDETDAASKCKQFDLNGFSW 1403
            RD K F Q  +R ++VP  + +SSQEYK+ +F PKLDLN  DETDAAS CKQFDLNGFSW
Sbjct: 351  RDVKLFHQVTNRSESVPQ-RHKSSQEYKLSYFQPKLDLNMHDETDAASSCKQFDLNGFSW 409


>ref|XP_002319702.2| myb family transcription factor family protein [Populus trichocarpa]
            gi|550325041|gb|EEE95625.2| myb family transcription
            factor family protein [Populus trichocarpa]
          Length = 427

 Score =  512 bits (1319), Expect = e-142
 Identities = 269/427 (62%), Positives = 321/427 (75%), Gaps = 13/427 (3%)
 Frame = +3

Query: 165  MYPHH-HQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 341
            MY HH HQGK+IH+SSRM+IPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI
Sbjct: 1    MYHHHQHQGKSIHSSSRMAIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 60

Query: 342  EAVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNK--SPAA 515
            EAVNQLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHGQAN G +K  + A 
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGSSKIGTVAV 120

Query: 516  AEERISEGSGTQTS--NSSIAPQTNK-----NLHIGEAIQMQIEVQRRLHEQLEVQRHLQ 674
              +R+ E + T  +  N SI  Q NK     +LH  EA+QMQIEVQRRLHEQLEVQRHLQ
Sbjct: 121  VGDRMPEANATHININNLSIGSQPNKILKSRSLHFSEALQMQIEVQRRLHEQLEVQRHLQ 180

Query: 675  LRIEAQGKYLQAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKEL 854
            LRIEAQGKYLQAVLEKAQETLGRQNLG++GLEAAKVQ+SELVSKVSTQCLNS FS++ +L
Sbjct: 181  LRIEAQGKYLQAVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLNSTFSELNDL 240

Query: 855  SGLCXXXXXXXXXXDCSMDSCLTSCEGPLRDQDMHNNKMAIGTLNFKTVMESKNISNEPS 1034
             GLC          DCSMDSCLTSCEG  ++Q++HN  M +   N   ++E K I+ E +
Sbjct: 241  QGLCPQQTPPTQPNDCSMDSCLTSCEGSQKEQEIHNIGMGLRPCNSNALLEPKEIAEEHA 300

Query: 1035 LRQSEQRWCEDLNDKRRFLLSMNEEAEKEC-AMEKSCSNLSMSIGLQG--GWNTNIYSEK 1205
            L+Q+E +W E L D + FL S+  E E+   + E+SCS+LS+ +GLQG  G   + ++E 
Sbjct: 301  LQQTELKWGEYLRDNKMFLTSIGHETERRTFSAERSCSDLSIGVGLQGEKGNINSSFAEG 360

Query: 1206 GITETNRDTKFFDQPCSRHDAVPTVKQRSSQEYKMPFFAPKLDLNTDDETDAASKCKQFD 1385
                 + D  F DQ   R ++V    ++ S  Y++ +F  KLDLN+ DE DAAS CKQ D
Sbjct: 361  RFKGMSEDDSFQDQTNKRAESVKFEDEKMSPGYRLSYFTTKLDLNSHDEIDAASSCKQLD 420

Query: 1386 LNGFSWS 1406
            LNGFSW+
Sbjct: 421  LNGFSWN 427


>ref|XP_006339132.1| PREDICTED: uncharacterized protein LOC102602766 isoform X3 [Solanum
            tuberosum]
          Length = 409

 Score =  507 bits (1306), Expect = e-141
 Identities = 274/419 (65%), Positives = 314/419 (74%), Gaps = 5/419 (1%)
 Frame = +3

Query: 165  MYPHHHQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIE 344
            MY HHHQ  N+H S+RMS P ERHLFLQGGN  GDSGLVLSTDAKPRLKWTPDLHERFIE
Sbjct: 1    MYHHHHQASNMHPSTRMSFP-ERHLFLQGGNANGDSGLVLSTDAKPRLKWTPDLHERFIE 59

Query: 345  AVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNKSPAAAEE 524
            AV QLGGADKATPKSVLKLMGI GLTLYHLKSHLQKYRLSKN HGQAN  G    AA+ E
Sbjct: 60   AVTQLGGADKATPKSVLKLMGIPGLTLYHLKSHLQKYRLSKNHHGQANLSGVNKAAASME 119

Query: 525  RISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYL 704
            +I E +G+ TSN SI PQ N N+ I EAIQMQI+VQRRLHEQLE      LRIEAQGKYL
Sbjct: 120  KICESTGSPTSNPSIGPQPNNNIPISEAIQMQIDVQRRLHEQLE------LRIEAQGKYL 173

Query: 705  QAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGL-CXXXXX 881
            QAVLEKAQETLG QNLG++G EAAKVQ+S+LVSKVS QCLNSAFS+++ELSG        
Sbjct: 174  QAVLEKAQETLGTQNLGTIGFEAAKVQLSDLVSKVSNQCLNSAFSEIQELSGFHTPQTQA 233

Query: 882  XXXXXDCSMDSCLTSCEGPLRD-QDMHNNKMAIGTLNFKTVMESKNISNEPSLRQSEQRW 1058
                 DCSMDSCLTS EGPLRD Q+MHNN++ + TLNF    E   I N+  L+Q+  RW
Sbjct: 234  TQRLADCSMDSCLTSSEGPLRDLQEMHNNQLGLRTLNFGPCTE--EIENQTRLQQTALRW 291

Query: 1059 CEDLNDKRRFLLSMNEEAEKECAMEKSCSNLSMSIGLQGGWN--TNIYSEKGITETNRDT 1232
             +DL + R F   M+E+ EKE A E + SNLSM++G+QGG     + Y +  +   + D 
Sbjct: 292  RDDLKENRLF-PKMDEDTEKEFAKETNWSNLSMNVGIQGGKRNVNSSYVDGRLNGIDADI 350

Query: 1233 KFFDQPCS-RHDAVPTVKQRSSQEYKMPFFAPKLDLNTDDETDAASKCKQFDLNGFSWS 1406
            K F Q  + R D+    KQ S QEYK+P+FAPKLDLNTDD+TDAAS CKQ DLNGFSW+
Sbjct: 351  KLFHQAATDRSDSTKPEKQVSPQEYKLPYFAPKLDLNTDDQTDAASNCKQLDLNGFSWN 409


>ref|XP_004249439.1| PREDICTED: uncharacterized protein LOC101257914 isoform 1 [Solanum
            lycopersicum]
          Length = 409

 Score =  506 bits (1302), Expect = e-140
 Identities = 273/419 (65%), Positives = 315/419 (75%), Gaps = 5/419 (1%)
 Frame = +3

Query: 165  MYPHHHQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIE 344
            MY HHHQ  N+H S+RMS P ERHLFLQGGN  GDSGLVLSTDAKPRLKWTPDLHERFIE
Sbjct: 1    MYHHHHQAPNMHPSTRMSFP-ERHLFLQGGNANGDSGLVLSTDAKPRLKWTPDLHERFIE 59

Query: 345  AVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNKSPAAAEE 524
            AV QLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKN HGQAN  G    AA+ E
Sbjct: 60   AVTQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNHHGQANISGVNKAAASME 119

Query: 525  RISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYL 704
            +I E +G+  SN SI  Q N N+ I EAIQMQI+VQRRLHEQLE      LRIEAQGKYL
Sbjct: 120  KICESTGSPKSNPSIGHQPNNNIPISEAIQMQIDVQRRLHEQLE------LRIEAQGKYL 173

Query: 705  QAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGL-CXXXXX 881
            QAVLEKAQETLG QNLG++GLEAAKVQ+S+LVSKVS QCLNSAFS++KELSG        
Sbjct: 174  QAVLEKAQETLGTQNLGTIGLEAAKVQLSDLVSKVSNQCLNSAFSEIKELSGFHTPQTQA 233

Query: 882  XXXXXDCSMDSCLTSCEGPLRD-QDMHNNKMAIGTLNFKTVMESKNISNEPSLRQSEQRW 1058
                 DCSMDSCLTS EGPLRD Q+MHNN++ +  LNF+   E   I N+  L+Q+  RW
Sbjct: 234  TQRLADCSMDSCLTSSEGPLRDLQEMHNNQLGLRNLNFRPCTE--EIENQTRLQQTALRW 291

Query: 1059 CEDLNDKRRFLLSMNEEAEKECAMEKSCSNLSMSIGLQGGWN--TNIYSEKGITETNRDT 1232
             +DL + R F   ++E+ EKE A E + SNLSM++G+QGG     + Y ++ +   + D 
Sbjct: 292  RDDLKENRLF-PKIDEDTEKEFAKETNWSNLSMNVGIQGGKRNVNSSYVDERLNGIDADI 350

Query: 1233 KFFDQPCS-RHDAVPTVKQRSSQEYKMPFFAPKLDLNTDDETDAASKCKQFDLNGFSWS 1406
            K F Q  + R D+    KQ S QEYK+P+FAPKLDLNTDD+TDAAS CKQ DLNGFSW+
Sbjct: 351  KLFHQTATDRSDSTKPEKQVSPQEYKLPYFAPKLDLNTDDQTDAASNCKQLDLNGFSWN 409


>ref|XP_002282324.1| PREDICTED: uncharacterized protein LOC100248614 isoform 1 [Vitis
            vinifera]
          Length = 418

 Score =  496 bits (1277), Expect = e-137
 Identities = 266/418 (63%), Positives = 310/418 (74%), Gaps = 7/418 (1%)
 Frame = +3

Query: 174  HHHQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVN 353
            HHHQGKNIH SSR  I PER+LFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVN
Sbjct: 5    HHHQGKNIHPSSRTPITPERNLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVN 64

Query: 354  QLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNKSPAAAEERIS 533
            QLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHGQANS  +K+     ER+ 
Sbjct: 65   QLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANSATSKT--VVGERMP 122

Query: 534  EGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAV 713
            E +G   S+ +I  QTNK+LH+ E +QM IE QRRLHEQLEVQRHLQLRIEAQGKYLQAV
Sbjct: 123  EANGALMSSPNIGNQTNKSLHLSETLQM-IEAQRRLHEQLEVQRHLQLRIEAQGKYLQAV 181

Query: 714  LEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGLCXXXXXXXXX 893
            LEKAQETLGRQNLG++GLEAAKVQ+SELVSKVSTQCL+SAFS++KEL  LC         
Sbjct: 182  LEKAQETLGRQNLGAVGLEAAKVQLSELVSKVSTQCLHSAFSELKELQSLC-PQQTQTQP 240

Query: 894  XDCSMDSCLTSCEGPLRDQDMHNNKMAIGT-LNFKTVMESKNISNEPSLRQSEQRWCEDL 1070
             DCSMDSCLTSCEG  R+Q++HN  M +    N  T +E+K+ +  P L+ +  +WCED 
Sbjct: 241  TDCSMDSCLTSCEGSQREQEIHNCGMGLRPYTNGSTPLEAKDTAEPPGLQHTVLKWCEDT 300

Query: 1071 NDKRRFLLSMNEEAEKE-CAMEKSCSNLSMSIGLQG--GWNTNIYSEKGITETNRDTKFF 1241
             + R+F+ SM  +AE+     E+S S+LSM IGLQG  G  +N YSE           F 
Sbjct: 301  KENRQFISSMQRDAERRTMTAERSNSDLSMRIGLQGEKGNGSNSYSEGRFKGRAEADNFV 360

Query: 1242 DQPCSRHDAVPTVKQ---RSSQEYKMPFFAPKLDLNTDDETDAASKCKQFDLNGFSWS 1406
            D+     D+  +VKQ   + S  Y++P F  KLDLN  DE D    CKQFDLNGFSW+
Sbjct: 361  DRTNHGADSGNSVKQENEKMSHGYRLPCFGAKLDLNAHDENDVTLSCKQFDLNGFSWN 418


>gb|EMJ04158.1| hypothetical protein PRUPE_ppa015076mg [Prunus persica]
          Length = 421

 Score =  486 bits (1250), Expect = e-134
 Identities = 261/422 (61%), Positives = 314/422 (74%), Gaps = 9/422 (2%)
 Frame = +3

Query: 168  YPHHHQGKNIH----ASSRMSIPPERHLFLQGG-NGPGDSGLVLSTDAKPRLKWTPDLHE 332
            + H HQGKNIH    ASSRMSIPPERHL+LQG  NGPG+SGLVLSTDAKPRLKWTPDLHE
Sbjct: 14   HQHQHQGKNIHSSSSASSRMSIPPERHLYLQGDQNGPGESGLVLSTDAKPRLKWTPDLHE 73

Query: 333  RFIEAVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNKSPA 512
            RFIEAVNQLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHG A SG +K   
Sbjct: 74   RFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGHATSGTSKIAL 133

Query: 513  AAEERISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQ 692
               E       T  +N  +     + LHI E +QMQIEVQRRLHEQLEVQRHLQLRIEAQ
Sbjct: 134  DPNE-------TYNNNGIL---NCRGLHISETLQMQIEVQRRLHEQLEVQRHLQLRIEAQ 183

Query: 693  GKYLQAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGLCXX 872
            GKYLQ+VLEKAQETLGRQNLG++GLEAAKVQ+SELVSKVSTQCLNSAF+++KEL GLC  
Sbjct: 184  GKYLQSVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLNSAFTELKELQGLCPQ 243

Query: 873  XXXXXXXXDCSMDSCLTSCEGPLRDQDMHNNKMAI-GTLNFKTVMESKNISNEPSLRQSE 1049
                    DCSM+SCLTSCEG  +DQ++HN+ M +    N + +++ K    EP L+++E
Sbjct: 244  QTQTTQPTDCSMESCLTSCEGSKKDQEIHNSAMGLRANYNGRELLDEK----EPMLQKTE 299

Query: 1050 QRWCEDLNDKRRFLLSM-NEEAEKECAMEKSCSNLSMSIGLQG-GWNTNIYSEKGITETN 1223
             +WCE+L +    L S+ N+ A++   +E+S S+LSMSIG QG  WN N  SE+ +   +
Sbjct: 300  LKWCEELKENNMLLSSISNDAAKRMFPVERSSSDLSMSIGCQGERWNINGNSEERLKGRS 359

Query: 1224 RDTKFFDQPCSRHDAVPTVKQRSSQEYK-MPFFAPKLDLNTDDETDAASKCKQFDLNGFS 1400
             D  F D+  +R D+     ++ S+  + +P+FA KLDLNT D+ DA S CKQFDLNGFS
Sbjct: 360  TDVSFLDRTNNRADSAKAETEKVSRGCRSVPYFAAKLDLNTHDDNDAPSSCKQFDLNGFS 419

Query: 1401 WS 1406
            WS
Sbjct: 420  WS 421


>ref|XP_006443380.1| hypothetical protein CICLE_v10020171mg [Citrus clementina]
            gi|568850794|ref|XP_006479082.1| PREDICTED:
            uncharacterized protein LOC102612777 isoform X1 [Citrus
            sinensis] gi|568850796|ref|XP_006479083.1| PREDICTED:
            uncharacterized protein LOC102612777 isoform X2 [Citrus
            sinensis] gi|557545642|gb|ESR56620.1| hypothetical
            protein CICLE_v10020171mg [Citrus clementina]
          Length = 401

 Score =  483 bits (1243), Expect = e-133
 Identities = 259/418 (61%), Positives = 300/418 (71%), Gaps = 4/418 (0%)
 Frame = +3

Query: 165  MYPHH-HQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 341
            MY HH +QGK++H+SSRM IP ERHLFLQGG+GPGDSGLVLSTDAKPRLKWTPDLHERFI
Sbjct: 1    MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFI 60

Query: 342  EAVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNK-SPAAA 518
            EAVNQLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHGQAN G NK  P   
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIGPVTV 120

Query: 519  E-ERISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQG 695
              ER+ E + T  +N SI PQ NK+LHI E IQMQIEVQRRLHEQLEVQRHLQLRIEAQG
Sbjct: 121  PGERMPEANATHMNNLSIGPQPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQG 180

Query: 696  KYLQAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGLCXXX 875
            KYLQAVLEKAQETLGRQNLG+ GLEAAKVQ+SELVSKVSTQCLNS FSD+KEL G C   
Sbjct: 181  KYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQ 240

Query: 876  XXXXXXXDCSMDSCLTSCEGPLRDQDMHNNKMAIGTLNFKTVMESKNISNEPSLRQSEQR 1055
                   DCSMDSCLTSCEG  +DQ++HN  + +   +    +E K I  EP L+Q+E +
Sbjct: 241  PQANQPTDCSMDSCLTSCEGSQKDQEIHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTELK 300

Query: 1056 WCEDLNDKRRFLLSMNEEAEKECAMEKSCSNLSMSIGLQGGWNTNIYSEKGITETNRDTK 1235
            W +DL +  +FL S+ +        ++    LS+  G         +       +N D  
Sbjct: 301  WRKDLKES-KFLSSIGK--------DRGPGELSIGSG--------SFPAGRFKASNEDEH 343

Query: 1236 FFDQPCSRHDAVPTVKQRSSQEYKMPFFAPKLDLNT-DDETDAASKCKQFDLNGFSWS 1406
            F DQ   + +      +    EY++P F+ KLDLN  D E D AS CKQFDLNGFSW+
Sbjct: 344  FQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN 401


>ref|XP_004249440.1| PREDICTED: uncharacterized protein LOC101257914 isoform 2 [Solanum
            lycopersicum]
          Length = 398

 Score =  481 bits (1237), Expect = e-133
 Identities = 262/419 (62%), Positives = 304/419 (72%), Gaps = 5/419 (1%)
 Frame = +3

Query: 165  MYPHHHQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIE 344
            MY HHHQ  N+H S+RMS P ERHLFLQGGN  GDSGLVLSTDAKPRLKWTPDLHERFIE
Sbjct: 1    MYHHHHQAPNMHPSTRMSFP-ERHLFLQGGNANGDSGLVLSTDAKPRLKWTPDLHERFIE 59

Query: 345  AVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNKSPAAAEE 524
            AV QLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKN HGQAN  G    AA+ E
Sbjct: 60   AVTQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNHHGQANISGVNKAAASME 119

Query: 525  RISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYL 704
            +I E +G+  SN SI  Q N N+ I EAIQMQI+VQRRLHEQLE                
Sbjct: 120  KICESTGSPKSNPSIGHQPNNNIPISEAIQMQIDVQRRLHEQLE---------------- 163

Query: 705  QAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGL-CXXXXX 881
             AVLEKAQETLG QNLG++GLEAAKVQ+S+LVSKVS QCLNSAFS++KELSG        
Sbjct: 164  -AVLEKAQETLGTQNLGTIGLEAAKVQLSDLVSKVSNQCLNSAFSEIKELSGFHTPQTQA 222

Query: 882  XXXXXDCSMDSCLTSCEGPLRD-QDMHNNKMAIGTLNFKTVMESKNISNEPSLRQSEQRW 1058
                 DCSMDSCLTS EGPLRD Q+MHNN++ +  LNF+   E   I N+  L+Q+  RW
Sbjct: 223  TQRLADCSMDSCLTSSEGPLRDLQEMHNNQLGLRNLNFRPCTE--EIENQTRLQQTALRW 280

Query: 1059 CEDLNDKRRFLLSMNEEAEKECAMEKSCSNLSMSIGLQGGWN--TNIYSEKGITETNRDT 1232
             +DL + R F   ++E+ EKE A E + SNLSM++G+QGG     + Y ++ +   + D 
Sbjct: 281  RDDLKENRLF-PKIDEDTEKEFAKETNWSNLSMNVGIQGGKRNVNSSYVDERLNGIDADI 339

Query: 1233 KFFDQPCS-RHDAVPTVKQRSSQEYKMPFFAPKLDLNTDDETDAASKCKQFDLNGFSWS 1406
            K F Q  + R D+    KQ S QEYK+P+FAPKLDLNTDD+TDAAS CKQ DLNGFSW+
Sbjct: 340  KLFHQTATDRSDSTKPEKQVSPQEYKLPYFAPKLDLNTDDQTDAASNCKQLDLNGFSWN 398


>ref|XP_002282336.1| PREDICTED: uncharacterized protein LOC100248614 isoform 2 [Vitis
            vinifera]
          Length = 412

 Score =  478 bits (1229), Expect = e-132
 Identities = 260/418 (62%), Positives = 304/418 (72%), Gaps = 7/418 (1%)
 Frame = +3

Query: 174  HHHQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVN 353
            HHHQGKNIH SSR  I PER+LFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVN
Sbjct: 5    HHHQGKNIHPSSRTPITPERNLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFIEAVN 64

Query: 354  QLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNKSPAAAEERIS 533
            QLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHGQANS  +K+     ER+ 
Sbjct: 65   QLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANSATSKT--VVGERMP 122

Query: 534  EGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAV 713
            E +G   S+ +I  QTNK+LH+ E +QM IE QRRLHEQLE      LRIEAQGKYLQAV
Sbjct: 123  EANGALMSSPNIGNQTNKSLHLSETLQM-IEAQRRLHEQLE------LRIEAQGKYLQAV 175

Query: 714  LEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGLCXXXXXXXXX 893
            LEKAQETLGRQNLG++GLEAAKVQ+SELVSKVSTQCL+SAFS++KEL  LC         
Sbjct: 176  LEKAQETLGRQNLGAVGLEAAKVQLSELVSKVSTQCLHSAFSELKELQSLC-PQQTQTQP 234

Query: 894  XDCSMDSCLTSCEGPLRDQDMHNNKMAIGT-LNFKTVMESKNISNEPSLRQSEQRWCEDL 1070
             DCSMDSCLTSCEG  R+Q++HN  M +    N  T +E+K+ +  P L+ +  +WCED 
Sbjct: 235  TDCSMDSCLTSCEGSQREQEIHNCGMGLRPYTNGSTPLEAKDTAEPPGLQHTVLKWCEDT 294

Query: 1071 NDKRRFLLSMNEEAEKE-CAMEKSCSNLSMSIGLQG--GWNTNIYSEKGITETNRDTKFF 1241
             + R+F+ SM  +AE+     E+S S+LSM IGLQG  G  +N YSE           F 
Sbjct: 295  KENRQFISSMQRDAERRTMTAERSNSDLSMRIGLQGEKGNGSNSYSEGRFKGRAEADNFV 354

Query: 1242 DQPCSRHDAVPTVKQ---RSSQEYKMPFFAPKLDLNTDDETDAASKCKQFDLNGFSWS 1406
            D+     D+  +VKQ   + S  Y++P F  KLDLN  DE D    CKQFDLNGFSW+
Sbjct: 355  DRTNHGADSGNSVKQENEKMSHGYRLPCFGAKLDLNAHDENDVTLSCKQFDLNGFSWN 412


>ref|XP_003540247.1| PREDICTED: uncharacterized protein LOC100810396 [Glycine max]
          Length = 420

 Score =  471 bits (1213), Expect = e-130
 Identities = 260/425 (61%), Positives = 312/425 (73%), Gaps = 11/425 (2%)
 Frame = +3

Query: 165  MYPHH-HQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 341
            MY HH HQGKNIH+SSRM IP ERH+FLQ GNG GDSGLVLSTDAKPRLKWTPDLH RFI
Sbjct: 1    MYHHHQHQGKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFI 60

Query: 342  EAVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNK--SPAA 515
            EAVNQLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHGQ+N+   K  + A+
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTYKITTSAS 120

Query: 516  AEERISEGSGTQTSNSSIAPQTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQG 695
              ER+SE +GT  +  S+ PQ NK+LHI EA+QMQIEVQRRL+EQLEVQRHLQLRIEAQG
Sbjct: 121  TGERLSETNGTHMNKLSLGPQANKDLHISEALQMQIEVQRRLNEQLEVQRHLQLRIEAQG 180

Query: 696  KYLQAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGLCXXX 875
            KYLQ+VLEKAQETLGRQNLG +G+EAAKVQ+SELVSKVS+QCLNSAF++ K+L G     
Sbjct: 181  KYLQSVLEKAQETLGRQNLGVVGIEAAKVQLSELVSKVSSQCLNSAFTEPKDLQGFFPQQ 240

Query: 876  XXXXXXXDCSMDSCLTSCEGPLRDQDMHNNKMAIGTLNFKTVMESKNISNEP-SLRQSEQ 1052
                   DCSMDSCLTS +   ++Q++ N    +   N    ME K  +  P +LR  E 
Sbjct: 241  TQTNPPNDCSMDSCLTSSDRSQKEQEIQN---GLRHFNSHVFMEHKEATEAPNNLRNPEL 297

Query: 1053 RWCEDLNDKRRFL--LSMNEEAEKECAMEKSCSNLSMSIGLQGGWNT--NIYSEKGITET 1220
            +WCED   K  FL  LS NEE  +  A E S +NLSMSIGL+       N+Y E+ ITE+
Sbjct: 298  KWCED-GKKNTFLAPLSKNEE-RRNYAAESSPNNLSMSIGLERETENGINLYPERLITES 355

Query: 1221 NRDTKFFDQPCSRHDAVPTVKQRSSQEYKMP---FFAPKLDLNTDDETDAASKCKQFDLN 1391
              D +F  +   + + +  V ++ SQ+Y++P   F A +LDLNT  + +AA+ CKQ DLN
Sbjct: 356  QSDGEFQHRNRIKPETLKPVDEKVSQDYRLPASYFAAARLDLNTHGDNEAATTCKQLDLN 415

Query: 1392 GFSWS 1406
             FSWS
Sbjct: 416  RFSWS 420


>ref|XP_006443379.1| hypothetical protein CICLE_v10020171mg [Citrus clementina]
            gi|557545641|gb|ESR56619.1| hypothetical protein
            CICLE_v10020171mg [Citrus clementina]
          Length = 441

 Score =  468 bits (1204), Expect = e-129
 Identities = 260/458 (56%), Positives = 302/458 (65%), Gaps = 44/458 (9%)
 Frame = +3

Query: 165  MYPHH-HQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 341
            MY HH +QGK++H+SSRM IP ERHLFLQGG+GPGDSGLVLSTDAKPRLKWTPDLHERFI
Sbjct: 1    MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFI 60

Query: 342  EAVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNK------ 503
            EAVNQLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHGQAN G NK      
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIGKKTI 120

Query: 504  SPAAAE------------------------------------ERISEGSGTQTSNSSIAP 575
            S  +A                                     ER+ E + T  +N SI P
Sbjct: 121  SQKSANYQKDQNCNTYLACKAHTGIGGMKFKSSGVGPVTVPGERMPEANATHMNNLSIGP 180

Query: 576  QTNKNLHIGEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLG 755
            Q NK+LHI E IQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLG
Sbjct: 181  QPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLG 240

Query: 756  SMGLEAAKVQISELVSKVSTQCLNSAFSDVKELSGLCXXXXXXXXXXDCSMDSCLTSCEG 935
            + GLEAAKVQ+SELVSKVSTQCLNS FSD+KEL G C          DCSMDSCLTSCEG
Sbjct: 241  TAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQPQANQPTDCSMDSCLTSCEG 300

Query: 936  PLRDQDMHNNKMAIGTLNFKTVMESKNISNEPSLRQSEQRWCEDLNDKRRFLLSMNEEAE 1115
              +DQ++HN  + +   +    +E K I  EP L+Q+E +W +DL +  +FL S+ +   
Sbjct: 301  SQKDQEIHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTELKWRKDLKES-KFLSSIGK--- 356

Query: 1116 KECAMEKSCSNLSMSIGLQGGWNTNIYSEKGITETNRDTKFFDQPCSRHDAVPTVKQRSS 1295
                 ++    LS+  G         +       +N D  F DQ   + +      +   
Sbjct: 357  -----DRGPGELSIGSG--------SFPAGRFKASNEDEHFQDQTNKKPEGAKLENENLL 403

Query: 1296 QEYKMPFFAPKLDLNT-DDETDAASKCKQFDLNGFSWS 1406
             EY++P F+ KLDLN  D E D AS CKQFDLNGFSW+
Sbjct: 404  PEYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN 441


>gb|ESW22064.1| hypothetical protein PHAVU_005G123900g [Phaseolus vulgaris]
          Length = 430

 Score =  462 bits (1189), Expect = e-127
 Identities = 254/433 (58%), Positives = 310/433 (71%), Gaps = 19/433 (4%)
 Frame = +3

Query: 165  MYPHH-HQGKNIHASSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 341
            MY HH HQGKNIH++SRM IP ERH+FLQ GNG GDSGLVLSTDAKPRLKWTPDLH RFI
Sbjct: 1    MYHHHRHQGKNIHSTSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFI 60

Query: 342  EAVNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANSGGNK--SPAA 515
            EAVNQLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHGQ+N+  +K  + A 
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGISGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKMTTSAT 120

Query: 516  AEERISEGSGTQTSNSSIAPQTN----------KNLHIGEAIQMQIEVQRRLHEQLEVQR 665
              ER+SE SGT  S  S+ PQ N          K+LHIGEA+QMQIEVQRRL+EQLEVQ+
Sbjct: 121  TGERLSETSGTHMSKLSLGPQANNHANFQCLLSKDLHIGEALQMQIEVQRRLNEQLEVQK 180

Query: 666  HLQLRIEAQGKYLQAVLEKAQETLGRQNLGSMGLEAAKVQISELVSKVSTQCLNSAFSDV 845
            HLQLRIEAQGKYLQ+VLEKAQ+TLGRQNLG +GLE AKVQ+SELVSKVS+QCLNSAFS++
Sbjct: 181  HLQLRIEAQGKYLQSVLEKAQDTLGRQNLGIIGLETAKVQLSELVSKVSSQCLNSAFSEL 240

Query: 846  KELSGLCXXXXXXXXXXDCSMDSCLTSCEGPLRDQDMHNNKMAIGTLNFKTVMESKNISN 1025
            KEL G C          DCSMDSCLTSC+   ++Q + N+     +  F    ES +  N
Sbjct: 241  KELQGFCPQQTHTNQPNDCSMDSCLTSCDILQKEQKIQNSLRQFNSHVFMEQKESTDARN 300

Query: 1026 EPSLRQSEQRWCEDLNDKRRFLLSMNE-EAEKECAMEKSCSNLSMSIGLQGGW--NTNIY 1196
              +LR SE +WC+D   K  FL  +++ E  ++ A E    NLSMSIGL+      +++Y
Sbjct: 301  --NLRNSELKWCDD-GKKNTFLAPLSKTEERRKYAAETGPGNLSMSIGLERETENRSSMY 357

Query: 1197 SEKGITETNRDTKFFDQPCSRHDAVPTVKQRSSQEYKMP---FFAPKLDLNTDDETDAAS 1367
             E  I E+  + +F  +   + + +  V ++  Q+Y+MP   F A +LDLN   + +AA+
Sbjct: 358  PESLIKESQSEGEFQHRNRIKTETMKAVDEKVCQDYRMPASYFVATRLDLNNHGDNEAAT 417

Query: 1368 KCKQFDLNGFSWS 1406
             CKQ DLN FSWS
Sbjct: 418  TCKQLDLNRFSWS 430

