BLASTX nr result

ID: Cocculus23_contig00010358 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00010358
         (1607 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007030696.1| Homeodomain-like superfamily protein isoform...   488   e-135
ref|XP_007030697.1| Homeodomain-like superfamily protein isoform...   487   e-135
ref|XP_002525443.1| transcription factor, putative [Ricinus comm...   471   e-130
ref|XP_002319702.2| myb family transcription factor family prote...   464   e-128
ref|XP_002282324.1| PREDICTED: uncharacterized protein LOC100248...   460   e-127
ref|XP_002325408.2| myb family transcription factor family prote...   456   e-125
ref|XP_006443380.1| hypothetical protein CICLE_v10020171mg [Citr...   454   e-125
ref|XP_002282336.1| PREDICTED: uncharacterized protein LOC100248...   442   e-121
ref|XP_006443379.1| hypothetical protein CICLE_v10020171mg [Citr...   439   e-120
ref|XP_007202959.1| hypothetical protein PRUPE_ppa015076mg [Prun...   420   e-114
ref|XP_004249601.1| PREDICTED: uncharacterized protein LOC101256...   414   e-113
ref|XP_003540247.1| PREDICTED: uncharacterized protein LOC100810...   411   e-112
ref|XP_006338933.1| PREDICTED: uncharacterized protein LOC102592...   409   e-111
ref|XP_006339131.1| PREDICTED: uncharacterized protein LOC102602...   402   e-109
ref|XP_006339130.1| PREDICTED: uncharacterized protein LOC102602...   399   e-108
ref|XP_004288533.1| PREDICTED: uncharacterized protein LOC101304...   395   e-107
ref|XP_006338935.1| PREDICTED: uncharacterized protein LOC102592...   391   e-106
ref|XP_007150070.1| hypothetical protein PHAVU_005G123900g [Phas...   386   e-104
ref|XP_006339132.1| PREDICTED: uncharacterized protein LOC102602...   381   e-103
ref|XP_006829830.1| hypothetical protein AMTR_s00119p00095480 [A...   380   e-103

>ref|XP_007030696.1| Homeodomain-like superfamily protein isoform 1 [Theobroma cacao]
            gi|508719301|gb|EOY11198.1| Homeodomain-like superfamily
            protein isoform 1 [Theobroma cacao]
          Length = 478

 Score =  488 bits (1255), Expect = e-135
 Identities = 268/426 (62%), Positives = 306/426 (71%), Gaps = 20/426 (4%)
 Frame = -3

Query: 1605 EMYHHHH-HQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHER 1429
            +MYHHHH HQG N  PSSRMPIPPERH+FLQGGN  GD+GLVLSTDAKPRLKWT +LHER
Sbjct: 64   KMYHHHHQHQGKNIHPSSRMPIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHER 123

Query: 1428 FVEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNSGNSKNGCM 1249
            F+EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ N+G++K G +
Sbjct: 124  FIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANNGSNKIGAV 183

Query: 1248 GRMLGDRMSEINGAPVTNTSLRPQTNN-LQISEALQMQIEVQRRLHEQLEVQRHLQLRIE 1072
              M GDRMSE NG  V N S+ PQ NN LQI EALQMQIEVQRRLHEQLEVQRHLQLRIE
Sbjct: 184  A-MAGDRMSEANGTHVNNLSIGPQANNGLQIGEALQMQIEVQRRLHEQLEVQRHLQLRIE 242

Query: 1071 AQGKYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLR 925
            AQGKYLQ+VLEKAQETLG+Q  GS G   AKV LSE VSKVS            +L  L 
Sbjct: 243  AQGKYLQAVLEKAQETLGRQNLGSVGLEAAKVQLSELVSKVSNQCLNSAFSDLKDLQGLC 302

Query: 924  YHQAQTTQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLYQLNPPLC-TKEIREESRLEQ 748
              Q Q T PTDCS+DSCLTSCEGS ++QEI N G+ LR Y  +  L   +EI E+  L Q
Sbjct: 303  PQQTQATPPTDCSMDSCLTSCEGSQKEQEIHNNGMCLRPYNTSGALLEQREIAEDPLLPQ 362

Query: 747  YELKGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASGISSSDIR--- 577
             ELK  E+  ++K+F +S G+D E+ +F   RSS++ SMS  L  EK +G +SS      
Sbjct: 363  TELKSFEDIKENKMFLSSLGKDAERRMFFADRSSSDLSMSVGLQGEKGNGGNSSSFSEAK 422

Query: 576  ---AEEDGNFLNQGNSRGSLAQLENENKSNEFRLPSLTAQLDLNAQDENDAASSCKQFDL 406
                 ED +FL++GN R               RLP    +LDLN  +ENDAASSCKQFDL
Sbjct: 423  FKGRNEDDSFLDRGNKRADEVN----------RLPYFATKLDLNVHEENDAASSCKQFDL 472

Query: 405  NGFSWS 388
            NG SW+
Sbjct: 473  NGLSWN 478


>ref|XP_007030697.1| Homeodomain-like superfamily protein isoform 2 [Theobroma cacao]
            gi|590643063|ref|XP_007030698.1| Homeodomain-like
            superfamily protein isoform 2 [Theobroma cacao]
            gi|508719302|gb|EOY11199.1| Homeodomain-like superfamily
            protein isoform 2 [Theobroma cacao]
            gi|508719303|gb|EOY11200.1| Homeodomain-like superfamily
            protein isoform 2 [Theobroma cacao]
          Length = 414

 Score =  487 bits (1254), Expect = e-135
 Identities = 268/425 (63%), Positives = 305/425 (71%), Gaps = 20/425 (4%)
 Frame = -3

Query: 1602 MYHHHH-HQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERF 1426
            MYHHHH HQG N  PSSRMPIPPERH+FLQGGN  GD+GLVLSTDAKPRLKWT +LHERF
Sbjct: 1    MYHHHHQHQGKNIHPSSRMPIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERF 60

Query: 1425 VEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNSGNSKNGCMG 1246
            +EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ N+G++K G + 
Sbjct: 61   IEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANNGSNKIGAVA 120

Query: 1245 RMLGDRMSEINGAPVTNTSLRPQTNN-LQISEALQMQIEVQRRLHEQLEVQRHLQLRIEA 1069
             M GDRMSE NG  V N S+ PQ NN LQI EALQMQIEVQRRLHEQLEVQRHLQLRIEA
Sbjct: 121  -MAGDRMSEANGTHVNNLSIGPQANNGLQIGEALQMQIEVQRRLHEQLEVQRHLQLRIEA 179

Query: 1068 QGKYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLRY 922
            QGKYLQ+VLEKAQETLG+Q  GS G   AKV LSE VSKVS            +L  L  
Sbjct: 180  QGKYLQAVLEKAQETLGRQNLGSVGLEAAKVQLSELVSKVSNQCLNSAFSDLKDLQGLCP 239

Query: 921  HQAQTTQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLYQLNPPLC-TKEIREESRLEQY 745
             Q Q T PTDCS+DSCLTSCEGS ++QEI N G+ LR Y  +  L   +EI E+  L Q 
Sbjct: 240  QQTQATPPTDCSMDSCLTSCEGSQKEQEIHNNGMCLRPYNTSGALLEQREIAEDPLLPQT 299

Query: 744  ELKGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASGISSSDIR---- 577
            ELK  E+  ++K+F +S G+D E+ +F   RSS++ SMS  L  EK +G +SS       
Sbjct: 300  ELKSFEDIKENKMFLSSLGKDAERRMFFADRSSSDLSMSVGLQGEKGNGGNSSSFSEAKF 359

Query: 576  --AEEDGNFLNQGNSRGSLAQLENENKSNEFRLPSLTAQLDLNAQDENDAASSCKQFDLN 403
                ED +FL++GN R               RLP    +LDLN  +ENDAASSCKQFDLN
Sbjct: 360  KGRNEDDSFLDRGNKRADEVN----------RLPYFATKLDLNVHEENDAASSCKQFDLN 409

Query: 402  GFSWS 388
            G SW+
Sbjct: 410  GLSWN 414


>ref|XP_002525443.1| transcription factor, putative [Ricinus communis]
            gi|223535256|gb|EEF36933.1| transcription factor,
            putative [Ricinus communis]
          Length = 419

 Score =  471 bits (1211), Expect = e-130
 Identities = 258/420 (61%), Positives = 299/420 (71%), Gaps = 15/420 (3%)
 Frame = -3

Query: 1602 MYHHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFV 1423
            MYHHH HQG +   SSRM IPPERH+FLQGGN  GD+GLVLSTDAKPRLKWT +LHE F+
Sbjct: 1    MYHHHQHQGKSVHSSSRMSIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTSDLHEHFI 60

Query: 1422 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNSGNSKNGCMGR 1243
            EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ NSG++K G  G 
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANSGSNKIG-TGA 119

Query: 1242 MLGDRMSEINGAPVTNTSLRPQTN-NLQISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQ 1066
            ++GDR+SE N   + N S+  QTN  L I EALQMQIEVQRRLHEQLEVQRHLQLRIEAQ
Sbjct: 120  VVGDRISETNVTHINNLSMGTQTNKGLHIGEALQMQIEVQRRLHEQLEVQRHLQLRIEAQ 179

Query: 1065 GKYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLRYH 919
            GKYLQSVLEKAQETLG+Q  GS G   AKV LSE VSKVST           EL  L + 
Sbjct: 180  GKYLQSVLEKAQETLGRQNLGSIGLEAAKVQLSELVSKVSTQCLNSAFSELKELQGLCHQ 239

Query: 918  QAQTTQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLYQLNPPLCTKEIREESRLEQYEL 739
            Q QT  PTDCS+DSCLTSCEGS ++QEI NTG+GLR Y  N  L +K+I E   L Q EL
Sbjct: 240  QTQTAPPTDCSMDSCLTSCEGSQKEQEIHNTGMGLRPYNGNALLESKDITEGHVLHQTEL 299

Query: 738  KGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHRE--KASGISSSDIRAEED 565
            K  E+   +K+F +  G +  +  F  +RS+++ SM+  L  E   AS  S    +   D
Sbjct: 300  KWSEDLKDNKMFLSPLGNNAARRNFAAERSTSDLSMTVGLQGENGNASSFSEGRYKDRND 359

Query: 564  G-NFLNQGNSRGSLAQLENENKSNEFRLPSLTAQLDLNAQDENDAASSCKQFDLNGFSWS 388
            G +F +Q N      +L   + S  +RLP    +LDLN+ +E DAASSCKQ DLNGFSW+
Sbjct: 360  GDSFPDQTNKSLDSVKLPKGDVSQGYRLPYFATKLDLNSHEEIDAASSCKQLDLNGFSWN 419


>ref|XP_002319702.2| myb family transcription factor family protein [Populus trichocarpa]
            gi|550325041|gb|EEE95625.2| myb family transcription
            factor family protein [Populus trichocarpa]
          Length = 427

 Score =  464 bits (1193), Expect = e-128
 Identities = 259/429 (60%), Positives = 298/429 (69%), Gaps = 24/429 (5%)
 Frame = -3

Query: 1602 MYHHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFV 1423
            MYHHH HQG +   SSRM IPPERH+FLQGGN  GD+GLVLSTDAKPRLKWT +LHERF+
Sbjct: 1    MYHHHQHQGKSIHSSSRMAIPPERHLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 60

Query: 1422 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNSGNSKNGCMGR 1243
            EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ N G+SK G +  
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGSSKIGTVA- 119

Query: 1242 MLGDRMSEINGA--PVTNTSLRPQTN------NLQISEALQMQIEVQRRLHEQLEVQRHL 1087
            ++GDRM E N     + N S+  Q N      +L  SEALQMQIEVQRRLHEQLEVQRHL
Sbjct: 120  VVGDRMPEANATHININNLSIGSQPNKILKSRSLHFSEALQMQIEVQRRLHEQLEVQRHL 179

Query: 1086 QLRIEAQGKYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------E 940
            QLRIEAQGKYLQ+VLEKAQETLG+Q  G+ G   AKV LSE VSKVST           +
Sbjct: 180  QLRIEAQGKYLQAVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLNSTFSELND 239

Query: 939  LPSLRYHQAQTTQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLYQLNPPLCTKEIREES 760
            L  L   Q   TQP DCS+DSCLTSCEGS ++QEI N G+GLR    N  L  KEI EE 
Sbjct: 240  LQGLCPQQTPPTQPNDCSMDSCLTSCEGSQKEQEIHNIGMGLRPCNSNALLEPKEIAEEH 299

Query: 759  RLEQYELKGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASGISSSDI 580
             L+Q ELK  E    +K+F TS G + E+  F  +RS ++ S+   L  EK + I+SS  
Sbjct: 300  ALQQTELKWGEYLRDNKMFLTSIGHETERRTFSAERSCSDLSIGVGLQGEKGN-INSSFA 358

Query: 579  RA-----EEDGNFLNQGNSRGSLAQLENENKSNEFRLPSLTAQLDLNAQDENDAASSCKQ 415
                    ED +F +Q N R    + E+E  S  +RL   T +LDLN+ DE DAASSCKQ
Sbjct: 359  EGRFKGMSEDDSFQDQTNKRAESVKFEDEKMSPGYRLSYFTTKLDLNSHDEIDAASSCKQ 418

Query: 414  FDLNGFSWS 388
             DLNGFSW+
Sbjct: 419  LDLNGFSWN 427


>ref|XP_002282324.1| PREDICTED: uncharacterized protein LOC100248614 isoform 1 [Vitis
            vinifera]
          Length = 418

 Score =  460 bits (1184), Expect = e-127
 Identities = 260/426 (61%), Positives = 303/426 (71%), Gaps = 21/426 (4%)
 Frame = -3

Query: 1602 MYHHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFV 1423
            MYHHHHHQG N  PSSR PI PER++FLQGGN  GD+GLVLSTDAKPRLKWT +LHERF+
Sbjct: 1    MYHHHHHQGKNIHPSSRTPITPERNLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 60

Query: 1422 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNSGNSKNGCMGR 1243
            EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ NS  SK      
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANSATSKT----- 115

Query: 1242 MLGDRMSEINGAPVTNTSLRPQTN-NLQISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQ 1066
            ++G+RM E NGA +++ ++  QTN +L +SE LQM IE QRRLHEQLEVQRHLQLRIEAQ
Sbjct: 116  VVGERMPEANGALMSSPNIGNQTNKSLHLSETLQM-IEAQRRLHEQLEVQRHLQLRIEAQ 174

Query: 1065 GKYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLRYH 919
            GKYLQ+VLEKAQETLG+Q  G+ G   AKV LSE VSKVST           EL SL   
Sbjct: 175  GKYLQAVLEKAQETLGRQNLGAVGLEAAKVQLSELVSKVSTQCLHSAFSELKELQSLCPQ 234

Query: 918  QAQTTQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLY-QLNPPLCTKEIREESRLEQYE 742
            Q Q TQPTDCS+DSCLTSCEGS R+QEI N G+GLR Y   + PL  K+  E   L+   
Sbjct: 235  QTQ-TQPTDCSMDSCLTSCEGSQREQEIHNCGMGLRPYTNGSTPLEAKDTAEPPGLQHTV 293

Query: 741  LKGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASGISSSDI-----R 577
            LK  E+  +++ F +S  RD E+     +RS+++ SM   L  EK +G +S        R
Sbjct: 294  LKWCEDTKENRQFISSMQRDAERRTMTAERSNSDLSMRIGLQGEKGNGSNSYSEGRFKGR 353

Query: 576  AEEDGNFL---NQGNSRGSLAQLENENKSNEFRLPSLTAQLDLNAQDENDAASSCKQFDL 406
            AE D NF+   N G   G+  + ENE  S+ +RLP   A+LDLNA DEND   SCKQFDL
Sbjct: 354  AEAD-NFVDRTNHGADSGNSVKQENEKMSHGYRLPCFGAKLDLNAHDENDVTLSCKQFDL 412

Query: 405  NGFSWS 388
            NGFSW+
Sbjct: 413  NGFSWN 418


>ref|XP_002325408.2| myb family transcription factor family protein [Populus trichocarpa]
            gi|550316805|gb|EEE99789.2| myb family transcription
            factor family protein [Populus trichocarpa]
          Length = 420

 Score =  456 bits (1173), Expect = e-125
 Identities = 252/422 (59%), Positives = 297/422 (70%), Gaps = 17/422 (4%)
 Frame = -3

Query: 1602 MYHHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFV 1423
            MY HH HQG N   SSR  IPPERH+FLQ GN  GD+GLVLSTDAKPRLKWT +LHERF+
Sbjct: 1    MYQHHQHQGKNIHSSSRNSIPPERHLFLQVGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 60

Query: 1422 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNSGNSKNGCMGR 1243
            EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ NSG++K+G +  
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANSGSNKSGTVA- 119

Query: 1242 MLGDRMSEINGAPVTNTSLRPQTN-NLQISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQ 1066
            ++GDRM E+N   + N S+  QTN +L  SEALQ+QIEVQRRLHEQLEVQRHLQLRIEAQ
Sbjct: 120  VVGDRMPEVNATHINNLSIGSQTNKSLHFSEALQVQIEVQRRLHEQLEVQRHLQLRIEAQ 179

Query: 1065 GKYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLRYH 919
            GKYLQSVLEKAQETLG+Q  G+ G   AKV LSE VSKVS+           +L  L   
Sbjct: 180  GKYLQSVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSSKCLNSAFSELKDLQGLCPP 239

Query: 918  QAQTTQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLYQLNPPLCTKEIREESRLEQYEL 739
              Q T P DCS+DSCLTS EGS ++QEI NTG+GLR Y  N  L  K I  E  L+Q EL
Sbjct: 240  LTQPTHPNDCSMDSCLTSIEGSQKEQEIHNTGMGLRPYNGNALLEPKVIAGEHALQQTEL 299

Query: 738  KGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASGISSSDIRA----- 574
            K  E+Q  +K+F +S   D ++  F  +RS +N S+   L  E+ + +SSS   A     
Sbjct: 300  KWGEDQRDNKMFLSSMRNDTDRRTFSAERSCSNLSIGVGLQGERGN-VSSSFAEARFKGR 358

Query: 573  EEDGNFLNQGNSRGSLAQLENENKSNEFRLPSLTAQLDLNAQDENDAASSCKQFDLNGFS 394
             ED +F ++ N R    +LENE  S  +RL     +LDLN+  E DAAS C+Q DLNGFS
Sbjct: 359  SEDDSFQDKTNRRIDAIKLENEKLSPGYRLSYYATKLDLNSHGEIDAASGCRQLDLNGFS 418

Query: 393  WS 388
            W+
Sbjct: 419  WN 420


>ref|XP_006443380.1| hypothetical protein CICLE_v10020171mg [Citrus clementina]
            gi|568850794|ref|XP_006479082.1| PREDICTED:
            uncharacterized protein LOC102612777 isoform X1 [Citrus
            sinensis] gi|568850796|ref|XP_006479083.1| PREDICTED:
            uncharacterized protein LOC102612777 isoform X2 [Citrus
            sinensis] gi|557545642|gb|ESR56620.1| hypothetical
            protein CICLE_v10020171mg [Citrus clementina]
          Length = 401

 Score =  454 bits (1167), Expect = e-125
 Identities = 259/418 (61%), Positives = 296/418 (70%), Gaps = 13/418 (3%)
 Frame = -3

Query: 1602 MYHHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFV 1423
            MYHHH +QG +   SSRMPIP ERH+FLQGG+  GD+GLVLSTDAKPRLKWT +LHERF+
Sbjct: 1    MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFI 60

Query: 1422 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNSGNSKNGCMGR 1243
            EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ N GN+K G +  
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIGPV-T 119

Query: 1242 MLGDRMSEINGAPVTNTSLRPQTN-NLQISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQ 1066
            + G+RM E N   + N S+ PQ N +L ISE +QMQIEVQRRLHEQLEVQRHLQLRIEAQ
Sbjct: 120  VPGERMPEANATHMNNLSIGPQPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQ 179

Query: 1065 GKYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLRYH 919
            GKYLQ+VLEKAQETLG+Q  G+ G   AKV LSE VSKVST           EL      
Sbjct: 180  GKYLQAVLEKAQETLGRQNLGTAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQ 239

Query: 918  QAQTTQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLYQLNPPLCTKEIREESRLEQYEL 739
            Q Q  QPTDCS+DSCLTSCEGS +DQEI N G+ LR Y   P L  KEI EE  L+Q EL
Sbjct: 240  QPQANQPTDCSMDSCLTSCEGSQKDQEIHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTEL 299

Query: 738  KGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASGISSSDIRAEEDGN 559
            K  ++  +SK F +S G+D      P + S  + S  A   R KAS          ED +
Sbjct: 300  KWRKDLKESK-FLSSIGKDRG----PGELSIGSGSFPA--GRFKAS---------NEDEH 343

Query: 558  FLNQGNSRGSLAQLENENKSNEFRLPSLTAQLDLNAQD-ENDAASSCKQFDLNGFSWS 388
            F +Q N +   A+LENEN   E+RLP  + +LDLNA D END AS CKQFDLNGFSW+
Sbjct: 344  FQDQTNKKPEGAKLENENLLPEYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN 401


>ref|XP_002282336.1| PREDICTED: uncharacterized protein LOC100248614 isoform 2 [Vitis
            vinifera]
          Length = 412

 Score =  442 bits (1136), Expect = e-121
 Identities = 254/426 (59%), Positives = 297/426 (69%), Gaps = 21/426 (4%)
 Frame = -3

Query: 1602 MYHHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFV 1423
            MYHHHHHQG N  PSSR PI PER++FLQGGN  GD+GLVLSTDAKPRLKWT +LHERF+
Sbjct: 1    MYHHHHHQGKNIHPSSRTPITPERNLFLQGGNGPGDSGLVLSTDAKPRLKWTPDLHERFI 60

Query: 1422 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNSGNSKNGCMGR 1243
            EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ NS  SK      
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANSATSKT----- 115

Query: 1242 MLGDRMSEINGAPVTNTSLRPQTN-NLQISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQ 1066
            ++G+RM E NGA +++ ++  QTN +L +SE LQM IE QRRLHEQLE      LRIEAQ
Sbjct: 116  VVGERMPEANGALMSSPNIGNQTNKSLHLSETLQM-IEAQRRLHEQLE------LRIEAQ 168

Query: 1065 GKYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLRYH 919
            GKYLQ+VLEKAQETLG+Q  G+ G   AKV LSE VSKVST           EL SL   
Sbjct: 169  GKYLQAVLEKAQETLGRQNLGAVGLEAAKVQLSELVSKVSTQCLHSAFSELKELQSLCPQ 228

Query: 918  QAQTTQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLY-QLNPPLCTKEIREESRLEQYE 742
            Q Q TQPTDCS+DSCLTSCEGS R+QEI N G+GLR Y   + PL  K+  E   L+   
Sbjct: 229  QTQ-TQPTDCSMDSCLTSCEGSQREQEIHNCGMGLRPYTNGSTPLEAKDTAEPPGLQHTV 287

Query: 741  LKGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASGISSSDI-----R 577
            LK  E+  +++ F +S  RD E+     +RS+++ SM   L  EK +G +S        R
Sbjct: 288  LKWCEDTKENRQFISSMQRDAERRTMTAERSNSDLSMRIGLQGEKGNGSNSYSEGRFKGR 347

Query: 576  AEEDGNFL---NQGNSRGSLAQLENENKSNEFRLPSLTAQLDLNAQDENDAASSCKQFDL 406
            AE D NF+   N G   G+  + ENE  S+ +RLP   A+LDLNA DEND   SCKQFDL
Sbjct: 348  AEAD-NFVDRTNHGADSGNSVKQENEKMSHGYRLPCFGAKLDLNAHDENDVTLSCKQFDL 406

Query: 405  NGFSWS 388
            NGFSW+
Sbjct: 407  NGFSWN 412


>ref|XP_006443379.1| hypothetical protein CICLE_v10020171mg [Citrus clementina]
            gi|557545641|gb|ESR56619.1| hypothetical protein
            CICLE_v10020171mg [Citrus clementina]
          Length = 441

 Score =  439 bits (1129), Expect = e-120
 Identities = 260/457 (56%), Positives = 296/457 (64%), Gaps = 52/457 (11%)
 Frame = -3

Query: 1602 MYHHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFV 1423
            MYHHH +QG +   SSRMPIP ERH+FLQGG+  GD+GLVLSTDAKPRLKWT +LHERF+
Sbjct: 1    MYHHHQNQGKSMHSSSRMPIPTERHLFLQGGSGPGDSGLVLSTDAKPRLKWTPDLHERFI 60

Query: 1422 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNSGNSKNG---- 1255
            EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ N GN+K G    
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQANIGNNKIGKKTI 120

Query: 1254 ------------------CMGR-----------------MLGDRMSEINGAPVTNTSLRP 1180
                              C                    + G+RM E N   + N S+ P
Sbjct: 121  SQKSANYQKDQNCNTYLACKAHTGIGGMKFKSSGVGPVTVPGERMPEANATHMNNLSIGP 180

Query: 1179 QTN-NLQISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQSVLEKAQETLGKQTPG 1003
            Q N +L ISE +QMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQ+VLEKAQETLG+Q  G
Sbjct: 181  QPNKSLHISETIQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYLQAVLEKAQETLGRQNLG 240

Query: 1002 STGFGTAKVPLSEFVSKVST-----------ELPSLRYHQAQTTQPTDCSIDSCLTSCEG 856
            + G   AKV LSE VSKVST           EL      Q Q  QPTDCS+DSCLTSCEG
Sbjct: 241  TAGLEAAKVQLSELVSKVSTQCLNSTFSDLKELQGFCPQQPQANQPTDCSMDSCLTSCEG 300

Query: 855  SHRDQEIPNTGIGLRLYQLNPPLCTKEIREESRLEQYELKGPEEQNKSKVFPTSGGRDIE 676
            S +DQEI N G+ LR Y   P L  KEI EE  L+Q ELK  ++  +SK F +S G+D  
Sbjct: 301  SQKDQEIHNGGVRLRPYHGTPTLEPKEIVEEPMLQQTELKWRKDLKESK-FLSSIGKDRG 359

Query: 675  KMIFPIQRSSNNFSMSARLHREKASGISSSDIRAEEDGNFLNQGNSRGSLAQLENENKSN 496
                P + S  + S  A   R KAS          ED +F +Q N +   A+LENEN   
Sbjct: 360  ----PGELSIGSGSFPA--GRFKAS---------NEDEHFQDQTNKKPEGAKLENENLLP 404

Query: 495  EFRLPSLTAQLDLNAQD-ENDAASSCKQFDLNGFSWS 388
            E+RLP  + +LDLNA D END AS CKQFDLNGFSW+
Sbjct: 405  EYRLPCFSTKLDLNAHDHENDVASGCKQFDLNGFSWN 441


>ref|XP_007202959.1| hypothetical protein PRUPE_ppa015076mg [Prunus persica]
            gi|462398490|gb|EMJ04158.1| hypothetical protein
            PRUPE_ppa015076mg [Prunus persica]
          Length = 421

 Score =  420 bits (1079), Expect = e-114
 Identities = 244/423 (57%), Positives = 285/423 (67%), Gaps = 20/423 (4%)
 Frame = -3

Query: 1596 HHHHHQGNN----SFPSSRMPIPPERHVFLQGG-NARGDAGLVLSTDAKPRLKWTCELHE 1432
            H H HQG N    S  SSRM IPPERH++LQG  N  G++GLVLSTDAKPRLKWT +LHE
Sbjct: 14   HQHQHQGKNIHSSSSASSRMSIPPERHLYLQGDQNGPGESGLVLSTDAKPRLKWTPDLHE 73

Query: 1431 RFVEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNSGNSKNGC 1252
            RF+EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHG   SG SK   
Sbjct: 74   RFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGHATSGTSK--- 130

Query: 1251 MGRMLGDRMSEINGAPVTNTSLRPQTNNLQISEALQMQIEVQRRLHEQLEVQRHLQLRIE 1072
               +  D     N   + N         L ISE LQMQIEVQRRLHEQLEVQRHLQLRIE
Sbjct: 131  ---IALDPNETYNNNGILN------CRGLHISETLQMQIEVQRRLHEQLEVQRHLQLRIE 181

Query: 1071 AQGKYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLR 925
            AQGKYLQSVLEKAQETLG+Q  G+ G   AKV LSE VSKVST           EL  L 
Sbjct: 182  AQGKYLQSVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLNSAFTELKELQGLC 241

Query: 924  YHQAQTTQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLYQLNPPLCTKEIREESRLEQY 745
              Q QTTQPTDCS++SCLTSCEGS +DQEI N+ +GLR       L  +   +E  L++ 
Sbjct: 242  PQQTQTTQPTDCSMESCLTSCEGSKKDQEIHNSAMGLRANYNGRELLDE---KEPMLQKT 298

Query: 744  ELKGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASGISSSDIRAE-- 571
            ELK  EE  ++ +  +S   D  K +FP++RSS++ SMS     E+ +   +S+ R +  
Sbjct: 299  ELKWCEELKENNMLLSSISNDAAKRMFPVERSSSDLSMSIGCQGERWNINGNSEERLKGR 358

Query: 570  -EDGNFLNQGNSRGSLAQLENENKSNEFR-LPSLTAQLDLNAQDENDAASSCKQFDLNGF 397
              D +FL++ N+R   A+ E E  S   R +P   A+LDLN  D+NDA SSCKQFDLNGF
Sbjct: 359  STDVSFLDRTNNRADSAKAETEKVSRGCRSVPYFAAKLDLNTHDDNDAPSSCKQFDLNGF 418

Query: 396  SWS 388
            SWS
Sbjct: 419  SWS 421


>ref|XP_004249601.1| PREDICTED: uncharacterized protein LOC101256236 [Solanum
            lycopersicum]
          Length = 414

 Score =  414 bits (1065), Expect = e-113
 Identities = 236/421 (56%), Positives = 286/421 (67%), Gaps = 18/421 (4%)
 Frame = -3

Query: 1596 HHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFVEA 1417
            +HHHHQ  +  PS+RM +P ERH+FLQGGN  GD+GLVLSTDAKPRLKWT +LHERF+EA
Sbjct: 2    YHHHHQDKSMHPSTRMSVP-ERHLFLQGGNGNGDSGLVLSTDAKPRLKWTPDLHERFIEA 60

Query: 1416 VNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNSGNSKNGCMGRML 1237
            VNQLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHGQ N+  +     G   
Sbjct: 61   VNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANASGANKAAAG--- 117

Query: 1236 GDRMSEINGAPVTNTSLRPQTN-NLQISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQGK 1060
             +R+SE +   ++N S+ PQ N N+QISEA+QMQIEVQRRLHEQLEVQRHLQLRIEAQGK
Sbjct: 118  VERISENSATCMSNPSMVPQPNKNIQISEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQGK 177

Query: 1059 YLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLRYHQA 913
            YLQSVLEKAQETLG+Q   + G    KV LSEFVSK S            EL      Q 
Sbjct: 178  YLQSVLEKAQETLGRQNMETVGLEAVKVQLSEFVSKASNQCLNSPFTDIKELSGFHSQQT 237

Query: 912  QTTQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLYQLNPPLCTKEIREESRLEQYELKG 733
            Q TQPTD SIDSCLTS +GS RD  + +  IGLR +   P +  K+I  ++RL+Q EL+ 
Sbjct: 238  QATQPTDRSIDSCLTSRDGSLRDNTMHDNQIGLRPFGFTPSIECKDIENDTRLQQTELRW 297

Query: 732  PE--EQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASG-ISSSDIR---AE 571
             +  ++N+    P + GR+     F  + + NN SMS  L  EK +G ++ SD      E
Sbjct: 298  CDNLKENRRLFSPMNEGRE---KTFTRETNCNNLSMSIGLQDEKLNGSMNHSDGNFNGTE 354

Query: 570  EDGNFLNQGNSRGSLAQLENENKSNEFRLPSLTAQLDLNAQDENDAASSCKQFDLNGFSW 391
             D    +Q  +R S +  +    S E++L     +LDLN  DE DAASSCKQFDLNGFSW
Sbjct: 355  RDVKLFHQVTNR-SESVPQRHKSSQEYKLSYFEPKLDLNMHDETDAASSCKQFDLNGFSW 413

Query: 390  S 388
            S
Sbjct: 414  S 414


>ref|XP_003540247.1| PREDICTED: uncharacterized protein LOC100810396 [Glycine max]
          Length = 420

 Score =  411 bits (1057), Expect = e-112
 Identities = 238/425 (56%), Positives = 283/425 (66%), Gaps = 20/425 (4%)
 Frame = -3

Query: 1602 MYHHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFV 1423
            MYHHH HQG N   SSRMPIP ERH+FLQ GN  GD+GLVLSTDAKPRLKWT +LH RF+
Sbjct: 1    MYHHHQHQGKNIHSSSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFI 60

Query: 1422 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNSGNSKNGCMGR 1243
            EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQ+N+   K      
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQSNNVTYKI-TTSA 119

Query: 1242 MLGDRMSEINGAPVTNTSLRPQTN-NLQISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQ 1066
              G+R+SE NG  +   SL PQ N +L ISEALQMQIEVQRRL+EQLEVQRHLQLRIEAQ
Sbjct: 120  STGERLSETNGTHMNKLSLGPQANKDLHISEALQMQIEVQRRLNEQLEVQRHLQLRIEAQ 179

Query: 1065 GKYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLRYH 919
            GKYLQSVLEKAQETLG+Q  G  G   AKV LSE VSKVS+           +L      
Sbjct: 180  GKYLQSVLEKAQETLGRQNLGVVGIEAAKVQLSELVSKVSSQCLNSAFTEPKDLQGFFPQ 239

Query: 918  QAQTTQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLYQLNPPLCTKEIRE-ESRLEQYE 742
            Q QT  P DCS+DSCLTS + S ++QEI N   GLR +  +  +  KE  E  + L   E
Sbjct: 240  QTQTNPPNDCSMDSCLTSSDRSQKEQEIQN---GLRHFNSHVFMEHKEATEAPNNLRNPE 296

Query: 741  LKGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASGISSSDIR----A 574
            LK  E+  K   F     ++ E+  +  + S NN SMS  L RE  +GI+    R    +
Sbjct: 297  LKWCED-GKKNTFLAPLSKNEERRNYAAESSPNNLSMSIGLERETENGINLYPERLITES 355

Query: 573  EEDGNFLNQGNSRGSLAQLENENKSNEFRLPS---LTAQLDLNAQDENDAASSCKQFDLN 403
            + DG F ++   +    +  +E  S ++RLP+     A+LDLN   +N+AA++CKQ DLN
Sbjct: 356  QSDGEFQHRNRIKPETLKPVDEKVSQDYRLPASYFAAARLDLNTHGDNEAATTCKQLDLN 415

Query: 402  GFSWS 388
             FSWS
Sbjct: 416  RFSWS 420


>ref|XP_006338933.1| PREDICTED: uncharacterized protein LOC102592272 isoform X1 [Solanum
            tuberosum] gi|565343634|ref|XP_006338934.1| PREDICTED:
            uncharacterized protein LOC102592272 isoform X2 [Solanum
            tuberosum]
          Length = 416

 Score =  409 bits (1052), Expect = e-111
 Identities = 236/421 (56%), Positives = 287/421 (68%), Gaps = 19/421 (4%)
 Frame = -3

Query: 1596 HHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFVEA 1417
            +HHHHQ  +  PS+RM +P ERH+FLQGGN  GD+GLVLSTDAKPRLKWT +LHERF+EA
Sbjct: 2    YHHHHQEKSMHPSTRMSVP-ERHLFLQGGNGNGDSGLVLSTDAKPRLKWTPDLHERFIEA 60

Query: 1416 VNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTN-SGNSKNGCMGRM 1240
            VNQLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHGQ N SG +K   +  +
Sbjct: 61   VNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANASGTNKAVAVAGV 120

Query: 1239 LGDRMSEINGAPVTNTSLRPQTN-NLQISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQG 1063
              +R+SE +   ++N S+ PQ N N+QISEA+QMQIEVQRRLHEQLEVQRHLQLRIEAQG
Sbjct: 121  --ERISENSATCMSNPSMVPQPNKNIQISEAIQMQIEVQRRLHEQLEVQRHLQLRIEAQG 178

Query: 1062 KYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLRYHQ 916
            KYLQSVLEKAQETLG+Q   + G    KV LSEFVSK S            EL       
Sbjct: 179  KYLQSVLEKAQETLGRQNMETVGLEAVKVQLSEFVSKASNQCLNSPFPDIKELSGFHSQH 238

Query: 915  AQTTQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLYQLNPPLCTKEIREESRLEQYELK 736
             Q TQPTD SIDSCLTS +GS RD  + +  IGLR +   P +  K+I  ++RL+Q EL+
Sbjct: 239  TQATQPTDRSIDSCLTSRDGSLRDNTMHDNQIGLRPFDFTPSIECKDIENDARLQQTELR 298

Query: 735  GPE--EQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASG-ISSSD---IRA 574
              +  ++N+    P + GR+     F  + + NN SMS  L  EK +G ++ SD      
Sbjct: 299  WCDNLKENRRLFSPMNEGRE---KTFTRETNCNNLSMSIGLQDEKLNGSMNHSDGSFNGT 355

Query: 573  EEDGNFLNQGNSRGSLAQLENENKSNEFRLPSLTAQLDLNAQDENDAASSCKQFDLNGFS 394
            E D    +Q  +R S +  +    S E++L     +LDLN  DE DAASSCKQFDLNGFS
Sbjct: 356  ERDVKLFHQVTNR-SESVPQRHKSSQEYKLSYFQPKLDLNMHDETDAASSCKQFDLNGFS 414

Query: 393  W 391
            W
Sbjct: 415  W 415


>ref|XP_006339131.1| PREDICTED: uncharacterized protein LOC102602766 isoform X2 [Solanum
            tuberosum]
          Length = 414

 Score =  402 bits (1033), Expect = e-109
 Identities = 231/424 (54%), Positives = 285/424 (67%), Gaps = 21/424 (4%)
 Frame = -3

Query: 1596 HHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFVEA 1417
            +HHHHQ +N  PS+RM  P ERH+FLQGGNA GD+GLVLSTDAKPRLKWT +LHERF+EA
Sbjct: 2    YHHHHQASNMHPSTRMSFP-ERHLFLQGGNANGDSGLVLSTDAKPRLKWTPDLHERFIEA 60

Query: 1416 VNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTN-SGNSKNGCMGRM 1240
            V QLGGADKATPK+V+KLMGIPGLTLYHLKSHLQKYRLSKN HGQ N SG +K   M   
Sbjct: 61   VTQLGGADKATPKSVLKLMGIPGLTLYHLKSHLQKYRLSKNHHGQANLSGVNKAASM--- 117

Query: 1239 LGDRMSEINGAPVTNTSLRPQ-TNNLQISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQG 1063
              +++ E  G+P +N S+ PQ  NN+ ISEA+QMQI+VQRRLHEQLEVQRHLQLRIEAQG
Sbjct: 118  --EKICESTGSPTSNPSIGPQPNNNIPISEAIQMQIDVQRRLHEQLEVQRHLQLRIEAQG 175

Query: 1062 KYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLRYHQ 916
            KYLQ+VLEKAQETLG Q  G+ GF  AKV LS+ VSKVS            EL      Q
Sbjct: 176  KYLQAVLEKAQETLGTQNLGTIGFEAAKVQLSDLVSKVSNQCLNSAFSEIQELSGFHTPQ 235

Query: 915  AQTTQP-TDCSIDSCLTSCEGSHRD-QEIPNTGIGLRLYQLNPPLCTKEIREESRLEQYE 742
             Q TQ   DCS+DSCLTS EG  RD QE+ N  +GLR     P  CT+EI  ++RL+Q  
Sbjct: 236  TQATQRLADCSMDSCLTSSEGPLRDLQEMHNNQLGLRTLNFGP--CTEEIENQTRLQQTA 293

Query: 741  LKGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASGISSSDIRAEEDG 562
            L+  ++  ++++FP     D EK  F  + + +N SM+  +   K + ++SS +    +G
Sbjct: 294  LRWRDDLKENRLFPKM-DEDTEKE-FAKETNWSNLSMNVGIQGGKRN-VNSSYVDGRLNG 350

Query: 561  ------NFLNQGNSRGSLAQLENENKSNEFRLPSLTAQLDLNAQDENDAASSCKQFDLNG 400
                   F      R    + E +    E++LP    +LDLN  D+ DAAS+CKQ DLNG
Sbjct: 351  IDADIKLFHQAATDRSDSTKPEKQVSPQEYKLPYFAPKLDLNTDDQTDAASNCKQLDLNG 410

Query: 399  FSWS 388
            FSW+
Sbjct: 411  FSWN 414


>ref|XP_006339130.1| PREDICTED: uncharacterized protein LOC102602766 isoform X1 [Solanum
            tuberosum]
          Length = 415

 Score =  399 bits (1026), Expect = e-108
 Identities = 230/424 (54%), Positives = 284/424 (66%), Gaps = 21/424 (4%)
 Frame = -3

Query: 1596 HHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFVEA 1417
            +HHHHQ +N  PS+RM  P ERH+FLQGGNA GD+GLVLSTDAKPRLKWT +LHERF+EA
Sbjct: 2    YHHHHQASNMHPSTRMSFP-ERHLFLQGGNANGDSGLVLSTDAKPRLKWTPDLHERFIEA 60

Query: 1416 VNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTN-SGNSKNGCMGRM 1240
            V QLGGADKATPK+V+KLMGIPGLTLYHLKSHLQKYRLSKN HGQ N SG +K       
Sbjct: 61   VTQLGGADKATPKSVLKLMGIPGLTLYHLKSHLQKYRLSKNHHGQANLSGVNKAAAS--- 117

Query: 1239 LGDRMSEINGAPVTNTSLRPQ-TNNLQISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQG 1063
              +++ E  G+P +N S+ PQ  NN+ ISEA+QMQI+VQRRLHEQLEVQRHLQLRIEAQG
Sbjct: 118  -MEKICESTGSPTSNPSIGPQPNNNIPISEAIQMQIDVQRRLHEQLEVQRHLQLRIEAQG 176

Query: 1062 KYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLRYHQ 916
            KYLQ+VLEKAQETLG Q  G+ GF  AKV LS+ VSKVS            EL      Q
Sbjct: 177  KYLQAVLEKAQETLGTQNLGTIGFEAAKVQLSDLVSKVSNQCLNSAFSEIQELSGFHTPQ 236

Query: 915  AQTTQP-TDCSIDSCLTSCEGSHRD-QEIPNTGIGLRLYQLNPPLCTKEIREESRLEQYE 742
             Q TQ   DCS+DSCLTS EG  RD QE+ N  +GLR     P  CT+EI  ++RL+Q  
Sbjct: 237  TQATQRLADCSMDSCLTSSEGPLRDLQEMHNNQLGLRTLNFGP--CTEEIENQTRLQQTA 294

Query: 741  LKGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASGISSSDIRAEEDG 562
            L+  ++  ++++FP     D EK  F  + + +N SM+  +   K + ++SS +    +G
Sbjct: 295  LRWRDDLKENRLFPKM-DEDTEKE-FAKETNWSNLSMNVGIQGGKRN-VNSSYVDGRLNG 351

Query: 561  ------NFLNQGNSRGSLAQLENENKSNEFRLPSLTAQLDLNAQDENDAASSCKQFDLNG 400
                   F      R    + E +    E++LP    +LDLN  D+ DAAS+CKQ DLNG
Sbjct: 352  IDADIKLFHQAATDRSDSTKPEKQVSPQEYKLPYFAPKLDLNTDDQTDAASNCKQLDLNG 411

Query: 399  FSWS 388
            FSW+
Sbjct: 412  FSWN 415


>ref|XP_004288533.1| PREDICTED: uncharacterized protein LOC101304811 [Fragaria vesca
            subsp. vesca]
          Length = 418

 Score =  395 bits (1015), Expect = e-107
 Identities = 230/432 (53%), Positives = 284/432 (65%), Gaps = 27/432 (6%)
 Frame = -3

Query: 1602 MYHHHHH--------QGNN--SFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLK 1453
            MYHHHHH        Q  N  S  S+RM IPPE H  LQ  +   D+GLVLSTDAKPRLK
Sbjct: 2    MYHHHHHLHQHQHQHQAKNIHSSSSNRMAIPPETHRLLQPESCPEDSGLVLSTDAKPRLK 61

Query: 1452 WTCELHERFVEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNS 1273
            WT +LHERF+EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLH   NS
Sbjct: 62   WTPDLHERFIEAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHVHANS 121

Query: 1272 GNSKNGCMGRMLGDRMSEINGAPVTNTSLRPQTN-NLQISEALQMQIEVQRRLHEQLEVQ 1096
            G +       + G+R+SE+NG  + N S+ PQ+N  + I+E LQMQIEVQRRLH+QLEVQ
Sbjct: 122  GGTTKIVAVAVPGERISEVNGTHMNNMSIGPQSNKGIHINETLQMQIEVQRRLHQQLEVQ 181

Query: 1095 RHLQLRIEAQGKYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVSTELPSLRYHQ 916
            RHLQLRIEAQGKYLQSVLEKAQETLG+Q  G+ G   AKV LSE VSKVST+  +  + +
Sbjct: 182  RHLQLRIEAQGKYLQSVLEKAQETLGRQNLGTVGLEAAKVQLSELVSKVSTQCLNSAFTE 241

Query: 915  AQTTQ-------PTDCSIDSCLTSCEGSHRDQEIPNTG-IGLRLYQLNPPLCTKEIREES 760
             +  Q       PTDCS++SCLTS EGS +DQEI N   +GLR Y       +  +  ES
Sbjct: 242  MKEVQGSCPQNPPTDCSMESCLTSSEGSKKDQEIQNNSRMGLRAYN------SSRVLLES 295

Query: 759  RLEQYELKGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASGISSSDI 580
                  LK      ++ +F ++  ++ ++ +FP +  S +FSMS  L RE  +G   S  
Sbjct: 296  EKTMLHLK------ENSMFVSTLTKNADQRMFPSEPRSGDFSMSIGLEREILNG---SHC 346

Query: 579  RAEED-------GNFLNQGNSRGSLAQLENENK-SNEFRLPSLTAQLDLNAQDENDAASS 424
             +EE         +FL+  N+R    +++   K S  +  P   A+LDLN+ D+ DA+SS
Sbjct: 347  NSEERFKARNTIDSFLDNKNNRADSVKVDQSRKVSQGYSGPYFAAKLDLNSHDDTDASSS 406

Query: 423  CKQFDLNGFSWS 388
            CKQFDLN FSWS
Sbjct: 407  CKQFDLNDFSWS 418


>ref|XP_006338935.1| PREDICTED: uncharacterized protein LOC102592272 isoform X3 [Solanum
            tuberosum]
          Length = 410

 Score =  391 bits (1004), Expect = e-106
 Identities = 230/421 (54%), Positives = 281/421 (66%), Gaps = 19/421 (4%)
 Frame = -3

Query: 1596 HHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFVEA 1417
            +HHHHQ  +  PS+RM +P ERH+FLQGGN  GD+GLVLSTDAKPRLKWT +LHERF+EA
Sbjct: 2    YHHHHQEKSMHPSTRMSVP-ERHLFLQGGNGNGDSGLVLSTDAKPRLKWTPDLHERFIEA 60

Query: 1416 VNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTN-SGNSKNGCMGRM 1240
            VNQLGGADKATPK+V+KLMGI GLTLYHLKSHLQKYRLSKNLHGQ N SG +K   +  +
Sbjct: 61   VNQLGGADKATPKSVLKLMGIQGLTLYHLKSHLQKYRLSKNLHGQANASGTNKAVAVAGV 120

Query: 1239 LGDRMSEINGAPVTNTSLRPQTN-NLQISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQG 1063
              +R+SE +   ++N S+ PQ N N+QISEA+QMQIEVQRRLHEQLE      LRIEAQG
Sbjct: 121  --ERISENSATCMSNPSMVPQPNKNIQISEAIQMQIEVQRRLHEQLE------LRIEAQG 172

Query: 1062 KYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLRYHQ 916
            KYLQSVLEKAQETLG+Q   + G    KV LSEFVSK S            EL       
Sbjct: 173  KYLQSVLEKAQETLGRQNMETVGLEAVKVQLSEFVSKASNQCLNSPFPDIKELSGFHSQH 232

Query: 915  AQTTQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLYQLNPPLCTKEIREESRLEQYELK 736
             Q TQPTD SIDSCLTS +GS RD  + +  IGLR +   P +  K+I  ++RL+Q EL+
Sbjct: 233  TQATQPTDRSIDSCLTSRDGSLRDNTMHDNQIGLRPFDFTPSIECKDIENDARLQQTELR 292

Query: 735  GPE--EQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASG-ISSSD---IRA 574
              +  ++N+    P + GR+     F  + + NN SMS  L  EK +G ++ SD      
Sbjct: 293  WCDNLKENRRLFSPMNEGRE---KTFTRETNCNNLSMSIGLQDEKLNGSMNHSDGSFNGT 349

Query: 573  EEDGNFLNQGNSRGSLAQLENENKSNEFRLPSLTAQLDLNAQDENDAASSCKQFDLNGFS 394
            E D    +Q  +R S +  +    S E++L     +LDLN  DE DAASSCKQFDLNGFS
Sbjct: 350  ERDVKLFHQVTNR-SESVPQRHKSSQEYKLSYFQPKLDLNMHDETDAASSCKQFDLNGFS 408

Query: 393  W 391
            W
Sbjct: 409  W 409


>ref|XP_007150070.1| hypothetical protein PHAVU_005G123900g [Phaseolus vulgaris]
            gi|561023334|gb|ESW22064.1| hypothetical protein
            PHAVU_005G123900g [Phaseolus vulgaris]
          Length = 430

 Score =  386 bits (992), Expect = e-104
 Identities = 225/435 (51%), Positives = 280/435 (64%), Gaps = 30/435 (6%)
 Frame = -3

Query: 1602 MYHHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFV 1423
            MYHHH HQG N   +SRMPIP ERH+FLQ GN  GD+GLVLSTDAKPRLKWT +LH RF+
Sbjct: 1    MYHHHRHQGKNIHSTSRMPIPSERHMFLQTGNGSGDSGLVLSTDAKPRLKWTPDLHARFI 60

Query: 1422 EAVNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNSGNSKNGCMGR 1243
            EAVNQLGGADKATPKTVMKLMGI GLTLYHLKSHLQKYRLSKNLHGQ+N+   K      
Sbjct: 61   EAVNQLGGADKATPKTVMKLMGISGLTLYHLKSHLQKYRLSKNLHGQSNNVTHKM-TTSA 119

Query: 1242 MLGDRMSEINGAPVTNTSLRPQTNN-----------LQISEALQMQIEVQRRLHEQLEVQ 1096
              G+R+SE +G  ++  SL PQ NN           L I EALQMQIEVQRRL+EQLEVQ
Sbjct: 120  TTGERLSETSGTHMSKLSLGPQANNHANFQCLLSKDLHIGEALQMQIEVQRRLNEQLEVQ 179

Query: 1095 RHLQLRIEAQGKYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST--------- 943
            +HLQLRIEAQGKYLQSVLEKAQ+TLG+Q  G  G  TAKV LSE VSKVS+         
Sbjct: 180  KHLQLRIEAQGKYLQSVLEKAQDTLGRQNLGIIGLETAKVQLSELVSKVSSQCLNSAFSE 239

Query: 942  --ELPSLRYHQAQTTQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLYQLNPPLCTKEIR 769
              EL      Q  T QP DCS+DSCLTSC+   ++Q+I N+   LR +  +  +  KE  
Sbjct: 240  LKELQGFCPQQTHTNQPNDCSMDSCLTSCDILQKEQKIQNS---LRQFNSHVFMEQKEST 296

Query: 768  E-ESRLEQYELKGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHRE---KAS 601
            +  + L   ELK  ++  K   F     +  E+  +  +    N SMS  L RE   ++S
Sbjct: 297  DARNNLRNSELKWCDD-GKKNTFLAPLSKTEERRKYAAETGPGNLSMSIGLERETENRSS 355

Query: 600  GISSSDIR-AEEDGNFLNQGNSRGSLAQLENENKSNEFRLPS---LTAQLDLNAQDENDA 433
                S I+ ++ +G F ++   +    +  +E    ++R+P+   +  +LDLN   +N+A
Sbjct: 356  MYPESLIKESQSEGEFQHRNRIKTETMKAVDEKVCQDYRMPASYFVATRLDLNNHGDNEA 415

Query: 432  ASSCKQFDLNGFSWS 388
            A++CKQ DLN FSWS
Sbjct: 416  ATTCKQLDLNRFSWS 430


>ref|XP_006339132.1| PREDICTED: uncharacterized protein LOC102602766 isoform X3 [Solanum
            tuberosum]
          Length = 409

 Score =  381 bits (978), Expect = e-103
 Identities = 224/424 (52%), Positives = 278/424 (65%), Gaps = 21/424 (4%)
 Frame = -3

Query: 1596 HHHHHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFVEA 1417
            +HHHHQ +N  PS+RM  P ERH+FLQGGNA GD+GLVLSTDAKPRLKWT +LHERF+EA
Sbjct: 2    YHHHHQASNMHPSTRMSFP-ERHLFLQGGNANGDSGLVLSTDAKPRLKWTPDLHERFIEA 60

Query: 1416 VNQLGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTN-SGNSKNGCMGRM 1240
            V QLGGADKATPK+V+KLMGIPGLTLYHLKSHLQKYRLSKN HGQ N SG +K       
Sbjct: 61   VTQLGGADKATPKSVLKLMGIPGLTLYHLKSHLQKYRLSKNHHGQANLSGVNKAAAS--- 117

Query: 1239 LGDRMSEINGAPVTNTSLRPQ-TNNLQISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQG 1063
              +++ E  G+P +N S+ PQ  NN+ ISEA+QMQI+VQRRLHEQLE      LRIEAQG
Sbjct: 118  -MEKICESTGSPTSNPSIGPQPNNNIPISEAIQMQIDVQRRLHEQLE------LRIEAQG 170

Query: 1062 KYLQSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVST-----------ELPSLRYHQ 916
            KYLQ+VLEKAQETLG Q  G+ GF  AKV LS+ VSKVS            EL      Q
Sbjct: 171  KYLQAVLEKAQETLGTQNLGTIGFEAAKVQLSDLVSKVSNQCLNSAFSEIQELSGFHTPQ 230

Query: 915  AQTTQP-TDCSIDSCLTSCEGSHRD-QEIPNTGIGLRLYQLNPPLCTKEIREESRLEQYE 742
             Q TQ   DCS+DSCLTS EG  RD QE+ N  +GLR     P  CT+EI  ++RL+Q  
Sbjct: 231  TQATQRLADCSMDSCLTSSEGPLRDLQEMHNNQLGLRTLNFGP--CTEEIENQTRLQQTA 288

Query: 741  LKGPEEQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASGISSSDIRAEEDG 562
            L+  ++  ++++FP     D EK  F  + + +N SM+  +   K + ++SS +    +G
Sbjct: 289  LRWRDDLKENRLFPKM-DEDTEKE-FAKETNWSNLSMNVGIQGGKRN-VNSSYVDGRLNG 345

Query: 561  ------NFLNQGNSRGSLAQLENENKSNEFRLPSLTAQLDLNAQDENDAASSCKQFDLNG 400
                   F      R    + E +    E++LP    +LDLN  D+ DAAS+CKQ DLNG
Sbjct: 346  IDADIKLFHQAATDRSDSTKPEKQVSPQEYKLPYFAPKLDLNTDDQTDAASNCKQLDLNG 405

Query: 399  FSWS 388
            FSW+
Sbjct: 406  FSWN 409


>ref|XP_006829830.1| hypothetical protein AMTR_s00119p00095480 [Amborella trichopoda]
            gi|548835411|gb|ERM97246.1| hypothetical protein
            AMTR_s00119p00095480 [Amborella trichopoda]
          Length = 412

 Score =  380 bits (976), Expect = e-103
 Identities = 227/419 (54%), Positives = 268/419 (63%), Gaps = 19/419 (4%)
 Frame = -3

Query: 1587 HHQGNNSFPSSRMPIPPERHVFLQGGNARGDAGLVLSTDAKPRLKWTCELHERFVEAVNQ 1408
            ++QG N F SSR  +PPER + L G N +GD GLVLSTDAKPRLKWT ELHERFVEAV Q
Sbjct: 2    YYQGKNCFSSSRATMPPERPLLLHGANIQGDPGLVLSTDAKPRLKWTPELHERFVEAVAQ 61

Query: 1407 LGGADKATPKTVMKLMGIPGLTLYHLKSHLQKYRLSKNLHGQTNSGNSKNGCMGRMLGDR 1228
            LGGADKATPK VM++MGIPGLTLYHLKSHLQKYRLSKNL  Q+N G+ KNG       +R
Sbjct: 62   LGGADKATPKNVMRVMGIPGLTLYHLKSHLQKYRLSKNLRSQSN-GSDKNG--PAAAAER 118

Query: 1227 MSEING--APVTNTSLRPQTNNLQISEALQMQIEVQRRLHEQLEVQRHLQLRIEAQGKYL 1054
            MS  NG    + N ++      LQI+EALQM IEVQRRLHEQLEVQRHLQLRIEAQGKYL
Sbjct: 119  MSHTNGPHMGMANMAVSGTNKGLQINEALQMHIEVQRRLHEQLEVQRHLQLRIEAQGKYL 178

Query: 1053 QSVLEKAQETLGKQTPGSTGFGTAKVPLSEFVSKVS-----------TELPSLRYHQAQT 907
            QSVLEKAQETL KQ PGS+G    +  +SE VS+VS           TE PSL   QAQ 
Sbjct: 179  QSVLEKAQETLAKQNPGSSGLEATRAQISELVSQVSAECLNSAFSGLTEAPSLNNQQAQK 238

Query: 906  TQPTDCSIDSCLTSCEGSHRDQEIPNTGIGLRLYQLNPPLCTKEIREESRLEQYELKGPE 727
            +   DCS+DSCLTSCEG  +DQE  N  IGL  Y  N  L  K  REE R+++      E
Sbjct: 239  SHLADCSMDSCLTSCEGPQKDQETQNISIGLG-YHSNSLLWQKAEREEFRVQRPNHSTGE 297

Query: 726  EQNKSKVFPTSGGRDIEKMIFPIQRSSNNFSMSARLHREKASGISSSDIRAEEDG--NFL 553
                +K +  S  R +  +      + NN     R   +K +  ++SD R +E G     
Sbjct: 298  SLKDTKHYSPSPERKMHLLTSSFGDTINN----VRAPGDKGACSTNSDARRKERGFEGAC 353

Query: 552  NQGNSRGSLAQ----LENENKSNEFRLPSLTAQLDLNAQDENDAASSCKQFDLNGFSWS 388
             +   + S+A     L+ E      RL + TA LDLNA DENDA+S CK+FDLNGFSWS
Sbjct: 354  GEPPRKRSVASRTQALDIEQTEVFDRLSNHTAVLDLNAHDENDASSECKEFDLNGFSWS 412

