BLASTX nr result

ID: Forsythia22_contig00010031

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00010031
         (2168 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009762887.1| PREDICTED: mediator of RNA polymerase II tra...   204   3e-49
ref|XP_006347908.1| PREDICTED: chromatin modification-related pr...   201   2e-48
ref|XP_004229784.1| PREDICTED: uncharacterized protein LOC101243...   199   8e-48
ref|XP_010661316.1| PREDICTED: uncharacterized protein LOC104881...   195   1e-46
ref|XP_006347907.1| PREDICTED: chromatin modification-related pr...   192   7e-46
emb|CBI16584.3| unnamed protein product [Vitis vinifera]              190   5e-45
ref|XP_011093297.1| PREDICTED: uncharacterized protein LOC105173...   189   1e-44
ref|XP_010661317.1| PREDICTED: uncharacterized protein LOC104881...   189   1e-44
emb|CAN65350.1| hypothetical protein VITISV_000640 [Vitis vinifera]   189   1e-44
ref|XP_008232574.1| PREDICTED: uncharacterized protein LOC103331...   184   3e-43
ref|XP_010112019.1| hypothetical protein L484_001626 [Morus nota...   182   1e-42
ref|XP_007219291.1| hypothetical protein PRUPE_ppa018574mg [Prun...   176   9e-41
ref|XP_006347909.1| PREDICTED: chromatin modification-related pr...   175   1e-40
ref|XP_009611448.1| PREDICTED: glutenin, high molecular weight s...   175   2e-40
ref|XP_006490851.1| PREDICTED: uncharacterized protein LOC102607...   162   1e-36
gb|KDO85697.1| hypothetical protein CISIN_1g007901mg [Citrus sin...   160   3e-36
ref|XP_006445326.1| hypothetical protein CICLE_v10023836mg, part...   155   1e-34
gb|KDO85698.1| hypothetical protein CISIN_1g007901mg [Citrus sin...   155   1e-34
ref|XP_004308473.2| PREDICTED: uncharacterized protein LOC101306...   152   1e-33
ref|XP_007052171.1| Uncharacterized protein TCM_005599 [Theobrom...   149   1e-32

>ref|XP_009762887.1| PREDICTED: mediator of RNA polymerase II transcription subunit
            12-like [Nicotiana sylvestris]
          Length = 634

 Score =  204 bits (518), Expect = 3e-49
 Identities = 165/468 (35%), Positives = 203/468 (43%), Gaps = 31/468 (6%)
 Frame = -1

Query: 1811 QHQQQGEPT---PPGVTVPQP--------ITQQAGEQAQSSY-YYPHAVALNIDPQQXXX 1668
            Q + + EPT   PPGV +P P        I QQ  +Q Q  Y YYP         QQ   
Sbjct: 92   QPEPESEPTSIHPPGVPIPPPSSSDPYASIQQQQQQQPQPQYSYYPQ--------QQQQG 143

Query: 1667 XXXXXXXXGAMPTGLHQPIS-QALYGGTGILDGGPSRGAQRQFGPKPNVXXXXXXXXXXX 1491
                     +MP   H P + Q+ Y G G   G   RG     G   ++           
Sbjct: 144  INYGEVATVSMPIVSHTPANVQSPYRGRGKRGGRSYRG-----GAHAHLGGGKLQPSYDQ 198

Query: 1490 XXXGKHASVA----TQVSLPVADEGHSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXX 1323
                + AS      +QV    +  G+S  V PPP+MA CELCRV+CNT E+LEQ      
Sbjct: 199  PNSFQGASQIQGGPSQVKPSSSASGNSGSVQPPPRMAVCELCRVECNTPEVLEQHKNGKK 258

Query: 1322 XXXXXKVYEELQRLNKDLTGGQNEQPLTFELKPEGSSQP-VQSEGD------------GT 1182
                 K  EEL++ NK + GGQ  Q   F  KP    QP V   G             G 
Sbjct: 259  HKKNLKANEELKKRNKSMDGGQGHQTANFNFKPNVYYQPEVGGAGQLPQGHLPFEVVTGN 318

Query: 1181 KQ-PPQENLPSQAVGEENRVSTXXXXXXXXXXXXXLAQNLRMNHYEXXXXXXXXXXXXXX 1005
            +Q  PQEN PS AV E+  + T             LA++L M+  +              
Sbjct: 319  RQSQPQENFPSHAVLEDGSI-TGKQKVEEAAPVEELARDLGMDRVQ-GQGRGLKRMLRGG 376

Query: 1004 XXXXXXXXNDGSRRPIEPPMPKGFVPLICELCNVKCESVITFQGHLVGKKHQSNAKRFQG 825
                    ++GSR+P  PP PK  VPLICELCNV CESV+ FQ HL GKKH SN K FQG
Sbjct: 377  RGGKLMKLHNGSRKPAVPPKPKKMVPLICELCNVTCESVVVFQSHLAGKKHLSNVKDFQG 436

Query: 824  HQDIIGXXXXXXXXXXXXXXXXXXQVLCQQNPNASTSLAPQIIPQGFPGPEGNFAQPGHS 645
             Q ++G                  Q LCQ N  ASTS+APQ          G   Q   S
Sbjct: 437  QQAMVGEAALQALYPALQALYPALQALCQPNTGASTSMAPQGHQPNLHEILGMLTQQALS 496

Query: 644  VWTQGQASTVGPVALTAMPPPLAMETQDPQTINLQGLASETGTQNAAT 501
               Q Q   +G VA +A+PP   +E QD Q   LQG  SE   +NAAT
Sbjct: 497  AIPQDQVLGIGIVANSALPPSSDLEAQDHQGSILQGSVSEKTRENAAT 544


>ref|XP_006347908.1| PREDICTED: chromatin modification-related protein EAF1-like isoform
            X2 [Solanum tuberosum]
          Length = 612

 Score =  201 bits (511), Expect = 2e-48
 Identities = 170/524 (32%), Positives = 213/524 (40%), Gaps = 43/524 (8%)
 Frame = -1

Query: 1811 QHQQQGEPT---PPGVTVPQPITQQAGEQAQSSYYYPHAVALNIDPQQXXXXXXXXXXXG 1641
            Q Q + EPT   PPGV +P      A +Q     YYP         QQ            
Sbjct: 85   QQQYRPEPTSIHPPGVPIPPSSDPYATQQQPQYSYYP---------QQQQGINYGEVATV 135

Query: 1640 AMPTGLHQP-ISQALYGGTGILDGGPSRGAQR------QFGPKPNVXXXXXXXXXXXXXX 1482
             MP   H P I+ + Y G G   G P RG         Q  P   +              
Sbjct: 136  TMPNVSHTPAIAPSPYKGKGKRGGRPYRGGAHGHLGGGQQQPSYTLPTYAENQPKVFQGA 195

Query: 1481 GKHASVATQVSLPVADEGHSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXXXXXKV 1302
             +     +QV    +  G+SAPV  PP+MAWCELCR++CNT E+LEQ           K 
Sbjct: 196  SQIQGGLSQVKPSSSASGNSAPV-RPPRMAWCELCRIECNTPEVLEQHKNGKKHKKNLKA 254

Query: 1301 YEELQRLNKDLTGGQNEQPLTFELKPEGSSQP-VQSEGD------------GTKQP-PQE 1164
            YEE Q+LNK + G    Q    E+KP  S Q  V+  G             G +QP PQE
Sbjct: 255  YEERQKLNKQMDGAHGNQTTNSEVKPRVSYQSAVEGSGQLPLGNLPSETVTGDRQPLPQE 314

Query: 1163 NLPSQAVGEEN-------RVSTXXXXXXXXXXXXXLAQNLRMNHYEXXXXXXXXXXXXXX 1005
             LP QAV EE+       +V               + +NLR                   
Sbjct: 315  KLPPQAVIEEDVGITGKQKVEETDPVDHIQGQGRGVKRNLRGGR---------------- 358

Query: 1004 XXXXXXXXNDGSRRPIEPPMPKGFVPLICELCNVKCESVITFQGHLVGKKHQSNAKRFQG 825
                     +GSR+P   P P+  VPLICELCNVKCESV+ FQ HL G+KH SN K FQG
Sbjct: 359  -GGKLMKTQNGSRKPAVAPKPQKMVPLICELCNVKCESVVVFQSHLAGRKHLSNVKDFQG 417

Query: 824  HQDIIGXXXXXXXXXXXXXXXXXXQVLCQQNPNASTSLAPQIIPQGFPGPEGNFAQPGHS 645
             Q ++G                  Q LCQ +  ASTS+APQ          G   Q   S
Sbjct: 418  QQAMVGQAALQALYPALQALYPALQALCQPSSGASTSVAPQGHQHNLHEILGILTQQALS 477

Query: 644  VWTQGQASTVGPVALTAMPPPLAMETQDPQTINLQGLASETGTQNAATEEA--------- 492
               Q Q   +G    +A PPP  +  QD Q   LQG  SE   +NAA E+          
Sbjct: 478  AIPQDQLLGIGAAVASAFPPPSDLLAQDHQGSKLQGSVSEETRENAAAEDGRNCDLSVLP 537

Query: 491  ---SSNVQSDINRSSDSSVVVENVATGSELVLSGTRTD*VSSMS 369
               S   +S  N+  + ++ VE  A   E  L    +  VSS S
Sbjct: 538  STESKPEESTDNKHENVNLEVERKAMSVEEPLRFGTSGDVSSTS 581


>ref|XP_004229784.1| PREDICTED: uncharacterized protein LOC101243826 [Solanum
            lycopersicum]
          Length = 640

 Score =  199 bits (506), Expect = 8e-48
 Identities = 159/485 (32%), Positives = 198/485 (40%), Gaps = 46/485 (9%)
 Frame = -1

Query: 1811 QHQQQGEPT---PPGVTVPQPITQQAGEQAQSSYYYPHAVALNIDPQQXXXXXXXXXXXG 1641
            Q Q Q EPT   PPGV +P      A +Q     YYP    +N                 
Sbjct: 99   QQQYQHEPTSIHPPGVPIPPSTDPYATQQQPQYSYYPQQQGINYGE----------VATV 148

Query: 1640 AMPTGLHQPISQALYGGTGILDGGPSRGAQR------QFGPKPNVXXXXXXXXXXXXXXG 1479
             MP      I+ + Y G G   G P RG         Q  P   +               
Sbjct: 149  TMPNVSTPTITPSPYKGKGKRGGRPYRGGAHGHLGGGQLQPIYTLPTYAENQPKVFQGAS 208

Query: 1478 KHASVATQVSLPVADEGHSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXXXXXKVY 1299
            +     +QV    +  G+SAPV  PP+MAWCELCR++CNT E+LEQ           K Y
Sbjct: 209  QIQGGLSQVKPSSSASGNSAPV-RPPRMAWCELCRIECNTPEVLEQHKNGKKHKKNLKAY 267

Query: 1298 EELQRLNKDLTGGQNEQPLTFELKPEGSSQP-VQSEGD------------GTKQP-PQEN 1161
            EE Q+LNK + G    Q +  E+KP  S QP V+  G             G +QP PQE 
Sbjct: 268  EERQKLNKQMDGAHGNQTINSEVKPRISYQPAVEGSGQLPLGHLPSETVTGDRQPLPQEK 327

Query: 1160 LPSQAVGEEN-------RVSTXXXXXXXXXXXXXLAQNLRMNHYEXXXXXXXXXXXXXXX 1002
            LP Q+V EE+       +V               + +NLR                    
Sbjct: 328  LPPQSVIEEDVGITGKQKVEETDPLDHIQGQGRGVKRNLRGGR----------------- 370

Query: 1001 XXXXXXXNDGSRRPIEPPMPKGFVPLICELCNVKCESVITFQGHLVGKKHQSNAKRFQGH 822
                    +GSR+P  PP P+  VPLICELCNVKCESV+ FQ HL G+KH SN K FQG 
Sbjct: 371  GSKLMKTQNGSRKPAVPPKPQKMVPLICELCNVKCESVVVFQSHLAGRKHLSNVKDFQGQ 430

Query: 821  QDIIGXXXXXXXXXXXXXXXXXXQVLCQQNPNASTSLAPQIIPQGFPGPEGNFAQPGHSV 642
            Q ++G                  Q LCQ N  ASTS+APQ          G   Q   S 
Sbjct: 431  QAMVGQAALQALYPALQALYPALQALCQPNSGASTSVAPQGHQHNLHEILGILTQQALSA 490

Query: 641  WTQGQASTVGPVALTAMP----------------PPLAMETQDPQTINLQGLASETGTQN 510
              Q Q   +G  A  A+P                PP  +  QD Q   LQG  SE  ++N
Sbjct: 491  IPQDQLLGIGAAATLAIPQDQLLGIGAAVASAFSPPSDLLAQDNQGSKLQGSVSEETSEN 550

Query: 509  AATEE 495
            AA E+
Sbjct: 551  AAAED 555


>ref|XP_010661316.1| PREDICTED: uncharacterized protein LOC104881792 isoform X1 [Vitis
            vinifera]
          Length = 628

 Score =  195 bits (496), Expect = 1e-46
 Identities = 185/559 (33%), Positives = 217/559 (38%), Gaps = 94/559 (16%)
 Frame = -1

Query: 1808 HQQQGEPT---PPGVTVPQPITQQAGE-----QAQSSYYYPHAV---------------- 1701
            +Q Q +PT   PPGV +P   T   G      QA  + YYP  V                
Sbjct: 58   YQNQQQPTSIHPPGVPIPPEPTHTVGPDYAHLQAPHNAYYPQGVHEQQQQQQQQHMGYPD 117

Query: 1700 -----------ALNIDPQQXXXXXXXXXXXGAMPTGLHQPISQALYGGTGILDGGPSRGA 1554
                       A N+D  Q           G  P  +  PI Q+ Y G G   G P RG 
Sbjct: 118  SAQAGSNLFQMAGNMDSAQGSAEQWQVNNGGFGPGPVRPPIGQSSYRGGGRRGGRPFRGG 177

Query: 1553 QR--------QFGPKPNVXXXXXXXXXXXXXXGKH--ASVATQVSLPVA----------- 1437
             R        QFGP                    H  AS     S+P A           
Sbjct: 178  GRSFRGGGRGQFGPHGFGPDGSGRGQGGGRYFPPHNAASTPNLGSVPTAEGPGALIPGEA 237

Query: 1436 ------------------DEG---------HSAPVWPPPQMAWCELCRVDCNTLEILEQX 1338
                              D G         H  P W  P MAWCELCRVDCNTLEILEQ 
Sbjct: 238  SQLQGKTPQAFMQPLSGSDPGQAQFPAMAQHGNPFWRSPCMAWCELCRVDCNTLEILEQH 297

Query: 1337 XXXXXXXXXXKVYEELQRLNKDLTGGQNEQPLTFELKPEGSSQPVQSEGDGTKQPPQENL 1158
                       VY+ELQ LNK +TG QNEQ    + KP    Q +QSE  G  +  Q   
Sbjct: 298  KNGKRHKKNLLVYQELQNLNKLITGVQNEQMPISDFKP----QLIQSERVGGSEDKQ--- 350

Query: 1157 PSQAVG----EENRVSTXXXXXXXXXXXXXLAQNLRMNHYEXXXXXXXXXXXXXXXXXXX 990
            PSQ  G    E+ + +                +  RM+H++                   
Sbjct: 351  PSQGTGANGTEKEQQTEAEKSEVSAQPTEEQERKARMDHFQ---------APGRGLKRKM 401

Query: 989  XXXNDGSR-RPIEPPMPKGFVPLICELCNVKCESVITFQGHLVGKKHQSNAKRFQGHQDI 813
                 G R R  EPP PK  +PLICELCNVKCES + F  HL GKKH SN KRF G+Q I
Sbjct: 402  RGGRGGKRMRQFEPPKPKEMIPLICELCNVKCESQVVFDSHLAGKKHHSNLKRFHGYQAI 461

Query: 812  IGXXXXXXXXXXXXXXXXXXQVLCQQNPNA-STSLAPQIIPQGFPGPEGNFAQPGHSVWT 636
            I                   Q L   NPNA S    PQ+  QG  G +G  AQP      
Sbjct: 462  IA---------------GALQALIPSNPNAPSNFFIPQVHQQGVSGSQGLPAQP-MPYMQ 505

Query: 635  QGQASTV--GPVALTAMPPPLAMETQDPQ---TINLQGLASETGTQNAATEEASSNVQSD 471
            QGQA  +  GP +     P  A+ETQD +   T+  Q   SE G QN  T EA+S +Q D
Sbjct: 506  QGQAPGMAPGPASEPEPAPVSALETQDKEGTKTVESQA-TSEAGGQNTVTAEANSQLQPD 564

Query: 470  INRSSDSSVVVENVATGSE 414
            I  S  SS V  N    SE
Sbjct: 565  IIASEASSGVSTNTTIVSE 583


>ref|XP_006347907.1| PREDICTED: chromatin modification-related protein EAF1-like isoform
            X1 [Solanum tuberosum]
          Length = 628

 Score =  192 bits (489), Expect = 7e-46
 Identities = 172/540 (31%), Positives = 216/540 (40%), Gaps = 59/540 (10%)
 Frame = -1

Query: 1811 QHQQQGEPT---PPGVTVPQPITQQAGEQAQSSYYYPHAVALNIDPQQXXXXXXXXXXXG 1641
            Q Q + EPT   PPGV +P      A +Q     YYP         QQ            
Sbjct: 85   QQQYRPEPTSIHPPGVPIPPSSDPYATQQQPQYSYYP---------QQQQGINYGEVATV 135

Query: 1640 AMPTGLHQP-ISQALYGGTGILDGGPSRGAQR------QFGPKPNVXXXXXXXXXXXXXX 1482
             MP   H P I+ + Y G G   G P RG         Q  P   +              
Sbjct: 136  TMPNVSHTPAIAPSPYKGKGKRGGRPYRGGAHGHLGGGQQQPSYTLPTYAENQPKVFQGA 195

Query: 1481 GKHASVATQVSLPVADEGHSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXXXXXKV 1302
             +     +QV    +  G+SAPV  PP+MAWCELCR++CNT E+LEQ           K 
Sbjct: 196  SQIQGGLSQVKPSSSASGNSAPV-RPPRMAWCELCRIECNTPEVLEQHKNGKKHKKNLKA 254

Query: 1301 YEELQRLNKDLTGGQNEQPLTFELKPEGSSQP-VQSEGD------------GTKQP-PQE 1164
            YEE Q+LNK + G    Q    E+KP  S Q  V+  G             G +QP PQE
Sbjct: 255  YEERQKLNKQMDGAHGNQTTNSEVKPRVSYQSAVEGSGQLPLGNLPSETVTGDRQPLPQE 314

Query: 1163 NLPSQAVGEEN-------RVSTXXXXXXXXXXXXXLAQNLRMNHYEXXXXXXXXXXXXXX 1005
             LP QAV EE+       +V               + +NLR                   
Sbjct: 315  KLPPQAVIEEDVGITGKQKVEETDPVDHIQGQGRGVKRNLRGGR---------------- 358

Query: 1004 XXXXXXXXNDGSRRPIEPPMPKGFVPLICELCNVKCESVITFQGHLVGKKHQSNAKRFQG 825
                     +GSR+P   P P+  VPLICELCNVKCESV+ FQ HL G+KH SN K FQG
Sbjct: 359  -GGKLMKTQNGSRKPAVAPKPQKMVPLICELCNVKCESVVVFQSHLAGRKHLSNVKDFQG 417

Query: 824  HQDIIGXXXXXXXXXXXXXXXXXXQVLCQQNPNASTSLAPQIIPQGFPGPEGNFAQPGHS 645
             Q ++G                  Q LCQ +  ASTS+APQ          G   Q   S
Sbjct: 418  QQAMVGQAALQALYPALQALYPALQALCQPSSGASTSVAPQGHQHNLHEILGILTQQALS 477

Query: 644  VWTQ------GQAST----------VGPVALTAMPPPLAMETQDPQTINLQGLASETGTQ 513
               Q      G A+T          +G    +A PPP  +  QD Q   LQG  SE   +
Sbjct: 478  AIPQDQLLGIGAAATSAIPQDQLLGIGAAVASAFPPPSDLLAQDHQGSKLQGSVSEETRE 537

Query: 512  NAATEEA------------SSNVQSDINRSSDSSVVVENVATGSELVLSGTRTD*VSSMS 369
            NAA E+             S   +S  N+  + ++ VE  A   E  L    +  VSS S
Sbjct: 538  NAAAEDGRNCDLSVLPSTESKPEESTDNKHENVNLEVERKAMSVEEPLRFGTSGDVSSTS 597


>emb|CBI16584.3| unnamed protein product [Vitis vinifera]
          Length = 679

 Score =  190 bits (482), Expect = 5e-45
 Identities = 151/411 (36%), Positives = 179/411 (43%), Gaps = 11/411 (2%)
 Frame = -1

Query: 1613 ISQALYGGTGILDGGPSRGAQRQFGPKPNVXXXXXXXXXXXXXXGKHASVATQVSLPVAD 1434
            I Q+ Y G G   G P RG  R F                     +  ++A         
Sbjct: 58   IGQSSYRGGGRRGGRPFRGGGRSFRGGGRGQFGPHGFGPDDPGQAQFPAMAQ-------- 109

Query: 1433 EGHSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXXXXXKVYEELQRLNKDLTGGQN 1254
              H  P W  P MAWCELCRVDCNTLEILEQ            VY+ELQ LNK +TG QN
Sbjct: 110  --HGNPFWRSPCMAWCELCRVDCNTLEILEQHKNGKRHKKNLLVYQELQNLNKLITGVQN 167

Query: 1253 EQPLTFELKPEGSSQPVQSEGDGTKQPPQENLPSQAVG----EENRVSTXXXXXXXXXXX 1086
            EQ    + KP    Q +QSE  G  +  Q   PSQ  G    E+ + +            
Sbjct: 168  EQMPISDFKP----QLIQSERVGGSEDKQ---PSQGTGANGTEKEQQTEAEKSEVSAQPT 220

Query: 1085 XXLAQNLRMNHYEXXXXXXXXXXXXXXXXXXXXXXNDGSR-RPIEPPMPKGFVPLICELC 909
                +  RM+H++                        G R R  EPP PK  +PLICELC
Sbjct: 221  EEQERKARMDHFQ---------APGRGLKRKMRGGRGGKRMRQFEPPKPKEMIPLICELC 271

Query: 908  NVKCESVITFQGHLVGKKHQSNAKRFQGHQDIIGXXXXXXXXXXXXXXXXXXQVLCQQNP 729
            NVKCES + F  HL GKKH SN KRF G+Q II                   Q L   NP
Sbjct: 272  NVKCESQVVFDSHLAGKKHHSNLKRFHGYQAIIA---------------GALQALIPSNP 316

Query: 728  NA-STSLAPQIIPQGFPGPEGNFAQPGHSVWTQGQASTV--GPVALTAMPPPLAMETQDP 558
            NA S    PQ+  QG  G +G  AQP      QGQA  +  GP +     P  A+ETQD 
Sbjct: 317  NAPSNFFIPQVHQQGVSGSQGLPAQP-MPYMQQGQAPGMAPGPASEPEPAPVSALETQDK 375

Query: 557  Q---TINLQGLASETGTQNAATEEASSNVQSDINRSSDSSVVVENVATGSE 414
            +   T+  Q   SE G QN  T EA+S +Q DI  S  SS V  N    SE
Sbjct: 376  EGTKTVESQA-TSEAGGQNTVTAEANSQLQPDIIASEASSGVSTNTTIVSE 425


>ref|XP_011093297.1| PREDICTED: uncharacterized protein LOC105173303 [Sesamum indicum]
          Length = 526

 Score =  189 bits (479), Expect = 1e-44
 Identities = 139/426 (32%), Positives = 181/426 (42%), Gaps = 45/426 (10%)
 Frame = -1

Query: 1949 MDYPANYPQQAYDPSSXXXXXXXXXXXXXXXXXXXXXXXXXXXXY--------PQHQQQG 1794
            M+Y ++Y QQ YDPSS                            Y        P   +Q 
Sbjct: 1    MNYSSHYQQQTYDPSSLTQQIYDQPANDYYTYSYPNPQYTSVQPYSYPNLIVQPSQHEQQ 60

Query: 1793 EPTPPGVT-VPQPITQQAGEQAQSSYYYPHAVA---LNIDPQQXXXXXXXXXXXGAMPTG 1626
            E  PPGVT +P P   Q  +Q+ +  Y+ H  A   L + PQ                  
Sbjct: 61   ELHPPGVTALPPPPPHQ--DQSSNFQYHAHTAAAAVLPVGPQHGGVDSGIGV-------- 110

Query: 1625 LHQPISQALYGGTGILDG-GPSRGAQRQFGPKPNVXXXXXXXXXXXXXXGKHASVATQ-- 1455
            +HQPI Q+ Y G  I++G  P   AQ    P+PNV                H  VA +  
Sbjct: 111  VHQPIVQSSYEGFSIVNGVAPLGAAQTHLSPQPNVRGRPYRVRGRGRGRVMHKHVAARGQ 170

Query: 1454 ------------------------------VSLPVADEGHSAPVWPPPQMAWCELCRVDC 1365
                                          +S  +    H  P  PPP++AWCE CRVDC
Sbjct: 171  ERTISTQGGSHIQGGSLPPMAQPYASSFGPISTAIHSPAHIMPALPPPRLAWCEFCRVDC 230

Query: 1364 NTLEILEQXXXXXXXXXXXKVYEELQRLNKDLTGGQNEQPLTFELKPEGSSQPVQSEGDG 1185
            NTL+ILEQ           KV+EELQ LN  + G Q EQ  + +LK E    P+QS+   
Sbjct: 231  NTLDILEQHKNGKKHKKKLKVFEELQNLNSRVIGRQMEQISSSQLKLE---VPLQSDERS 287

Query: 1184 TKQPPQENLPSQAVGEENRVSTXXXXXXXXXXXXXLAQNLRMNHYEXXXXXXXXXXXXXX 1005
             KQ  QE+LPSQA+ E+N+V+                + + ++H E              
Sbjct: 288  EKQIQQESLPSQAINEDNKVAVGNRELEDAEPTEEPGKKV-IDHSE-GLAHGLKRKMRGG 345

Query: 1004 XXXXXXXXNDGSRRPIEPPMPKGFVPLICELCNVKCESVITFQGHLVGKKHQSNAKRFQG 825
                     D S+R +EPP PK  +PL+CELCNVKCES I FQ HL GKKH+S AKRF G
Sbjct: 346  KVGRRTRPCDRSKRTVEPPKPKEVIPLVCELCNVKCESPIVFQSHLAGKKHKSKAKRFLG 405

Query: 824  HQDIIG 807
             Q+  G
Sbjct: 406  QQETFG 411


>ref|XP_010661317.1| PREDICTED: uncharacterized protein LOC104881792 isoform X2 [Vitis
            vinifera]
          Length = 528

 Score =  189 bits (479), Expect = 1e-44
 Identities = 139/349 (39%), Positives = 163/349 (46%), Gaps = 11/349 (3%)
 Frame = -1

Query: 1427 HSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXXXXXKVYEELQRLNKDLTGGQNEQ 1248
            H  P W  P MAWCELCRVDCNTLEILEQ            VY+ELQ LNK +TG QNEQ
Sbjct: 168  HGNPFWRSPCMAWCELCRVDCNTLEILEQHKNGKRHKKNLLVYQELQNLNKLITGVQNEQ 227

Query: 1247 PLTFELKPEGSSQPVQSEGDGTKQPPQENLPSQAVG----EENRVSTXXXXXXXXXXXXX 1080
                + KP    Q +QSE  G  +  Q   PSQ  G    E+ + +              
Sbjct: 228  MPISDFKP----QLIQSERVGGSEDKQ---PSQGTGANGTEKEQQTEAEKSEVSAQPTEE 280

Query: 1079 LAQNLRMNHYEXXXXXXXXXXXXXXXXXXXXXXNDGSR-RPIEPPMPKGFVPLICELCNV 903
              +  RM+H++                        G R R  EPP PK  +PLICELCNV
Sbjct: 281  QERKARMDHFQ---------APGRGLKRKMRGGRGGKRMRQFEPPKPKEMIPLICELCNV 331

Query: 902  KCESVITFQGHLVGKKHQSNAKRFQGHQDIIGXXXXXXXXXXXXXXXXXXQVLCQQNPNA 723
            KCES + F  HL GKKH SN KRF G+Q II                   Q L   NPNA
Sbjct: 332  KCESQVVFDSHLAGKKHHSNLKRFHGYQAIIA---------------GALQALIPSNPNA 376

Query: 722  -STSLAPQIIPQGFPGPEGNFAQPGHSVWTQGQASTV--GPVALTAMPPPLAMETQDPQ- 555
             S    PQ+  QG  G +G  AQP      QGQA  +  GP +     P  A+ETQD + 
Sbjct: 377  PSNFFIPQVHQQGVSGSQGLPAQP-MPYMQQGQAPGMAPGPASEPEPAPVSALETQDKEG 435

Query: 554  --TINLQGLASETGTQNAATEEASSNVQSDINRSSDSSVVVENVATGSE 414
              T+  Q   SE G QN  T EA+S +Q DI  S  SS V  N    SE
Sbjct: 436  TKTVESQA-TSEAGGQNTVTAEANSQLQPDIIASEASSGVSTNTTIVSE 483


>emb|CAN65350.1| hypothetical protein VITISV_000640 [Vitis vinifera]
          Length = 628

 Score =  189 bits (479), Expect = 1e-44
 Identities = 183/560 (32%), Positives = 215/560 (38%), Gaps = 95/560 (16%)
 Frame = -1

Query: 1808 HQQQGEPT---PPGVTVPQPITQQAGE-----QAQSSYYYPHAV---------------- 1701
            +Q Q +PT   PPGV +P   T   G      QA  + YYP  V                
Sbjct: 58   YQNQQQPTSIHPPGVPIPPEPTHTVGPDYAHLQAPHNAYYPQGVHEQQQQQQQQHMGYPD 117

Query: 1700 -----------ALNIDPQQXXXXXXXXXXXGAMPTGLHQPISQALYGGTGILDGGPSRGA 1554
                       A N+D  Q           G  P  +  PI Q+ Y G G   G P RG 
Sbjct: 118  SAQAGSNLFQMAGNMDSAQGSAEQWQVNNGGFGPGPVRPPIGQSSYRGGGRRGGRPFRGG 177

Query: 1553 QR--------QFGPKPNVXXXXXXXXXXXXXXGKH--ASVATQVSLPVA----------- 1437
             R        QFGP                    H  AS     S+P A           
Sbjct: 178  GRSFRGGGRGQFGPHGFGPDGSGRGQGGGRYFPPHNAASTPNLGSVPTAEGPGALIPGEA 237

Query: 1436 ------------------DEG---------HSAPVWPPPQMAWCELCRVDCNTLEILEQX 1338
                              D G         H  P W  P MAWCELCRVDCNTLEILEQ 
Sbjct: 238  SQLQGKTPQAFMQPLSGSDPGQAQFPAMAQHGNPFWRSPCMAWCELCRVDCNTLEILEQH 297

Query: 1337 XXXXXXXXXXKVYEELQRLNKDLTGGQNEQPLTFELKPEGSSQPVQSEGDGTKQPPQENL 1158
                       VY+ELQ LNK +TG QNEQ    + KP    Q +QSE  G  +  Q   
Sbjct: 298  KNGKRHKKNLLVYQELQNLNKLITGVQNEQMPISDFKP----QLIQSERVGGSEDXQ--- 350

Query: 1157 PSQAVG----EENRVSTXXXXXXXXXXXXXLAQNLRMNHYEXXXXXXXXXXXXXXXXXXX 990
            PSQ  G    E+ + +                +  RM+H++                   
Sbjct: 351  PSQGTGANGTEKEQQTEAEKSEVSAQPTEEQERKARMDHFQ---------APGRGLKRKM 401

Query: 989  XXXNDGSR-RPIEPPMPKGFVPLICELCNVKCESVITFQGHLVGKKHQSNAKRFQGHQDI 813
                 G R R  EPP PK  +PLICELCNVKCES + F  HL GKKH SN KRF G+Q I
Sbjct: 402  RGGRGGKRMRQFEPPKPKEMIPLICELCNVKCESQVVFDSHLAGKKHHSNLKRFHGYQAI 461

Query: 812  IGXXXXXXXXXXXXXXXXXXQVLCQQNPNA-STSLAPQIIPQGFPGPEGNFAQPGHSVWT 636
            I                   Q L   NPNA S    PQ+  QG  G +G  AQP      
Sbjct: 462  IA---------------GALQALIPSNPNAPSNFFIPQVHQQGVSGSQGLPAQP-MPXMQ 505

Query: 635  QGQASTVGPVALTAMPPPL---AMETQDPQ---TINLQGLASETGTQNAATEEASSNVQS 474
            QGQA  + P  L + P P    A+ETQD +   T+  Q   SE G QN  T EA+S +Q 
Sbjct: 506  QGQAPGMAP-GLASEPEPAPVSALETQDKEGTKTVESQA-TSEAGGQNTVTAEANSQLQP 563

Query: 473  DINRSSDSSVVVENVATGSE 414
                S   S V  N    SE
Sbjct: 564  XXIASEAXSGVSTNTTIVSE 583


>ref|XP_008232574.1| PREDICTED: uncharacterized protein LOC103331705 [Prunus mume]
          Length = 686

 Score =  184 bits (466), Expect = 3e-43
 Identities = 166/508 (32%), Positives = 199/508 (39%), Gaps = 63/508 (12%)
 Frame = -1

Query: 1811 QHQQQGEPT---PPGVTVPQ--PITQQAGEQAQSSYYYPHAVALNIDPQQXXXXXXXXXX 1647
            QHQ   EPT   PPGV +P   P +   G     + YY H V    D QQ          
Sbjct: 89   QHQFHQEPTSIHPPGVPIPPEPPHSADPGHTHLQNAYYAHGVVE--DQQQQQMNSGSGGL 146

Query: 1646 XGAMPTGLHQ-----------------PISQALYGGTGILDGGPSRGAQR----QFGPKP 1530
              A    L Q                 PI Q  Y G G     P RG  R      G +P
Sbjct: 147  MPAAVAALSQLTQLSANMDAAQRATQPPIGQTPYRGGGRRGNRPFRGGGRGHFGYHGSRP 206

Query: 1529 NVXXXXXXXXXXXXXXGKH-----------------------------ASVATQVSLPVA 1437
            +               G+H                             A V  Q  LPV 
Sbjct: 207  DGSAHPFRGRGRGQGGGRHFPQYGAASNNLNSASVPAEGVAALMQPPSALVPGQAPLPVP 266

Query: 1436 DEGHSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXXXXXKVYEELQRLNKDLTGGQ 1257
             +  S   W PP+MAWCELCRVDCNT EILEQ           +VYEELQ+LNK  T  Q
Sbjct: 267  TQVSSTSFWRPPRMAWCELCRVDCNTPEILEQHKNGKRHKKYMQVYEELQKLNKVKTEQQ 326

Query: 1256 NEQPLTFELKPEGSSQPVQSEGDGTKQPPQENLPSQAVGEENRVSTXXXXXXXXXXXXXL 1077
            N Q    ELK E   QPV+ EG   KQP QENL S+ V + NR  T              
Sbjct: 327  NAQMPNTELKTE-VGQPVKVEGFEEKQPLQENLTSEVVTDNNRNETDQKDTGANSEASAG 385

Query: 1076 AQNLRMNHYEXXXXXXXXXXXXXXXXXXXXXXNDGSRRPIEPPMPKGFVPLICELCNVKC 897
              N   +H+                       N+GSRRP+EPP PK  +P ICELCN+KC
Sbjct: 386  PGNKSGDHF-AARGRGFKRRMRGGRGGKYMRTNEGSRRPVEPPKPKQVIPFICELCNIKC 444

Query: 896  ESVITFQGHLVGKKHQSNAKRFQGHQDIIGXXXXXXXXXXXXXXXXXXQVLCQQNPN-AS 720
            ES + F  HL GKKH +  KRF GH+ + G                  Q L   N N AS
Sbjct: 445  ESQVVFDSHLSGKKHLATLKRFHGHRALYG--------------EVGLQALYPSNFNAAS 490

Query: 719  TSLAP----QIIPQGFPGPEGNFAQPGHS---VWTQGQASTVGPVALTAMPPPLAMETQD 561
            TS  P      + QG   P+   AQ   +     TQ Q S+  P   +A   P+    Q 
Sbjct: 491  TSATPTSAAPTVQQGDNDPQALLAQLLMTYVLTQTQAQGSSPAPAPASA---PVGTHNQL 547

Query: 560  PQTINLQGLASETGTQNAATEEASSNVQ 477
                 LQ +  + G+QNA   E    +Q
Sbjct: 548  ELIQGLQAMCQD-GSQNAVILELKRQLQ 574


>ref|XP_010112019.1| hypothetical protein L484_001626 [Morus notabilis]
            gi|587946039|gb|EXC32400.1| hypothetical protein
            L484_001626 [Morus notabilis]
          Length = 636

 Score =  182 bits (462), Expect = 1e-42
 Identities = 162/513 (31%), Positives = 201/513 (39%), Gaps = 68/513 (13%)
 Frame = -1

Query: 1808 HQQQGEPT---PPGVTVPQPITQQAGEQ-AQSSYYYPHAVA-------------LNIDPQ 1680
            HQ   EPT   PPGV +P P  +Q   Q  Q+ YYYPH  A               ++P 
Sbjct: 69   HQFPHEPTSIHPPGVPIPPPQPEQTDPQNQQNGYYYPHGAAESLPRSGSNSGLNFGLNPS 128

Query: 1679 QXXXXXXXXXXXGAMPTG--------LHQPISQALYGGTGILDGGPSRGAQRQFG---PK 1533
                         A   G        LH PI      G G     P RG +  FG   P+
Sbjct: 129  AAAAAAVAAISQLAQFPGNVDAAQNSLHVPIGYTPSRG-GRRGNRPFRGGRGHFGRHGPR 187

Query: 1532 PNVXXXXXXXXXXXXXXGKH-----------------------------ASVATQVSLPV 1440
            P+               G+H                              SV  Q  LPV
Sbjct: 188  PDGSAPSSRGRGRGQGGGRHFASHGAVLTNPNSASVPAEGEAAFVQQPSVSVPGQAPLPV 247

Query: 1439 ADEGHSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXXXXXKVYEELQRLNKDLTGG 1260
              +  +AP W PP MAWCELCRVDCNTLEILE+           +V+EELQ+LNK  TG 
Sbjct: 248  PAQVPAAPFWQPPHMAWCELCRVDCNTLEILEKHKNGKRHKKNLQVFEELQKLNKVTTGQ 307

Query: 1259 QNEQPLTFELKPEGSSQPVQSEGDGTKQPPQENLPSQAVGEENRVSTXXXXXXXXXXXXX 1080
            QN Q    ELK +      Q E     QP  E        + +  +              
Sbjct: 308  QNAQMPNTELKTDVG----QPEKVDANQPLSETTSQVNTNDYSNETDQQCAVGGTSEASA 363

Query: 1079 LAQNLRMNHYEXXXXXXXXXXXXXXXXXXXXXXNDGSRRPIEPPMPKGFVPLICELCNVK 900
              + ++ + +                        +GSRRP++PP PK  +PLICELCNVK
Sbjct: 364  EPEKIQQDQFPARAHGSKRKMRGGRGGKYLRGN-EGSRRPVKPPKPKQMIPLICELCNVK 422

Query: 899  CESVITFQGHLVGKKHQSNAKRFQGHQDIIGXXXXXXXXXXXXXXXXXXQVLCQQNPNA- 723
            CES + F  HL GKKHQSN KRF GH+ + G                  Q L   N NA 
Sbjct: 423  CESQVVFDSHLTGKKHQSNLKRFHGHRAMYG--------------EAGVQALYPPNFNAP 468

Query: 722  STSLAPQIIPQGFPGPEGNFAQPGHSVWTQGQASTVGPVALTAMP--------PPLAMET 567
            STSLAPQ + Q    P+   AQ     +   QA   G + +T  P        P  +  T
Sbjct: 469  STSLAPQ-VQQVVNDPQVLLAQL-LMTYVLSQAQAPGTLGVTPAPGTVGVTPAPGSSSGT 526

Query: 566  QDPQTINLQG--LASETGTQNAATEEASSNVQS 474
            Q+      QG  L  E G QNA T    S +QS
Sbjct: 527  QNQPISQTQGTELTLEGGNQNAPTAVTKSELQS 559


>ref|XP_007219291.1| hypothetical protein PRUPE_ppa018574mg [Prunus persica]
            gi|462415753|gb|EMJ20490.1| hypothetical protein
            PRUPE_ppa018574mg [Prunus persica]
          Length = 675

 Score =  176 bits (445), Expect = 9e-41
 Identities = 163/510 (31%), Positives = 199/510 (39%), Gaps = 65/510 (12%)
 Frame = -1

Query: 1811 QHQQQGEPT---PPGVTVPQ--PITQQAGEQAQSSYYYPHAVALNIDPQQXXXXXXXXXX 1647
            QHQ   EPT   PPGV +P   P     G+    + YY H V    D QQ          
Sbjct: 89   QHQFHQEPTSIHPPGVPIPPEPPHNADPGQTHLQNAYYAHGVVE--DQQQQQMNSVSGGL 146

Query: 1646 XGAMPTGLHQ-----------------PISQALYGGTGILDGGPSRGAQR----QFGPKP 1530
              A    L Q                 PI Q  Y G G     P RG  R      G +P
Sbjct: 147  IPAAVAALSQLTQLSANMDAAQRATQPPIGQTPYRGGGRRGNRPFRGGGRGHFGYHGSRP 206

Query: 1529 NVXXXXXXXXXXXXXXGKH-----------------------------ASVATQVSLPVA 1437
            +               G+H                             A V  Q  LPV 
Sbjct: 207  DGSAHPFRGRGRGQGGGRHFPQYGAASNNLNSASVPAEGVAALMQPPSALVPGQAPLPVP 266

Query: 1436 DEGHSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXXXXXKVYEELQRLNKDLTGGQ 1257
             +  S   W PP+MAWCELCRVDCNT EILEQ           +VYEELQ+LNK  T  Q
Sbjct: 267  TQVSSTSFWRPPRMAWCELCRVDCNTPEILEQHKNGKRHKKYMQVYEELQKLNKVKTEQQ 326

Query: 1256 NEQPLTFELKPEGSSQPVQSEGDGTKQPPQENLPSQAVGEENRVSTXXXXXXXXXXXXXL 1077
            N Q    ELKPE   QPV+ EG   KQP QENL S+ + + NR  T              
Sbjct: 327  NAQMPNTELKPE-VGQPVKVEGFEEKQPLQENLTSEVITDNNRNETDQKDTGANSEASAG 385

Query: 1076 AQNLRMNHYEXXXXXXXXXXXXXXXXXXXXXXNDGSRRPIEPPMPKGFVPLICELCNVKC 897
              N   +H+                            R +EPP PK  +P ICELCN+KC
Sbjct: 386  PGNKSGDHFAARGRGFKRRMR--------------GGRGVEPPKPKQVIPFICELCNIKC 431

Query: 896  ESVITFQGHLVGKKHQSNAKRFQGHQDIIGXXXXXXXXXXXXXXXXXXQVLCQQNPN-AS 720
            ES + F  HL GKKH +  KRF GH+ + G                  Q L   N N AS
Sbjct: 432  ESQVVFDSHLSGKKHLATLKRFHGHRALYG--------------EVGLQALYPSNFNAAS 477

Query: 719  TSLAP----QIIPQGFPGPEGNFAQPGHS---VWTQGQASTVGPVALTAMPPPLAMETQD 561
            TS  P      + QG   P+   AQ   +     TQ Q S+  P   +A+ P   + T +
Sbjct: 478  TSATPTSAAPTVQQGDNDPQALLAQLLMTYVLTQTQAQGSSPAPAPASAVAP---VGTHN 534

Query: 560  PQTINLQGLAS--ETGTQNAATEEASSNVQ 477
             Q   +QGL +  + G+QNA   E    +Q
Sbjct: 535  -QLELIQGLQTMCQDGSQNAVILELKRQLQ 563


>ref|XP_006347909.1| PREDICTED: chromatin modification-related protein EAF1-like isoform
            X3 [Solanum tuberosum]
          Length = 493

 Score =  175 bits (444), Expect = 1e-40
 Identities = 141/413 (34%), Positives = 178/413 (43%), Gaps = 49/413 (11%)
 Frame = -1

Query: 1460 TQVSLPVADEGHSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXXXXXKVYEELQRL 1281
            +QV    +  G+SAPV  PP+MAWCELCR++CNT E+LEQ           K YEE Q+L
Sbjct: 68   SQVKPSSSASGNSAPV-RPPRMAWCELCRIECNTPEVLEQHKNGKKHKKNLKAYEERQKL 126

Query: 1280 NKDLTGGQNEQPLTFELKPEGSSQP-VQSEGD------------GTKQP-PQENLPSQAV 1143
            NK + G    Q    E+KP  S Q  V+  G             G +QP PQE LP QAV
Sbjct: 127  NKQMDGAHGNQTTNSEVKPRVSYQSAVEGSGQLPLGNLPSETVTGDRQPLPQEKLPPQAV 186

Query: 1142 GEEN-------RVSTXXXXXXXXXXXXXLAQNLRMNHYEXXXXXXXXXXXXXXXXXXXXX 984
             EE+       +V               + +NLR                          
Sbjct: 187  IEEDVGITGKQKVEETDPVDHIQGQGRGVKRNLRGGR-----------------GGKLMK 229

Query: 983  XNDGSRRPIEPPMPKGFVPLICELCNVKCESVITFQGHLVGKKHQSNAKRFQGHQDIIGX 804
              +GSR+P   P P+  VPLICELCNVKCESV+ FQ HL G+KH SN K FQG Q ++G 
Sbjct: 230  TQNGSRKPAVAPKPQKMVPLICELCNVKCESVVVFQSHLAGRKHLSNVKDFQGQQAMVGQ 289

Query: 803  XXXXXXXXXXXXXXXXXQVLCQQNPNASTSLAPQIIPQGFPGPEGNFAQPGHSVWTQ--- 633
                             Q LCQ +  ASTS+APQ          G   Q   S   Q   
Sbjct: 290  AALQALYPALQALYPALQALCQPSSGASTSVAPQGHQHNLHEILGILTQQALSAIPQDQL 349

Query: 632  ---GQAST----------VGPVALTAMPPPLAMETQDPQTINLQGLASETGTQNAATEEA 492
               G A+T          +G    +A PPP  +  QD Q   LQG  SE   +NAA E+ 
Sbjct: 350  LGIGAAATSAIPQDQLLGIGAAVASAFPPPSDLLAQDHQGSKLQGSVSEETRENAAAEDG 409

Query: 491  ------------SSNVQSDINRSSDSSVVVENVATGSELVLSGTRTD*VSSMS 369
                        S   +S  N+  + ++ VE  A   E  L    +  VSS S
Sbjct: 410  RNCDLSVLPSTESKPEESTDNKHENVNLEVERKAMSVEEPLRFGTSGDVSSTS 462


>ref|XP_009611448.1| PREDICTED: glutenin, high molecular weight subunit 12-like [Nicotiana
            tomentosiformis]
          Length = 570

 Score =  175 bits (443), Expect = 2e-40
 Identities = 153/468 (32%), Positives = 192/468 (41%), Gaps = 29/468 (6%)
 Frame = -1

Query: 1811 QHQQQGEPT---PPGVTVPQPITQ------QAGEQAQSSY-YYPHAVALNIDPQQXXXXX 1662
            Q + + EPT   PPGV +P P +       Q  +Q Q  Y YYP         QQ     
Sbjct: 97   QPEPEPEPTSIHPPGVPIPPPSSSDPYASTQQQQQPQPQYSYYPQ--------QQQQGIN 148

Query: 1661 XXXXXXGAMPTGLHQP-ISQALYGGTGILDGGPSRGAQRQFGPKPNVXXXXXXXXXXXXX 1485
                   +MP   + P I Q+ Y G G   G   RG     G   ++             
Sbjct: 149  YGEVATVSMPIVSYTPAIVQSPYRGRGKRGGRSYRG-----GAHGHLGGGQLQPSYDQPN 203

Query: 1484 XGKHASVA----TQVSLPVADEGHSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXX 1317
              + AS      +QV    +  G+S  V PPP+MAWCELCRV+CNT E+LEQ        
Sbjct: 204  SFQGASQIQGDPSQVKPSSSASGNSGSVQPPPRMAWCELCRVECNTTEVLEQHKNG---- 259

Query: 1316 XXXKVYEELQRLNKDLTGGQNEQPLTFELKPEGSSQP-VQSEGD------------GTKQ 1176
                   +  + N    GGQ  Q   F+ KP    QP V   G             G +Q
Sbjct: 260  -------KKHKKNLKANGGQGHQTANFDFKPNVYYQPEVGGAGQLPQGHLPSEVVTGNRQ 312

Query: 1175 P-PQENLPSQAVGEENRVSTXXXXXXXXXXXXXLAQNLRMNHYEXXXXXXXXXXXXXXXX 999
              PQEN PS A  E+  + T             LA++L M+  +                
Sbjct: 313  SQPQENFPSHAAIEDGSI-TGKQKVEEAALVEELARDLGMDRVQ-GQGRGLKRMLKGGRD 370

Query: 998  XXXXXXNDGSRRPIEPPMPKGFVPLICELCNVKCESVITFQGHLVGKKHQSNAKRFQGHQ 819
                  ++GSR+P+ PP PK  VPLICELCNV CESV+ FQ HL GKKH SN K FQG Q
Sbjct: 371  GKLMKLHNGSRKPVVPPKPKKMVPLICELCNVTCESVVVFQSHLAGKKHLSNVKDFQGQQ 430

Query: 818  DIIGXXXXXXXXXXXXXXXXXXQVLCQQNPNASTSLAPQIIPQGFPGPEGNFAQPGHSVW 639
             ++G                  Q L Q N  ASTS APQ          G   Q   S  
Sbjct: 431  AMVGQAALQTLYPALQALYPALQALYQPNTGASTSTAPQGHQPNLLEILGMLTQQALSAI 490

Query: 638  TQGQASTVGPVALTAMPPPLAMETQDPQTINLQGLASETGTQNAATEE 495
             Q Q   +G      +PP   +E QD Q   LQG  SE   +N A E+
Sbjct: 491  PQDQVLGIG------LPPSSDLEAQDHQGSILQGSVSEETRENVAAED 532


>ref|XP_006490851.1| PREDICTED: uncharacterized protein LOC102607609 [Citrus sinensis]
          Length = 585

 Score =  162 bits (410), Expect = 1e-36
 Identities = 155/543 (28%), Positives = 210/543 (38%), Gaps = 31/543 (5%)
 Frame = -1

Query: 1808 HQQQGEPTPPGVTVPQPITQQAGEQAQSSYYYPHAVALNIDPQQXXXXXXXXXXXGAMPT 1629
            HQ+     PPGV +P   T         + ++P  + + I+  Q            A   
Sbjct: 59   HQESTSTHPPGVPIPPDPTHFPNHH---NAHFPLGIGIGIE--QPSNLTLFSGTAAAAQR 113

Query: 1628 GLHQPISQALYGGTGILDGGPSRGAQRQFGPKPNVXXXXXXXXXXXXXXGKHASVATQVS 1449
             +     Q+   G G   G P R   R  G                      A+V  Q S
Sbjct: 114  SVRPQFGQSACRGGGRKGGKPFRRGGRLVG--------RGRGHGAISNSTPSAAVPGQTS 165

Query: 1448 LPVADEGHSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXXXXXKVYEELQRLNKDL 1269
              +  +   AP  PPP MAWCELCRVDCNTLEILEQ           + + +LQ LNK +
Sbjct: 166  SSIPGQVPGAPTRPPPPMAWCELCRVDCNTLEILEQHKNGKRHKRNLRTHADLQNLNKCI 225

Query: 1268 TGGQNEQPLTFELKPEGSSQPVQSEGDGTKQPPQENLPSQAVGEENRVSTXXXXXXXXXX 1089
             G QN Q      +PE  SQP + E    KQP  E+LPSQ +      S           
Sbjct: 226  AGQQNIQMPNSGSQPE-VSQPEKVEECREKQP-LESLPSQTL--LGNASNETEMQKNTVD 281

Query: 1088 XXXLAQNLRMNHYEXXXXXXXXXXXXXXXXXXXXXXNDGSRRPIEPPMPKGFVPLICELC 909
                 Q    +  +                        G RRPIEPP PKG +PLICELC
Sbjct: 282  SVKEPQRKSRDQPDSRGCGSKRKMRGGRGGKYMRTNEGGPRRPIEPPKPKGVIPLICELC 341

Query: 908  NVKCESVITFQGHLVGKKHQSNAKRFQGHQDIIGXXXXXXXXXXXXXXXXXXQVLCQQNP 729
            NVKCES + F  HLVGKKH +N KRF GH+ + G                    L    P
Sbjct: 342  NVKCESQVVFDSHLVGKKHLANVKRFHGHRALYG-----------------EAALQSLYP 384

Query: 728  NASTSLAPQII---PQGFPGPEGNFAQPGHSVWTQGQASTVGPVALT------------- 597
             +  SL+  +I    QG   P+   AQ    V +Q QA    P  L              
Sbjct: 385  ASFNSLSSSVITQVQQGVNDPQVVLAQLLTYVLSQAQAQAQAPGLLAEQLRGLAAQIPGL 444

Query: 596  ------AMPPPLAMETQDPQTINLQG--LASETGTQNAATEEASSNVQS-DINRSSDSSV 444
                  A  P  + ETQ       Q     +E G++N    EA    QS   +  S  +V
Sbjct: 445  VGMVAPAPAPGSSQETQYQHDFRTQRSMATTEEGSKNTVMVEAEDQQQSIATDLESPETV 504

Query: 443  VVE----NVATGSELVLSGTRTD*VSSMSE*ECAVRSGCYMSDSRTGQ--VQSKDDFEIL 282
             +E    N +   +  +  +  + +++ S  +C V SG      + G   V S+++ ++ 
Sbjct: 505  GIETKEKNASLPQDKKIISSLENPINTASASKCEVASGGEAVQQQHGDDLVDSENEQDLE 564

Query: 281  LSN 273
            L N
Sbjct: 565  LHN 567


>gb|KDO85697.1| hypothetical protein CISIN_1g007901mg [Citrus sinensis]
          Length = 585

 Score =  160 bits (406), Expect = 3e-36
 Identities = 157/545 (28%), Positives = 210/545 (38%), Gaps = 33/545 (6%)
 Frame = -1

Query: 1808 HQQQGEPTPPGVTVPQPITQQAGEQAQSSYYYPHAVALNIDPQQXXXXXXXXXXXGAMPT 1629
            HQ+     PPGV +P   T         + ++P  + + I+  Q            A   
Sbjct: 59   HQESTSTHPPGVPIPPDPTHFPNHH---NAHFPLGIGIGIE--QPSNLTLFSGTAAAAQR 113

Query: 1628 GLHQPISQALYGGTGILDGGPSRGAQRQFGPKPNVXXXXXXXXXXXXXXGKHASVATQVS 1449
             +     Q+   G G   G P R   R  G                      A+V  Q S
Sbjct: 114  SVRPQFGQSACRGGGRKGGKPFRRGGRLVG--------RGRGHGAISNSTPSAAVPGQTS 165

Query: 1448 LPVADEGHSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXXXXXKVYEELQRLNKDL 1269
              +  +   AP  PPP MAWCELCRVDCNTLEILEQ           + + +LQ LNK +
Sbjct: 166  SSIPGQVPGAPTRPPPPMAWCELCRVDCNTLEILEQHKNGKRHKRNLRTHADLQNLNKCI 225

Query: 1268 TGGQNEQPLTFELKPEGSSQPVQSEGDGTKQPPQENLPSQAVGEENRVSTXXXXXXXXXX 1089
             G QN Q      +PE  SQP + E    KQP  E+LPSQ +      S           
Sbjct: 226  AGQQNIQMPNSGSQPE-VSQPEKVEECREKQP-LESLPSQTL--LGNASNETEMQKNTVD 281

Query: 1088 XXXLAQNLRMNHYEXXXXXXXXXXXXXXXXXXXXXXNDGSRRPIEPPMPKGFVPLICELC 909
                 Q    +  +                        G RRPIEPP PKG +PLICELC
Sbjct: 282  SVKEPQRKSRDQPDSRGCGSKRKMRGGRGGKYMRTNEGGPRRPIEPPKPKGVIPLICELC 341

Query: 908  NVKCESVITFQGHLVGKKHQSNAKRFQGHQDIIGXXXXXXXXXXXXXXXXXXQVLCQQNP 729
            NVKCES + F  HLVGKKH +N KRF GH+ + G                    L    P
Sbjct: 342  NVKCESQVVFDSHLVGKKHLANVKRFHGHRALYG-----------------EAALQSLYP 384

Query: 728  NASTSLAPQII---PQGFPGPEGNFAQPGHSVWTQGQASTVGPVALT------------- 597
             +  SL+  +I    QG   P+   AQ    V +Q QA    P  L              
Sbjct: 385  ASFNSLSSSVITQVQQGVNDPQVVLAQLLTYVLSQAQAQAQAPGLLAEQLRGLAAQIPGL 444

Query: 596  ------AMPPPLAMETQDPQTINLQG--LASETGTQNAATEEASSNVQS-DINRSSDSSV 444
                  A  P  + ETQ       Q     +E G++N    EA    QS   +  S  +V
Sbjct: 445  VGMVAPAPAPGSSQETQYQHDFRTQRSMATTEEGSKNTVMVEAEDQQQSIATDLESPETV 504

Query: 443  VVE------NVATGSELVLSGTRTD*VSSMSE*ECAVRSGCYMSDSRTGQ--VQSKDDFE 288
             +E      ++    +++ S    D  +S S  +C V SG      + G   V S+++ +
Sbjct: 505  GIETKEKNASLPQDKKIISSLENPDNTASAS--KCEVASGGEAVQQQHGDDLVDSENEQD 562

Query: 287  ILLSN 273
            + L N
Sbjct: 563  LELHN 567


>ref|XP_006445326.1| hypothetical protein CICLE_v10023836mg, partial [Citrus clementina]
            gi|557547588|gb|ESR58566.1| hypothetical protein
            CICLE_v10023836mg, partial [Citrus clementina]
          Length = 611

 Score =  155 bits (393), Expect = 1e-34
 Identities = 130/406 (32%), Positives = 169/406 (41%), Gaps = 29/406 (7%)
 Frame = -1

Query: 1472 ASVATQVSLPVADEGHSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXXXXXKVYEE 1293
            A+V  Q S  +  +   AP  PPP MAWCELCRVDCNTLEILEQ           + + +
Sbjct: 40   AAVPGQTSSSIPGQVPGAPTRPPPPMAWCELCRVDCNTLEILEQHKNGKRHKRNLRTHAD 99

Query: 1292 LQRLNKDLTGGQNEQPLTFELKPEGSSQPVQSEGDGTKQPPQENLPSQAVGEENRVSTXX 1113
            LQ LNK + G QN Q      +PE  SQP + E    KQP  E+LPSQ +      S   
Sbjct: 100  LQNLNKCIAGQQNIQMPNSGSQPE-VSQPEKVEECREKQP-LESLPSQTL--LGNASNET 155

Query: 1112 XXXXXXXXXXXLAQNLRMNHYEXXXXXXXXXXXXXXXXXXXXXXNDGSRRPIEPPMPKGF 933
                         Q    +  +                        G RRPIEPP PKG 
Sbjct: 156  EMQKNTVDSVKEPQRKSRDQPDSRGCGSKRKMRGGRGGKYMRTNEGGPRRPIEPPKPKGV 215

Query: 932  VPLICELCNVKCESVITFQGHLVGKKHQSNAKRFQGHQDIIGXXXXXXXXXXXXXXXXXX 753
            +PLICELCNVKCES + F  HLVGKKH +N KRF GH+ + G                  
Sbjct: 216  IPLICELCNVKCESQVVFDSHLVGKKHLANVKRFHGHRALYG-----------------E 258

Query: 752  QVLCQQNPNASTSLAPQII---PQGFPGPEGNFAQPGHSVWTQGQASTVGPVALT----- 597
              L    P +  SL+  +I    QG   P+   AQ    V +Q QA    P  L      
Sbjct: 259  AALQSLYPASFNSLSSSVITQVQQGVNDPQVVLAQLLTYVLSQAQAQAQAPGLLAEQLRG 318

Query: 596  --------------AMPPPLAMETQDPQTINLQG--LASETGTQNAATEEASSNVQS-DI 468
                          A  P  + ETQ       Q     +E G++N    EA    QS   
Sbjct: 319  LAAQIPGLVGMVAPAPAPGSSQETQYQHDFRTQRSMATTEEGSKNTVMVEAEDQQQSIAT 378

Query: 467  NRSSDSSVVVE----NVATGSELVLSGTRTD*VSSMSE*ECAVRSG 342
            +  S  +V +E    N +   +  +  +  + +++ S  +C V SG
Sbjct: 379  DLESPETVGIETKEKNASLPQDKKIISSLENPINTASASKCEVASG 424


>gb|KDO85698.1| hypothetical protein CISIN_1g007901mg [Citrus sinensis]
          Length = 470

 Score =  155 bits (392), Expect = 1e-34
 Identities = 137/433 (31%), Positives = 180/433 (41%), Gaps = 33/433 (7%)
 Frame = -1

Query: 1472 ASVATQVSLPVADEGHSAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXXXXXKVYEE 1293
            A+V  Q S  +  +   AP  PPP MAWCELCRVDCNTLEILEQ           + + +
Sbjct: 43   AAVPGQTSSSIPGQVPGAPTRPPPPMAWCELCRVDCNTLEILEQHKNGKRHKRNLRTHAD 102

Query: 1292 LQRLNKDLTGGQNEQPLTFELKPEGSSQPVQSEGDGTKQPPQENLPSQAVGEENRVSTXX 1113
            LQ LNK + G QN Q      +PE  SQP + E    KQP  E+LPSQ +      S   
Sbjct: 103  LQNLNKCIAGQQNIQMPNSGSQPE-VSQPEKVEECREKQP-LESLPSQTL--LGNASNET 158

Query: 1112 XXXXXXXXXXXLAQNLRMNHYEXXXXXXXXXXXXXXXXXXXXXXNDGSRRPIEPPMPKGF 933
                         Q    +  +                        G RRPIEPP PKG 
Sbjct: 159  EMQKNTVDSVKEPQRKSRDQPDSRGCGSKRKMRGGRGGKYMRTNEGGPRRPIEPPKPKGV 218

Query: 932  VPLICELCNVKCESVITFQGHLVGKKHQSNAKRFQGHQDIIGXXXXXXXXXXXXXXXXXX 753
            +PLICELCNVKCES + F  HLVGKKH +N KRF GH+ + G                  
Sbjct: 219  IPLICELCNVKCESQVVFDSHLVGKKHLANVKRFHGHRALYG-----------------E 261

Query: 752  QVLCQQNPNASTSLAPQII---PQGFPGPEGNFAQPGHSVWTQGQASTVGPVALT----- 597
              L    P +  SL+  +I    QG   P+   AQ    V +Q QA    P  L      
Sbjct: 262  AALQSLYPASFNSLSSSVITQVQQGVNDPQVVLAQLLTYVLSQAQAQAQAPGLLAEQLRG 321

Query: 596  --------------AMPPPLAMETQDPQTINLQG--LASETGTQNAATEEASSNVQS-DI 468
                          A  P  + ETQ       Q     +E G++N    EA    QS   
Sbjct: 322  LAAQIPGLVGMVAPAPAPGSSQETQYQHDFRTQRSMATTEEGSKNTVMVEAEDQQQSIAT 381

Query: 467  NRSSDSSVVVE------NVATGSELVLSGTRTD*VSSMSE*ECAVRSGCYMSDSRTGQ-- 312
            +  S  +V +E      ++    +++ S    D  +S S  +C V SG      + G   
Sbjct: 382  DLESPETVGIETKEKNASLPQDKKIISSLENPDNTASAS--KCEVASGGEAVQQQHGDDL 439

Query: 311  VQSKDDFEILLSN 273
            V S+++ ++ L N
Sbjct: 440  VDSENEQDLELHN 452


>ref|XP_004308473.2| PREDICTED: uncharacterized protein LOC101306183 [Fragaria vesca
            subsp. vesca]
          Length = 725

 Score =  152 bits (384), Expect = 1e-33
 Identities = 139/450 (30%), Positives = 176/450 (39%), Gaps = 46/450 (10%)
 Frame = -1

Query: 1793 EPT---PPGVTVPQPITQQAGEQAQSSYYYPHAVALNI-DPQQXXXXXXXXXXXGAMPTG 1626
            EPT   PPGV +P P ++    Q   + YY H V  N   P              A+   
Sbjct: 113  EPTSIHPPGVPIPAPDSETTHLQ---NAYYGHGVVENQHQPIDSGSGSVQAPANVAVAQV 169

Query: 1625 LHQP-ISQALYGGTGILDGGPSRGAQR----QFGPKPNVXXXXXXXXXXXXXXGKH---- 1473
              QP I Q  Y G G     P RG  R      G  P+               G+H    
Sbjct: 170  ASQPQIGQTPYRGGGRKGNRPFRGGGRGHLGYHGHGPDGSAPPIHGRGRGQGCGRHFQLY 229

Query: 1472 -------------------------ASVATQVSLPVADEGHSAPVWPPPQMAWCELCRVD 1368
                                     A V     LPV  +  SA  W  P+MAWCELCRVD
Sbjct: 230  GAASSNPNPASVPAEGVAALKQPPSALVPLHAPLPVTAQVSSASSWRLPRMAWCELCRVD 289

Query: 1367 CNTLEILEQXXXXXXXXXXXKVYEELQRLNKDLTGGQNEQPLTFELKPEGSSQPVQSEGD 1188
            CNTLE LEQ           +V+EELQ+ NK  T  +N Q    +LKPE   Q  + EG 
Sbjct: 290  CNTLETLEQHKNGKRHKKILQVHEELQKRNKVNTEQKNAQMPNIDLKPE-VGQTEKVEGS 348

Query: 1187 GTKQPPQENLPSQAVGEENRVSTXXXXXXXXXXXXXLAQNLRMNHYEXXXXXXXXXXXXX 1008
              K+P +  L S+ + ++NR  T               +N   +H+              
Sbjct: 349  EEKRPSEGTLTSEVITDDNRNETDRRGMVGNSEASEEPENKSRDHF-AARGRGFKRRMRG 407

Query: 1007 XXXXXXXXXNDGSRRPIEPPMPKGFVPLICELCNVKCESVITFQGHLVGKKHQSNAKRFQ 828
                      +GSRR +EPP PK   PLICELCNVKCES + F  HL GKKH +  KRFQ
Sbjct: 408  GRGGKYMRTYEGSRRLVEPPKPK-VNPLICELCNVKCESQVVFDSHLSGKKHLATLKRFQ 466

Query: 827  GHQDIIGXXXXXXXXXXXXXXXXXXQVLCQQNPN---ASTSLAP---QIIP--QGFPGPE 672
            GH+ + G                   +     PN   ASTS+ P   Q++P  +  P P 
Sbjct: 467  GHRALYG----------------EQGLQALYPPNLTAASTSVTPPVDQVLPPVEQVPPPH 510

Query: 671  GNFAQPGHSVWTQGQASTVGPVALTAMPPP 582
                 P H       A  V P  +  + PP
Sbjct: 511  VQQVTPLHVQQGPPHAQQVPPPHVQQVAPP 540


>ref|XP_007052171.1| Uncharacterized protein TCM_005599 [Theobroma cacao]
            gi|508704432|gb|EOX96328.1| Uncharacterized protein
            TCM_005599 [Theobroma cacao]
          Length = 731

 Score =  149 bits (375), Expect = 1e-32
 Identities = 119/341 (34%), Positives = 153/341 (44%), Gaps = 16/341 (4%)
 Frame = -1

Query: 1424 SAPVWPPPQMAWCELCRVDCNTLEILEQXXXXXXXXXXXKVYEELQRLNKDLTGGQNEQP 1245
            +AP+WPPP+MAWCELCRVDCN  EILEQ           +V+EELQ+LNK +TG Q+ Q 
Sbjct: 291  AAPLWPPPRMAWCELCRVDCNRPEILEQHKNGKRHKKNLQVHEELQKLNKVITGQQSVQV 350

Query: 1244 LTFELKPEGSSQPVQSEG-DGTK-QPPQENLPSQAVGEENRVSTXXXXXXXXXXXXXLAQ 1071
                  P   S+ VQ E  +G++ Q  QE  PS AV  +++  T                
Sbjct: 351  ------PNSGSEAVQLEKVEGSEGQHQQETSPSLAVTNDSKKETEQQKDIVNNSEASTTD 404

Query: 1070 NLR----MNHYEXXXXXXXXXXXXXXXXXXXXXXNDGSRRPIEPPMPKGFVPLICELCNV 903
            + +    +                          N+  RRP+EPP PKG +P +CELCNV
Sbjct: 405  SAKAKRKLGDASEARGRGFKRKMRGGRGGKYMKGNERPRRPVEPPKPKGGIPFMCELCNV 464

Query: 902  KCESVITFQGHLVGKKHQSNAKRFQGHQDIIGXXXXXXXXXXXXXXXXXXQVLCQQNPNA 723
            KCES + F  HL GKKH +N KRF GH+ + G                  Q L   N NA
Sbjct: 465  KCESHVVFNSHLAGKKHIANLKRFHGHRALYG--------------EAGLQALYPPNFNA 510

Query: 722  -STSLAPQIIPQGFPGPEGNFAQPGHSVWTQGQASTVGP------VALTAMP--PPLAME 570
             S S  PQ I QG   P+   AQ    V +Q Q   +         A +A P  P  + E
Sbjct: 511  PSPSFIPQ-IQQGVTDPQVVLAQLLTYVLSQAQVPGLAAPQLPLLAATSAAPCAPLSSSE 569

Query: 569  TQDPQTINLQGLA-SETGTQNAATEEASSNVQSDINRSSDS 450
               P       LA SE     A   EA +  QS + +S  S
Sbjct: 570  NHYPHKFTEGSLATSEVRGGEAVKVEAETWQQSSVEKSEAS 610

