BLASTX nr result

ID: Sinomenium22_contig00000145 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00000145
         (1612 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI16022.3| unnamed protein product [Vitis vinifera]              129   4e-27
emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera]   127   2e-26
ref|XP_007016238.1| Uncharacterized protein isoform 8 [Theobroma...   108   9e-21
ref|XP_007016237.1| Uncharacterized protein isoform 7 [Theobroma...   108   9e-21
ref|XP_007016236.1| Uncharacterized protein isoform 6 [Theobroma...   108   9e-21
ref|XP_007016235.1| Uncharacterized protein isoform 5 [Theobroma...   108   9e-21
ref|XP_007016232.1| Uncharacterized protein isoform 2 [Theobroma...   108   9e-21
ref|XP_007016231.1| Uncharacterized protein isoform 1 [Theobroma...   108   9e-21
ref|XP_002520450.1| hypothetical protein RCOM_0731250 [Ricinus c...    99   7e-18
ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citr...    90   3e-15
ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II tra...    88   1e-14
ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Popu...    88   1e-14
ref|XP_004295721.1| PREDICTED: uncharacterized protein LOC101314...    76   4e-11
ref|XP_007204950.1| hypothetical protein PRUPE_ppa000292mg [Prun...    74   1e-10
ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Popu...    72   5e-10
ref|XP_004153176.1| PREDICTED: uncharacterized protein LOC101214...    68   1e-08
ref|XP_004145323.1| PREDICTED: uncharacterized protein LOC101205...    68   1e-08
ref|XP_007131393.1| hypothetical protein PHAVU_011G009900g [Phas...    66   5e-08
ref|XP_007131392.1| hypothetical protein PHAVU_011G009900g [Phas...    66   5e-08
emb|CDJ43054.1| hypothetical protein, conserved [Eimeria tenella]      62   1e-06

>emb|CBI16022.3| unnamed protein product [Vitis vinifera]
          Length = 1669

 Score =  129 bits (324), Expect = 4e-27
 Identities = 137/451 (30%), Positives = 186/451 (41%), Gaps = 22/451 (4%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGA-PQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVH 390
            VTGH S+PQ  P QQMP G   QQP+                                + 
Sbjct: 509  VTGHHSFPQPRPQQQMPLGGMQQQPMHMHPQAQFPQQSPQMRPSQAHAQSQQQSALLPLP 568

Query: 391  GQHPN-MPPGQQPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQTQFFPAQASGL 567
            GQ  N +PP Q P+H                HP +Q+  A Q    +   QF      G 
Sbjct: 569  GQAQNVLPPQQLPVHPHQQAG----------HPVHQR-AAMQPIQQSLPHQFVQQPPLGT 617

Query: 568  VNSQPHQSGPFLQ-QQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGLPSHQSQNL 744
              +Q HQ G F+Q   P MQ  LRPQ  P S                  HG+     QN+
Sbjct: 618  GQNQLHQQGSFMQPPTPTMQSQLRPQAPPQSWQQHSHAYPQPQQKVAMLHGMQPQLPQNV 677

Query: 745  PGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHPQSSSEQQPIYAHQ 924
             GRP M N G+Q   F Q+ +G          SGAV   P     +  S+++    +  Q
Sbjct: 678  -GRPGMPNQGVQPQPFPQSQAG---------LSGAVQLRPMHLGPNQPSANQTLGQHLEQ 727

Query: 925  SGIPQSG------TESAPFQSATKIQVSSVLAAVKTELKADALSQKPEIKEENGFLATSS 1086
            S  PQ G      T   P    +K  V           + ++ S+K   ++ NG  ATS 
Sbjct: 728  SAHPQPGLNVKQTTFEKPDDDLSKKGVGG--------QEGESFSEKTAREDANGVAATSG 779

Query: 1087 QGADSVELKIPSSESKLKSVGVDEKAGNAYESSEPISDV----KEISKSSQLLEKDNSLL 1254
              +++VE+K   SE+ +KS  +DEK     E  + IS +    KEI +S + L  D    
Sbjct: 780  IESNTVEIK---SETDMKS--MDEKQKTTGEDEDTISRINNSAKEIPESMRALGSDPMQQ 834

Query: 1255 AKKDLEEPKIKQMVKEEA-SGILEPLAG----GIAAETETKDGEHVPFRSRPTENSQQED 1419
            A +D  EP IKQMVKEE     +E   G    GI  E +  +    P +    E+S  +D
Sbjct: 835  ASED-GEPVIKQMVKEEVIKSTVERSPGGKSIGIVVEDQKDELSVPPKQVEQVEHSLLQD 893

Query: 1420 KEIQEETLHKNVSLQKTEALETM----QKDA 1500
            KEIQ   L KN  +Q+ E L+ M    QKD+
Sbjct: 894  KEIQNGLLMKNPPIQQVEILDEMGGKLQKDS 924


>emb|CAN64434.1| hypothetical protein VITISV_000937 [Vitis vinifera]
          Length = 1131

 Score =  127 bits (318), Expect = 2e-26
 Identities = 138/456 (30%), Positives = 187/456 (41%), Gaps = 27/456 (5%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGA-PQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVH 390
            VTGH S+PQ  P QQMP G   QQP+                                + 
Sbjct: 80   VTGHHSFPQPRPQQQMPLGGMQQQPMHMHPQAQFPQQSPQMRPSQAHAQSQQQSALLPLP 139

Query: 391  GQHPN-MPPGQQPLHTXXXXXXXXXXXXXXLHPPYQ----QPVAPQV-HGHAQQTQFFPA 552
            GQ  N +PP Q P+H                HP +Q    QP+   + H   QQ      
Sbjct: 140  GQAQNVLPPQQLPVHPHQQAG----------HPVHQRAAMQPIQQSLPHQXVQQPPL--- 186

Query: 553  QASGLVNSQPHQSGPFLQ-QQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGLPSH 729
               G   +Q HQ G F+Q   P MQ  LRPQ  P S                  HG+   
Sbjct: 187  ---GTGQNQLHQQGSFMQPPTPTMQSQLRPQAPPQSWQQHSHAYPQPQQKVAMLHGMQPQ 243

Query: 730  QSQNLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHPQSSSEQQP 909
              QN+ GRP M N G+Q   F Q+ +G          SGAV   P     +  S+++   
Sbjct: 244  LPQNV-GRPGMPNQGVQPQPFPQSQAG---------LSGAVQLRPMHLGPNQPSANQTLG 293

Query: 910  IYAHQSGIPQSG------TESAPFQSATKIQVSSVLAAVKTELKADALSQKPEIKEENGF 1071
             +  QS  PQ G      T   P    +K  V           + ++ S+K   ++ NG 
Sbjct: 294  QHLEQSAHPQPGLNVKQTTFEKPDDDLSKKGVGG--------QEGESFSEKTAREDANGV 345

Query: 1072 LATSSQGADSVELKIPSSESKLKSVGVDEKAGNAYESSEPISDV----KEISKSSQLLEK 1239
             ATS   +++VE+K   SE+ +KS  +DEK     E  + IS +    KEI +S + L  
Sbjct: 346  AATSGIESNTVEIK---SETDMKS--MDEKQKTTGEDEDTISRINNSAKEIPESMRALGS 400

Query: 1240 DNSLLAKKDLEEPKIKQMVKEEA-SGILEPLAG----GIAAETETKDGEHVPFRSRPTEN 1404
            D    A +D  EP IKQMVKEE     +E   G    GI  E +  +    P +    E+
Sbjct: 401  DPMQQASED-GEPVIKQMVKEEVIKSTVERSPGGKSIGIVVEDQKDELSVPPKQVEQVEH 459

Query: 1405 SQQEDKEIQEETLHKNVSLQKTEALETM----QKDA 1500
            S  +DKEIQ   L KN  +Q+ E L+ M    QKD+
Sbjct: 460  SLLQDKEIQNGLLMKNPPIQQVEILDEMGGKLQKDS 495


>ref|XP_007016238.1| Uncharacterized protein isoform 8 [Theobroma cacao]
            gi|508786601|gb|EOY33857.1| Uncharacterized protein
            isoform 8 [Theobroma cacao]
          Length = 972

 Score =  108 bits (269), Expect = 9e-21
 Identities = 135/502 (26%), Positives = 175/502 (34%), Gaps = 39/502 (7%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGAPQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVH- 390
            VTGHQSYP   PHQQM    PQ P+                                 H 
Sbjct: 19   VTGHQSYPLSQPHQQMQLVTPQHPMHVHAQGGLHPQQHPAQMQNSYPQQPPQMRPPQPHV 78

Query: 391  ----GQHPNMPPGQ----QPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQTQFF 546
                 Q P + P      Q +H               +HP       P V     Q Q  
Sbjct: 79   AISNQQQPGLLPSPGSMLQQVHLHSHQPALPVQQRPVMHPAASPMSQPYV-----QQQPL 133

Query: 547  PAQASGLVNSQPHQSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGLPS 726
              Q  GLV  Q  Q GPF+QQQ + Q   RP G P S                 SH +  
Sbjct: 134  STQPVGLVQPQMLQQGPFVQQQSSFQSQSRPLGPPHSFPQPPHAYAQPQQNVAGSHAVHF 193

Query: 727  HQSQNLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHPQSSSEQQ 906
            H S NL GRP+  NHG+Q    Q  P    G   KP+  GA            Q SS Q 
Sbjct: 194  HPSHNLVGRPMTPNHGVQS---QPYPHSAAGTPVKPVHLGA-----------NQPSSYQN 239

Query: 907  PIYA--HQSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADALSQKPEIKEENGFLAT 1080
             ++   +QSG+        P    T   V+        E +AD+ S     KE N     
Sbjct: 240  NVFRTNNQSGVTSQPMSEVPGDHGTDKNVA--------EQEADSSSPGTARKEANELDMA 291

Query: 1081 SSQGADSVELKIPSSESKLKSVGVDEK-AGNAYESSEPIS-DVKEISKSSQLLEKDNSLL 1254
            SS GAD  E      E+ LKS  VDEK  G+  + S  +    KE  +S + +  D    
Sbjct: 292  SSLGADVAEKNTAKLEADLKS--VDEKLTGDVGDDSNGVDISTKETPESRRTVGTD---- 345

Query: 1255 AKKDLEEPKIKQMVKEEASGILEPLAGGIAAETETKDGEHVPFRSRPTENSQQEDKEIQE 1434
              +   +P  K MV  EA          I  + +  +GEH     +  +    +   +QE
Sbjct: 346  -LEQHRDPVSKNMVTCEA----------IEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQE 394

Query: 1435 ETLHKNVSLQKTEALETMQKDAEMPH-----KGSDGS----------------VPDKDTT 1551
              L +       E    MQKD  +PH     KG  G+                +P   + 
Sbjct: 395  AKLGE-------EQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSV 447

Query: 1552 SVILQG-----QIPGGERNNMQ 1602
              + QG     Q+P G  NN Q
Sbjct: 448  PNVDQGRHQPLQMPYGSNNNQQ 469


>ref|XP_007016237.1| Uncharacterized protein isoform 7 [Theobroma cacao]
            gi|508786600|gb|EOY33856.1| Uncharacterized protein
            isoform 7 [Theobroma cacao]
          Length = 975

 Score =  108 bits (269), Expect = 9e-21
 Identities = 135/502 (26%), Positives = 175/502 (34%), Gaps = 39/502 (7%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGAPQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVH- 390
            VTGHQSYP   PHQQM    PQ P+                                 H 
Sbjct: 19   VTGHQSYPLSQPHQQMQLVTPQHPMHVHAQGGLHPQQHPAQMQNSYPQQPPQMRPPQPHV 78

Query: 391  ----GQHPNMPPGQ----QPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQTQFF 546
                 Q P + P      Q +H               +HP       P V     Q Q  
Sbjct: 79   AISNQQQPGLLPSPGSMLQQVHLHSHQPALPVQQRPVMHPAASPMSQPYV-----QQQPL 133

Query: 547  PAQASGLVNSQPHQSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGLPS 726
              Q  GLV  Q  Q GPF+QQQ + Q   RP G P S                 SH +  
Sbjct: 134  STQPVGLVQPQMLQQGPFVQQQSSFQSQSRPLGPPHSFPQPPHAYAQPQQNVAGSHAVHF 193

Query: 727  HQSQNLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHPQSSSEQQ 906
            H S NL GRP+  NHG+Q    Q  P    G   KP+  GA            Q SS Q 
Sbjct: 194  HPSHNLVGRPMTPNHGVQS---QPYPHSAAGTPVKPVHLGA-----------NQPSSYQN 239

Query: 907  PIYA--HQSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADALSQKPEIKEENGFLAT 1080
             ++   +QSG+        P    T   V+        E +AD+ S     KE N     
Sbjct: 240  NVFRTNNQSGVTSQPMSEVPGDHGTDKNVA--------EQEADSSSPGTARKEANELDMA 291

Query: 1081 SSQGADSVELKIPSSESKLKSVGVDEK-AGNAYESSEPIS-DVKEISKSSQLLEKDNSLL 1254
            SS GAD  E      E+ LKS  VDEK  G+  + S  +    KE  +S + +  D    
Sbjct: 292  SSLGADVAEKNTAKLEADLKS--VDEKLTGDVGDDSNGVDISTKETPESRRTVGTD---- 345

Query: 1255 AKKDLEEPKIKQMVKEEASGILEPLAGGIAAETETKDGEHVPFRSRPTENSQQEDKEIQE 1434
              +   +P  K MV  EA          I  + +  +GEH     +  +    +   +QE
Sbjct: 346  -LEQHRDPVSKNMVTCEA----------IEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQE 394

Query: 1435 ETLHKNVSLQKTEALETMQKDAEMPH-----KGSDGS----------------VPDKDTT 1551
              L +       E    MQKD  +PH     KG  G+                +P   + 
Sbjct: 395  AKLGE-------EQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSV 447

Query: 1552 SVILQG-----QIPGGERNNMQ 1602
              + QG     Q+P G  NN Q
Sbjct: 448  PNVDQGRHQPLQMPYGSNNNQQ 469


>ref|XP_007016236.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508786599|gb|EOY33855.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 1345

 Score =  108 bits (269), Expect = 9e-21
 Identities = 135/502 (26%), Positives = 175/502 (34%), Gaps = 39/502 (7%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGAPQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVH- 390
            VTGHQSYP   PHQQM    PQ P+                                 H 
Sbjct: 452  VTGHQSYPLSQPHQQMQLVTPQHPMHVHAQGGLHPQQHPAQMQNSYPQQPPQMRPPQPHV 511

Query: 391  ----GQHPNMPPGQ----QPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQTQFF 546
                 Q P + P      Q +H               +HP       P V     Q Q  
Sbjct: 512  AISNQQQPGLLPSPGSMLQQVHLHSHQPALPVQQRPVMHPAASPMSQPYV-----QQQPL 566

Query: 547  PAQASGLVNSQPHQSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGLPS 726
              Q  GLV  Q  Q GPF+QQQ + Q   RP G P S                 SH +  
Sbjct: 567  STQPVGLVQPQMLQQGPFVQQQSSFQSQSRPLGPPHSFPQPPHAYAQPQQNVAGSHAVHF 626

Query: 727  HQSQNLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHPQSSSEQQ 906
            H S NL GRP+  NHG+Q    Q  P    G   KP+  GA            Q SS Q 
Sbjct: 627  HPSHNLVGRPMTPNHGVQS---QPYPHSAAGTPVKPVHLGA-----------NQPSSYQN 672

Query: 907  PIYA--HQSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADALSQKPEIKEENGFLAT 1080
             ++   +QSG+        P    T   V+        E +AD+ S     KE N     
Sbjct: 673  NVFRTNNQSGVTSQPMSEVPGDHGTDKNVA--------EQEADSSSPGTARKEANELDMA 724

Query: 1081 SSQGADSVELKIPSSESKLKSVGVDEK-AGNAYESSEPIS-DVKEISKSSQLLEKDNSLL 1254
            SS GAD  E      E+ LKS  VDEK  G+  + S  +    KE  +S + +  D    
Sbjct: 725  SSLGADVAEKNTAKLEADLKS--VDEKLTGDVGDDSNGVDISTKETPESRRTVGTD---- 778

Query: 1255 AKKDLEEPKIKQMVKEEASGILEPLAGGIAAETETKDGEHVPFRSRPTENSQQEDKEIQE 1434
              +   +P  K MV  EA          I  + +  +GEH     +  +    +   +QE
Sbjct: 779  -LEQHRDPVSKNMVTCEA----------IEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQE 827

Query: 1435 ETLHKNVSLQKTEALETMQKDAEMPH-----KGSDGS----------------VPDKDTT 1551
              L +       E    MQKD  +PH     KG  G+                +P   + 
Sbjct: 828  AKLGE-------EQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSV 880

Query: 1552 SVILQG-----QIPGGERNNMQ 1602
              + QG     Q+P G  NN Q
Sbjct: 881  PNVDQGRHQPLQMPYGSNNNQQ 902


>ref|XP_007016235.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508786598|gb|EOY33854.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 1358

 Score =  108 bits (269), Expect = 9e-21
 Identities = 135/502 (26%), Positives = 175/502 (34%), Gaps = 39/502 (7%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGAPQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVH- 390
            VTGHQSYP   PHQQM    PQ P+                                 H 
Sbjct: 452  VTGHQSYPLSQPHQQMQLVTPQHPMHVHAQGGLHPQQHPAQMQNSYPQQPPQMRPPQPHV 511

Query: 391  ----GQHPNMPPGQ----QPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQTQFF 546
                 Q P + P      Q +H               +HP       P V     Q Q  
Sbjct: 512  AISNQQQPGLLPSPGSMLQQVHLHSHQPALPVQQRPVMHPAASPMSQPYV-----QQQPL 566

Query: 547  PAQASGLVNSQPHQSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGLPS 726
              Q  GLV  Q  Q GPF+QQQ + Q   RP G P S                 SH +  
Sbjct: 567  STQPVGLVQPQMLQQGPFVQQQSSFQSQSRPLGPPHSFPQPPHAYAQPQQNVAGSHAVHF 626

Query: 727  HQSQNLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHPQSSSEQQ 906
            H S NL GRP+  NHG+Q    Q  P    G   KP+  GA            Q SS Q 
Sbjct: 627  HPSHNLVGRPMTPNHGVQS---QPYPHSAAGTPVKPVHLGA-----------NQPSSYQN 672

Query: 907  PIYA--HQSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADALSQKPEIKEENGFLAT 1080
             ++   +QSG+        P    T   V+        E +AD+ S     KE N     
Sbjct: 673  NVFRTNNQSGVTSQPMSEVPGDHGTDKNVA--------EQEADSSSPGTARKEANELDMA 724

Query: 1081 SSQGADSVELKIPSSESKLKSVGVDEK-AGNAYESSEPIS-DVKEISKSSQLLEKDNSLL 1254
            SS GAD  E      E+ LKS  VDEK  G+  + S  +    KE  +S + +  D    
Sbjct: 725  SSLGADVAEKNTAKLEADLKS--VDEKLTGDVGDDSNGVDISTKETPESRRTVGTD---- 778

Query: 1255 AKKDLEEPKIKQMVKEEASGILEPLAGGIAAETETKDGEHVPFRSRPTENSQQEDKEIQE 1434
              +   +P  K MV  EA          I  + +  +GEH     +  +    +   +QE
Sbjct: 779  -LEQHRDPVSKNMVTCEA----------IEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQE 827

Query: 1435 ETLHKNVSLQKTEALETMQKDAEMPH-----KGSDGS----------------VPDKDTT 1551
              L +       E    MQKD  +PH     KG  G+                +P   + 
Sbjct: 828  AKLGE-------EQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSV 880

Query: 1552 SVILQG-----QIPGGERNNMQ 1602
              + QG     Q+P G  NN Q
Sbjct: 881  PNVDQGRHQPLQMPYGSNNNQQ 902


>ref|XP_007016232.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|590588563|ref|XP_007016233.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
            gi|590588573|ref|XP_007016234.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508786595|gb|EOY33851.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508786596|gb|EOY33852.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508786597|gb|EOY33853.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 1408

 Score =  108 bits (269), Expect = 9e-21
 Identities = 135/502 (26%), Positives = 175/502 (34%), Gaps = 39/502 (7%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGAPQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVH- 390
            VTGHQSYP   PHQQM    PQ P+                                 H 
Sbjct: 452  VTGHQSYPLSQPHQQMQLVTPQHPMHVHAQGGLHPQQHPAQMQNSYPQQPPQMRPPQPHV 511

Query: 391  ----GQHPNMPPGQ----QPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQTQFF 546
                 Q P + P      Q +H               +HP       P V     Q Q  
Sbjct: 512  AISNQQQPGLLPSPGSMLQQVHLHSHQPALPVQQRPVMHPAASPMSQPYV-----QQQPL 566

Query: 547  PAQASGLVNSQPHQSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGLPS 726
              Q  GLV  Q  Q GPF+QQQ + Q   RP G P S                 SH +  
Sbjct: 567  STQPVGLVQPQMLQQGPFVQQQSSFQSQSRPLGPPHSFPQPPHAYAQPQQNVAGSHAVHF 626

Query: 727  HQSQNLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHPQSSSEQQ 906
            H S NL GRP+  NHG+Q    Q  P    G   KP+  GA            Q SS Q 
Sbjct: 627  HPSHNLVGRPMTPNHGVQS---QPYPHSAAGTPVKPVHLGA-----------NQPSSYQN 672

Query: 907  PIYA--HQSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADALSQKPEIKEENGFLAT 1080
             ++   +QSG+        P    T   V+        E +AD+ S     KE N     
Sbjct: 673  NVFRTNNQSGVTSQPMSEVPGDHGTDKNVA--------EQEADSSSPGTARKEANELDMA 724

Query: 1081 SSQGADSVELKIPSSESKLKSVGVDEK-AGNAYESSEPIS-DVKEISKSSQLLEKDNSLL 1254
            SS GAD  E      E+ LKS  VDEK  G+  + S  +    KE  +S + +  D    
Sbjct: 725  SSLGADVAEKNTAKLEADLKS--VDEKLTGDVGDDSNGVDISTKETPESRRTVGTD---- 778

Query: 1255 AKKDLEEPKIKQMVKEEASGILEPLAGGIAAETETKDGEHVPFRSRPTENSQQEDKEIQE 1434
              +   +P  K MV  EA          I  + +  +GEH     +  +    +   +QE
Sbjct: 779  -LEQHRDPVSKNMVTCEA----------IEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQE 827

Query: 1435 ETLHKNVSLQKTEALETMQKDAEMPH-----KGSDGS----------------VPDKDTT 1551
              L +       E    MQKD  +PH     KG  G+                +P   + 
Sbjct: 828  AKLGE-------EQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSV 880

Query: 1552 SVILQG-----QIPGGERNNMQ 1602
              + QG     Q+P G  NN Q
Sbjct: 881  PNVDQGRHQPLQMPYGSNNNQQ 902


>ref|XP_007016231.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508786594|gb|EOY33850.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1326

 Score =  108 bits (269), Expect = 9e-21
 Identities = 135/502 (26%), Positives = 175/502 (34%), Gaps = 39/502 (7%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGAPQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVH- 390
            VTGHQSYP   PHQQM    PQ P+                                 H 
Sbjct: 452  VTGHQSYPLSQPHQQMQLVTPQHPMHVHAQGGLHPQQHPAQMQNSYPQQPPQMRPPQPHV 511

Query: 391  ----GQHPNMPPGQ----QPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQTQFF 546
                 Q P + P      Q +H               +HP       P V     Q Q  
Sbjct: 512  AISNQQQPGLLPSPGSMLQQVHLHSHQPALPVQQRPVMHPAASPMSQPYV-----QQQPL 566

Query: 547  PAQASGLVNSQPHQSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGLPS 726
              Q  GLV  Q  Q GPF+QQQ + Q   RP G P S                 SH +  
Sbjct: 567  STQPVGLVQPQMLQQGPFVQQQSSFQSQSRPLGPPHSFPQPPHAYAQPQQNVAGSHAVHF 626

Query: 727  HQSQNLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHPQSSSEQQ 906
            H S NL GRP+  NHG+Q    Q  P    G   KP+  GA            Q SS Q 
Sbjct: 627  HPSHNLVGRPMTPNHGVQS---QPYPHSAAGTPVKPVHLGA-----------NQPSSYQN 672

Query: 907  PIYA--HQSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADALSQKPEIKEENGFLAT 1080
             ++   +QSG+        P    T   V+        E +AD+ S     KE N     
Sbjct: 673  NVFRTNNQSGVTSQPMSEVPGDHGTDKNVA--------EQEADSSSPGTARKEANELDMA 724

Query: 1081 SSQGADSVELKIPSSESKLKSVGVDEK-AGNAYESSEPIS-DVKEISKSSQLLEKDNSLL 1254
            SS GAD  E      E+ LKS  VDEK  G+  + S  +    KE  +S + +  D    
Sbjct: 725  SSLGADVAEKNTAKLEADLKS--VDEKLTGDVGDDSNGVDISTKETPESRRTVGTD---- 778

Query: 1255 AKKDLEEPKIKQMVKEEASGILEPLAGGIAAETETKDGEHVPFRSRPTENSQQEDKEIQE 1434
              +   +P  K MV  EA          I  + +  +GEH     +  +    +   +QE
Sbjct: 779  -LEQHRDPVSKNMVTCEA----------IEDQKDVHNGEHKVEEIKIKDGPSLKTPPLQE 827

Query: 1435 ETLHKNVSLQKTEALETMQKDAEMPH-----KGSDGS----------------VPDKDTT 1551
              L +       E    MQKD  +PH     KG  G+                +P   + 
Sbjct: 828  AKLGE-------EQNGKMQKDKILPHDQGTPKGPAGNGFRGIPPSSQVQPGGYLPPSHSV 880

Query: 1552 SVILQG-----QIPGGERNNMQ 1602
              + QG     Q+P G  NN Q
Sbjct: 881  PNVDQGRHQPLQMPYGSNNNQQ 902


>ref|XP_002520450.1| hypothetical protein RCOM_0731250 [Ricinus communis]
            gi|223540292|gb|EEF41863.1| hypothetical protein
            RCOM_0731250 [Ricinus communis]
          Length = 1329

 Score = 98.6 bits (244), Expect = 7e-18
 Identities = 129/477 (27%), Positives = 175/477 (36%), Gaps = 20/477 (4%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGAPQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVH- 390
            VTGH SYPQ  P QQ+  G  Q P+                                   
Sbjct: 428  VTGHHSYPQPQPQQQLQLGGLQHPVHYAQGGPQPQFPQQSPLLRPPQSHVPVQNPQQSGL 487

Query: 391  ----GQHPNMPPGQQPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQTQFFPAQA 558
                GQ PN+PP QQ                  +    QQP+  Q   + QQ   FP QA
Sbjct: 488  LPSPGQVPNVPPAQQQPVQAHAQQPGLPVHQLPVMQSVQQPIHQQ---YVQQQPPFPGQA 544

Query: 559  SGLVNSQPHQSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGLPSHQSQ 738
             G V +Q HQ G ++QQ       LRPQG  PS                  HG  +HQ+Q
Sbjct: 545  LGPVQNQVHQQGAYMQQHLHGHSQLRPQG--PS-----HAYTQPLQNVPLPHGTQAHQAQ 597

Query: 739  NLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPP--RTNAHPQSSSEQQPI 912
            NL GRP        H      P   VG   +PMQ GA        R N   Q SSEQ   
Sbjct: 598  NLGGRPPYGVPTYPH------PHSSVGMQVRPMQVGADQQSGNAFRANNQMQLSSEQ--- 648

Query: 913  YAHQSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADALSQKPEIKEENGFLATSSQG 1092
                     SG  S P  +     +      ++   +AD+ SQK   ++ N     S  G
Sbjct: 649  --------PSGAISRPTSNRQGDDI------IEKSSEADSSSQKNVRRDPNDLDVASGLG 694

Query: 1093 ADSVELKIPSSESKLKSVGVDEKAGNAY--ESSEPISDVKEISKSSQLLE----KDNSLL 1254
            +D  +LK   SES LK V  D K+ N    E  +   D K+IS +    E    KD  ++
Sbjct: 695  SDVSDLKTVISESNLKPVDDDNKSINEVKEEPKKGNDDQKDISNTDNDAEDKGVKDGPVM 754

Query: 1255 AKKDLEEP---KIKQMVKEEASGILEPLAGGIAAETETK-DGEHVPFRSRP-TENSQQED 1419
              + L E    + + M  +    +    +GG     + + +G   P  S P  E  +Q+ 
Sbjct: 755  KNRPLPEAEHLEDQSMKSQRGRNVTPQHSGGFILHGQVQGEGLAQPSHSIPIAEQGKQQP 814

Query: 1420 KEIQEETLHKNVSLQKTEALETMQKDAEMPHKGSDGSVPDKDTTSV--ILQGQIPGG 1584
              I     H   +LQ+   + +    A  P     G +P   +  V  +  G IP G
Sbjct: 815  PVIP----HGPSALQQ-RPIGSSLLTAPPPGSLHHGQIPGHPSARVRPLGPGHIPHG 866


>ref|XP_006424987.1| hypothetical protein CICLE_v10027683mg [Citrus clementina]
            gi|557526921|gb|ESR38227.1| hypothetical protein
            CICLE_v10027683mg [Citrus clementina]
          Length = 1392

 Score = 89.7 bits (221), Expect = 3e-15
 Identities = 116/455 (25%), Positives = 180/455 (39%), Gaps = 17/455 (3%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGAP-QQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVH 390
            VT H SY Q  PHQQ+P   P Q P+                                 +
Sbjct: 423  VTSHHSYSQPQPHQQIPLSGPLQHPMYVHPHTGAQSQMQNQFPQQTPSMRPAQSHATISN 482

Query: 391  ----------GQHPNMPPGQQPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQTQ 540
                      GQ  N+PP QQ                  +  P QQP+  Q   + QQ  
Sbjct: 483  QPLSTGLPPLGQVANIPPAQQLPVRPHAPQPGVPVSQHPVMQPVQQPMPYQ---YVQQHL 539

Query: 541  FFPAQASGLVNSQPHQSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGL 720
             F  Q         HQ GPF+Q      P LRPQ  P S                  +G+
Sbjct: 540  PFSGQ---------HQQGPFVQ------PQLRPQRPPQSLQLHPPAYSQPLQNVAVINGM 584

Query: 721  PSHQSQNLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHPQSSSE 900
             SHQ +NL G+PL  N+G+    +QQ+ +     H +P Q GA            QSSS 
Sbjct: 585  QSHQPRNL-GQPLTPNYGVHAQSYQQSAT---SLHVRPAQLGA-----------NQSSSN 629

Query: 901  QQPIYAHQSGIPQSGTESAPFQSATKI-QVSSVLAAVKTELKADALSQKPEIKEENGFLA 1077
            Q  ++   + +  S  + A   S  ++ + + V   +  E +A++ S+K   K +N    
Sbjct: 630  QSNLFWTSNQVQLSSEQQAGATSKPEMSEKNEVAVKIAHEREAESSSEK-TAKTDN--FD 686

Query: 1078 TSSQGADSVELKIPSSESKLKSVGVDEKAGNAYESSEPISDVKEISKSSQLLEKDNSLLA 1257
            T    A +V +K+P SE+ +K+  VDE      + +  +      + S + +    S +A
Sbjct: 687  TPGPEAAAVGMKVPKSETDVKA-AVDEIKTEVEDKTNVVD-----TSSKEFVTDRESHIA 740

Query: 1258 KKDLEEPKIKQMVKEEASGILEPLAGGIAAETETKDGEHVPFRSRPTENSQQEDKEIQEE 1437
            +       I +MVKEE   ++E + G        KD  +V  +    +      KE+QEE
Sbjct: 741  E---NVQPINKMVKEE---VIENVEG-------QKDSANVDIK----QEEHSVSKEVQEE 783

Query: 1438 TLHKNVSLQK----TEALETMQKDAEMPH-KGSDG 1527
             L K  ++Q+     E  E +QK+ ++P  +G+ G
Sbjct: 784  PLLKTSTMQQGTQFGEQSEKVQKEQKVPQAQGAQG 818


>ref|XP_006488440.1| PREDICTED: mediator of RNA polymerase II transcription subunit
            15-like isoform X1 [Citrus sinensis]
            gi|568870502|ref|XP_006488441.1| PREDICTED: mediator of
            RNA polymerase II transcription subunit 15-like isoform
            X2 [Citrus sinensis] gi|568870504|ref|XP_006488442.1|
            PREDICTED: mediator of RNA polymerase II transcription
            subunit 15-like isoform X3 [Citrus sinensis]
            gi|568870506|ref|XP_006488443.1| PREDICTED: mediator of
            RNA polymerase II transcription subunit 15-like isoform
            X4 [Citrus sinensis]
          Length = 1392

 Score = 87.8 bits (216), Expect = 1e-14
 Identities = 116/454 (25%), Positives = 177/454 (38%), Gaps = 16/454 (3%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGAP-QQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVH 390
            VT H SY Q  PHQQ+P   P Q P+                                 +
Sbjct: 423  VTSHHSYSQPQPHQQIPLSGPLQHPMYVHPHTGAQSQMQNQFPQQTPSMRPAQSHATISN 482

Query: 391  ----------GQHPNMPPGQQPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQTQ 540
                      GQ  N+PP QQ                  +  P QQP+  Q   + QQ  
Sbjct: 483  QPLSTGLPPLGQVANIPPAQQLPVRPHAPQPGVPVSQHPVMQPVQQPMPYQ---YVQQHL 539

Query: 541  FFPAQASGLVNSQPHQSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGL 720
             F  Q         HQ GPF+Q      P LRPQ  P S                  +G+
Sbjct: 540  PFSGQ---------HQQGPFVQ------PQLRPQRPPQSLQLHPPAYSQPLQNVAVINGM 584

Query: 721  PSHQSQNLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHPQSSSE 900
             SHQ +NL G+PL  N+G+    +QQ+ +     H +P Q GA        ++  QS+  
Sbjct: 585  QSHQPRNL-GQPLTPNYGVHAQSYQQSAT---SLHVRPAQLGA------NQSSSNQSNLS 634

Query: 901  QQPIYAHQSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADALSQKPEIKEENGFLAT 1080
                    S   Q+G  S P  S    + + V   +  E +A++ S+K   K +N    T
Sbjct: 635  WTSNQVQLSSEQQAGATSKPEMS----EKNEVAVKIAHEREAESSSEK-TAKTDN--FDT 687

Query: 1081 SSQGADSVELKIPSSESKLKSVGVDEKAGNAYESSEPISDVKEISKSSQLLEKDNSLLAK 1260
                A +V +K+P SE+ +K+  VDE      + +  +      + S + +    S +A+
Sbjct: 688  PGPEAAAVGMKVPKSETDVKA-AVDEIKTEVEDKTNVVD-----TSSKEFVTDRESHIAE 741

Query: 1261 KDLEEPKIKQMVKEEASGILEPLAGGIAAETETKDGEHVPFRSRPTENSQQEDKEIQEET 1440
                   I +MVKEE   ++E + G        KD  +V  +    +      KE+QEE 
Sbjct: 742  ---NVQPINKMVKEE---VIENVEG-------QKDSANVDIK----QEEHSVSKEVQEEP 784

Query: 1441 LHKNVSLQK----TEALETMQKDAEMPH-KGSDG 1527
            L K  ++Q+     E  E +QK+ ++P  +G+ G
Sbjct: 785  LLKTSTMQQGTQFGEQSEKVQKEQKVPQAQGAQG 818


>ref|XP_006379033.1| hypothetical protein POPTR_0009s04520g [Populus trichocarpa]
            gi|550331020|gb|ERP56830.1| hypothetical protein
            POPTR_0009s04520g [Populus trichocarpa]
          Length = 1315

 Score = 87.8 bits (216), Expect = 1e-14
 Identities = 121/475 (25%), Positives = 174/475 (36%), Gaps = 21/475 (4%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGA-----------PQQPLXXXXXXXXXXXXXXXXXXXXXXXX 360
            VTGH SY Q   HQQM  GA            QQP+                        
Sbjct: 414  VTGHHSYQQPQIHQQMQTGALKHSQGGPQPHSQQPVQMQSQFPQQSSLWPQPQYHAAVQN 473

Query: 361  XXXXXXXXVHGQHPNMPPG-QQPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQT 537
                      GQ PN+PP  QQP+H+              + P  Q    P    +AQ  
Sbjct: 474  LQQPGLLPSQGQVPNIPPALQQPIHSHAHQPGLPVQQRPGMQPTPQ----PMHQQYAQHQ 529

Query: 538  QFFPAQASGLVNSQPHQSGPFLQQQ---PAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXX 708
            Q F  Q  G V++Q HQ GP++QQQ   P  Q  LRPQG P S                 
Sbjct: 530  QPFSGQPWGAVHNQAHQQGPYVQQQQLHPLTQ--LRPQGLPQSFQQPSHAYPHPQQNVLL 587

Query: 709  SHGLPSHQSQNLPGRPLMANHGLQHHQFQQTPSG------PVGPHAKPMQSGAVLPYPPR 870
             HG   HQ+++L   P     GL    + Q+ SG       +G +    QSG +L    +
Sbjct: 588  PHGAHPHQAKSLAVGP-----GLPAQSYPQSASGMQVRSIQIGAN---QQSGNIL----K 635

Query: 871  TNAHPQSSSEQQPIYAHQSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADALSQKPE 1050
            TN   + SS+Q           QSG  S   Q   +      L+A KT         K E
Sbjct: 636  TNNQVELSSDQ-----------QSGVSSRQRQGDIEKGAEGELSAQKT--------IKKE 676

Query: 1051 IKEENGFLATSSQGADSVELKIPSSESKLKSVGVDEKAGNAYESSEPISDVKEISKSSQL 1230
            + + +  LA     AD+ E+K   SES LK   VD+K       ++P  + K++ +S   
Sbjct: 677  LNDLDAGLA-----ADASEMKTIKSESDLKQ--VDDK-------NKPTGEAKDVPESLAA 722

Query: 1231 LEKDNSLLAKKDLEEPKIKQMVKEEASGILEPLAGGIAAETETKDGEHVPFRSRPTENSQ 1410
               ++S           IKQ+ +E   G  E        + +  + +H       +E+  
Sbjct: 723  ANGESS-----------IKQVKEEHRDGADE--------QNDVSNADHEKVELSVSEHKD 763

Query: 1411 QEDKEIQEETLHKNVSLQKTEALETMQKDAEMPHKGSDGSVPDKDTTSVILQGQI 1575
                E     L + +   + +   T Q     P  G   S     + S + QG++
Sbjct: 764  GPLLETAPSHLEEQIMKLQKDKTPTSQSFGGFPPNGHVQS----QSVSAVDQGKL 814


>ref|XP_004295721.1| PREDICTED: uncharacterized protein LOC101314450 [Fragaria vesca
            subsp. vesca]
          Length = 1316

 Score = 76.3 bits (186), Expect = 4e-11
 Identities = 115/483 (23%), Positives = 165/483 (34%), Gaps = 29/483 (6%)
 Frame = +1

Query: 220  GHQSYPQLHPHQQMPQGAPQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVHGQH 399
            GH  +PQ HPHQ +   APQQ                                  VH Q 
Sbjct: 407  GHHLFPQSHPHQPVLSAAPQQ--------------------------------RTVHLQS 434

Query: 400  PNMPPGQQPLHTXXXXXXXXXXXXXXLHPPYQQPVA------------------PQVHGH 525
               P  Q   H                 PP+Q  +                   P VH  
Sbjct: 435  QGAPNSQSQNHVQTQIQFPLQPPLLR-PPPFQTTIPNQPQTALLPSPSMISAQQPPVHSF 493

Query: 526  AQQTQFFPAQ------ASGLVNSQPHQSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXX 687
            AQQ    P Q         L   Q  Q+ P++QQ PA    LRPQG   S          
Sbjct: 494  AQQPGIPPLQRPLIQPVQQLNPQQYFQNQPYVQQTPATLSQLRPQGQSHSFPQHIRASNQ 553

Query: 688  XXXXXXXSHGLPSHQSQNLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPP 867
                   S G+   Q  NL GRP+M +HG+    + QT              G VLP P 
Sbjct: 554  SQQNVVLSQGMQHIQPSNLVGRPMMPSHGVLPQPYAQT-------------VGGVLPRPM 600

Query: 868  RTNAHPQSSSEQQPIYAHQSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADALSQKP 1047
                + QSS++        +   Q G  S P  +    +                  ++ 
Sbjct: 601  YPPLNHQSSNQNN--IGRTNNQVQPGANSRPTMTTRPAE------------------KEA 640

Query: 1048 EIKEENGF--LATSSQGADSVELKIPSSESKLKSVGVDEKAGNAYESSEPISDVKEISKS 1221
            E+  +NG   +  SS      E K   SE  +KS     K  +   S +     KEI +S
Sbjct: 641  ELSAKNGAQDVGVSSAVVADSEAKTVKSEVDIKSTDDGNKPSSEDRSYQ---GTKEIPES 697

Query: 1222 SQLLEKDNSLLAKKDLEEPKIKQMVKEEASGIL-EPLAGGI--AAETETKDGEHVPFRSR 1392
              +L  +    +K  L+E  +   +++ ++G L E +A G   A  +  K GEH   +  
Sbjct: 698  KGMLGANGESESKPTLKEEGVDSTLEDLSNGKLGELVAEGAKDAPSSGMKLGEH---KEM 754

Query: 1393 PTENSQQEDKEIQEETLHKNVSLQKTEALETMQKDAEMPHKGSDGSVPDKDTTSVILQGQ 1572
            P E +Q     ++++ L K VS  +  +       A +    + G +      S ILQ Q
Sbjct: 755  PPEEAQLHG--VKDKKLQKVVSSTEEGSQTVSISSAPIGQVQAGGLMQPSHPGSAILQ-Q 811

Query: 1573 IPG 1581
             PG
Sbjct: 812  KPG 814


>ref|XP_007204950.1| hypothetical protein PRUPE_ppa000292mg [Prunus persica]
            gi|462400592|gb|EMJ06149.1| hypothetical protein
            PRUPE_ppa000292mg [Prunus persica]
          Length = 1334

 Score = 74.3 bits (181), Expect = 1e-10
 Identities = 94/391 (24%), Positives = 141/391 (36%), Gaps = 26/391 (6%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGAPQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVHG 393
            VTG+  Y Q H HQ +  GAPQQ                                  +  
Sbjct: 437  VTGNHLYLQPHLHQPVQSGAPQQ--------------HTMHLQSHGMPHSQSQTPVQIQS 482

Query: 394  QHPNMPPGQQP--LHTXXXXXXXXXXXXXX-----LHPPYQQPVAPQVH--GHAQQTQFF 546
            Q P  PP  +P   HT                   ++P  QQPV    H  G+    +  
Sbjct: 483  QFPQQPPLMRPPPSHTTVPNQQQPALLPSPGQIQNINPAQQQPVHSYGHPPGNTVHQRPH 542

Query: 547  PAQASGLVNSQPHQSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGLPS 726
                   +  Q     PF+QQQP  Q  LRPQG   S                 S G+  
Sbjct: 543  MQAVQQPIPQQYFHHQPFVQQQPPTQ--LRPQGQSHSFPQHIHASTQSQQNVTLSQGI-Q 599

Query: 727  HQSQNLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHPQSSSEQQ 906
            H   NL GRP+M  HG+Q   + QT     G + +PM   A L            SS  Q
Sbjct: 600  HTQSNLGGRPMMPIHGVQSQTYAQTAG---GVYMRPMHPAANL------------SSTNQ 644

Query: 907  PIYAHQSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADALSQKPEIKEENGFLATSS 1086
                  + + QSG  S P  S  + +  S  +A            +   K+    + T+S
Sbjct: 645  NNMVRTNNLGQSGANSGPTTSERQAEQESEFSA------------QQNAKKVVHDVGTAS 692

Query: 1087 QGADSVELKIPSSESKLKSVGVDEKAGNAYESSEPISDVKEI----------SKSSQLLE 1236
                  E+K   SE+ +KS+  + K     ++ +  +  KEI          S S  +L+
Sbjct: 693  AVVADAEVKTAKSETDMKSIDNENKPTGEDKTIQGDTSSKEIPDIHALENGESVSKSILK 752

Query: 1237 K-------DNSLLAKKDLEEPKIKQMVKEEA 1308
            +       D+S ++  D+++ ++K++  EEA
Sbjct: 753  EEGVDGTLDHSNVSISDMKQRELKEIPSEEA 783


>ref|XP_002298329.1| hypothetical protein POPTR_0001s25430g [Populus trichocarpa]
            gi|222845587|gb|EEE83134.1| hypothetical protein
            POPTR_0001s25430g [Populus trichocarpa]
          Length = 1327

 Score = 72.4 bits (176), Expect = 5e-10
 Identities = 104/433 (24%), Positives = 154/433 (35%), Gaps = 24/433 (5%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGAPQ-----------QPLXXXXXXXXXXXXXXXXXXXXXXXX 360
            VTGH SY Q   HQQMP GAPQ           QP+                        
Sbjct: 420  VTGHHSYLQPQIHQQMPLGAPQHPRGGPQSQSQQPVQMQSQFIQQPPLLPPPQSHAAFQN 479

Query: 361  XXXXXXXXVHGQHPNMPPGQQ-PLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQT 537
                       Q P++PP QQ P+H+                P  Q  V P    + Q  
Sbjct: 480  PQQPGLLPSPVQVPSIPPAQQQPVHSHADQPGLPVQQ----RPVMQPIVQPMNQQYVQHQ 535

Query: 538  QFFPAQASGLVNSQPHQSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHG 717
            Q FP Q  G V++Q H  G + QQ P  Q  L P G   S                   G
Sbjct: 536  QPFPGQPWGAVHNQMHHQGLYGQQHP--QTQLHPHGPVQSFQQPSHAYPHPQQNVPLPRG 593

Query: 718  LPSHQSQNLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPM------QSGAVLPYPPRTNA 879
               HQ+Q+L     ++ HG+     Q  P       A+P+      QSG +L    +TN 
Sbjct: 594  AHPHQAQSLAVGTGVSPHGVL--SVQSYPQSTAVMQARPVQIGANQQSGNIL----KTNN 647

Query: 880  HPQSSSEQQ------PIYAHQSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADALSQ 1041
              + SSEQQ      PI   Q  I + G E            SS    +K EL       
Sbjct: 648  QVEFSSEQQAWVASRPISERQGDI-EKGAEGE----------SSAHNTIKKELNE----- 691

Query: 1042 KPEIKEENGFLATSSQGADSVELKIPSSESKLKSVGVDEKAGNAYESSEPISDVKEISKS 1221
                         +  GA + E+K   SES LK V          + ++P  + K+I  +
Sbjct: 692  -----------LDAGLGASASEMKTIKSESDLKQVD---------DENKPTGEAKDIPGA 731

Query: 1222 SQLLEKDNSLLAKKDLEEPKIKQMVKEEASGILEPLAGGIAAETETKDGEHVPFRSRPTE 1401
                  + S+   K+ +   +    K+ ++   + +   ++   + KDG  +   + P+ 
Sbjct: 732  PAAANGEPSIKQVKE-DHRDVTDKQKDISNADQKKVELSLSEYMDGKDG--LSLETAPSH 788

Query: 1402 NSQQEDKEIQEET 1440
              +Q  K  +++T
Sbjct: 789  LEEQSKKSQKDKT 801


>ref|XP_004153176.1| PREDICTED: uncharacterized protein LOC101214768 [Cucumis sativus]
          Length = 1177

 Score = 68.2 bits (165), Expect = 1e-08
 Identities = 114/485 (23%), Positives = 171/485 (35%), Gaps = 30/485 (6%)
 Frame = +1

Query: 217  TGHQSYPQLHPHQQMPQGAPQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVHGQ 396
            TG+ SYPQ   HQQM  G PQ                                      Q
Sbjct: 137  TGYPSYPQPQHHQQMQLGVPQNVPSAPQGGAHQQSQPLVQMQSQLP-------------Q 183

Query: 397  HPNMPPGQQPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQTQFFPAQASG---- 564
             P M P Q PL+                +    Q +   +H HAQQ    P QA+     
Sbjct: 184  PPPMRPSQPPLYQNQQQPPILPSSNQVQNVSSAQQL--HIHSHAQQPGG-PGQAANQRPV 240

Query: 565  -----------LVNSQPH--QSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXX 705
                       +V+   H  Q G F+Q Q  M P +R  G P S                
Sbjct: 241  MQLVQQSQSQQVVHQHQHFGQQGQFIQHQLHMTPQMRLPGPPNSLSQHNHAYAHLQHNAN 300

Query: 706  XSHGLPSHQSQNLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHP 885
              HG+  + SQ+  GRPL+ N G Q   + Q+    VG   + +Q GA            
Sbjct: 301  LPHGMQHNPSQSSEGRPLVPNQGAQSIPYSQS---MVGVPVRAIQPGA-----------N 346

Query: 886  QSSSEQQPIYAHQSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADA-----LSQKPE 1050
            Q + +Q P +                +++ ++Q+       K E   D       SQK  
Sbjct: 347  QPTIKQGPTFG---------------KNSNQVQLPDGFGERKLEKGPDGRESGLSSQKDA 391

Query: 1051 IKEENGFLATSSQGADSVELKIPSSESKLKSVGVDEKAGNAYESSEPISDVKEISKSSQL 1230
             +  N    +S+ G ++ ELKI  SE+        +K+ +   S+E         ++ Q 
Sbjct: 392  KRAANHLDVSSTMGTNAGELKIDKSEADKGRYAFGDKSIHFDTSTE---------RTPQN 442

Query: 1231 LEKDNSLLAKKDLEEPKIKQMVK-EEASGILEPLAGGIAAETETKDGEHVPFRSRPTENS 1407
               D++L      +  +++  VK E A G  +  +     E    D + +    +  E+ 
Sbjct: 443  GAMDSNLHVGDSGKTKQVELKVKVEAAEGTFDHSSNDKLGEVSILDQKDLGTEPKKKEDL 502

Query: 1408 QQEDKEIQEETLHKNVSLQKTEALE----TMQKDAE---MPHKGSDGSVPDKDTTSVILQ 1566
              E+K  QEE     +S Q TE  E     MQ D      P  G++ S     TTS ++ 
Sbjct: 503  VIENKGNQEEF---KISSQDTELREEQSKRMQNDTSGTPHPSSGTNESQQGATTTSSLIL 559

Query: 1567 GQIPG 1581
            G  PG
Sbjct: 560  GS-PG 563


>ref|XP_004145323.1| PREDICTED: uncharacterized protein LOC101205914 [Cucumis sativus]
          Length = 1434

 Score = 68.2 bits (165), Expect = 1e-08
 Identities = 114/485 (23%), Positives = 171/485 (35%), Gaps = 30/485 (6%)
 Frame = +1

Query: 217  TGHQSYPQLHPHQQMPQGAPQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVHGQ 396
            TG+ SYPQ   HQQM  G PQ                                      Q
Sbjct: 394  TGYPSYPQPQHHQQMQLGVPQNVPSAPQGGAHQQSQPLVQMQSQLP-------------Q 440

Query: 397  HPNMPPGQQPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQTQFFPAQASG---- 564
             P M P Q PL+                +    Q +   +H HAQQ    P QA+     
Sbjct: 441  PPPMRPSQPPLYQNQQQPPILPSSNQVQNVSSAQQL--HIHSHAQQPGG-PGQAANQRPV 497

Query: 565  -----------LVNSQPH--QSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXX 705
                       +V+   H  Q G F+Q Q  M P +R  G P S                
Sbjct: 498  MQLVQQSQSQQVVHQHQHFGQQGQFIQHQLHMTPQMRLPGPPNSLSQHNHAYAHLQHNAN 557

Query: 706  XSHGLPSHQSQNLPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHP 885
              HG+  + SQ+  GRPL+ N G Q   + Q+    VG   + +Q GA            
Sbjct: 558  LPHGMQHNPSQSSEGRPLVPNQGAQSIPYSQS---MVGVPVRAIQPGA-----------N 603

Query: 886  QSSSEQQPIYAHQSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADA-----LSQKPE 1050
            Q + +Q P +                +++ ++Q+       K E   D       SQK  
Sbjct: 604  QPTIKQGPTFG---------------KNSNQVQLPDGFGERKLEKGPDGRESGLSSQKDA 648

Query: 1051 IKEENGFLATSSQGADSVELKIPSSESKLKSVGVDEKAGNAYESSEPISDVKEISKSSQL 1230
             +  N    +S+ G ++ ELKI  SE+        +K+ +   S+E         ++ Q 
Sbjct: 649  KRAANHLDVSSTMGTNAGELKIDKSEADKGRYAFGDKSIHFDTSTE---------RTPQN 699

Query: 1231 LEKDNSLLAKKDLEEPKIKQMVK-EEASGILEPLAGGIAAETETKDGEHVPFRSRPTENS 1407
               D++L      +  +++  VK E A G  +  +     E    D + +    +  E+ 
Sbjct: 700  GAMDSNLHVGDSGKTKQVELKVKVEAAEGTFDHSSNDKLGEVSILDQKDLGTEPKKKEDL 759

Query: 1408 QQEDKEIQEETLHKNVSLQKTEALE----TMQKDAE---MPHKGSDGSVPDKDTTSVILQ 1566
              E+K  QEE     +S Q TE  E     MQ D      P  G++ S     TTS ++ 
Sbjct: 760  VIENKGNQEEF---KISSQDTELREEQSKRMQNDTSGTPHPSSGTNESQQGATTTSSLIL 816

Query: 1567 GQIPG 1581
            G  PG
Sbjct: 817  GS-PG 820


>ref|XP_007131393.1| hypothetical protein PHAVU_011G009900g [Phaseolus vulgaris]
            gi|561004393|gb|ESW03387.1| hypothetical protein
            PHAVU_011G009900g [Phaseolus vulgaris]
          Length = 1314

 Score = 65.9 bits (159), Expect = 5e-08
 Identities = 93/419 (22%), Positives = 139/419 (33%), Gaps = 6/419 (1%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGAPQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVHG 393
            VTGH SYPQ  PH  M  G PQ P+                                +  
Sbjct: 443  VTGHHSYPQPLPHPNMQTGVPQHPMHMHPQNGPQPQAQHSVQ---------------MQN 487

Query: 394  QHPNMPPGQQPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQTQFFPAQASGLVN 573
            Q P   P  +P  +                PP QQ     V+ H QQ    P Q +    
Sbjct: 488  QFPPQIPTMRPNQSHAIFPNQQSSVQGQTTPPLQQQ---PVYSHNQQ----PGQINQRPT 540

Query: 574  SQP----HQSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGLPSHQSQN 741
             QP     Q  PF Q Q +M   LRP G  P+                 S+ +   QSQN
Sbjct: 541  MQPVQQIPQQQPFAQHQMSMPSHLRPLG--PAHSFPKHVYSQSQGNIAPSNNIQHSQSQN 598

Query: 742  LPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHPQSSSEQQPIYAH 921
              GRPL+ NH                       +G + P+    N  P    +    Y H
Sbjct: 599  AGGRPLVPNH-----------------------AGHLQPFAQSANTIPVRHGQNGAGYLH 635

Query: 922  QSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADALSQKPEIKEENGFLATSSQGADS 1101
            ++    +GT + P Q  +++Q     A    E   D   Q+ E                +
Sbjct: 636  ENQKSLAGTNN-PVQLPSELQSR---APETIERHGDVGEQQTESAAGKLGKNLDIVSGSA 691

Query: 1102 VELKIPSSESKLKSVGVDEKAGNAYESSEPISDVKEISKSSQLLEKD--NSLLAKKDLEE 1275
             ELK    E+ LK + V    GN   + +P S    +  ++ +   D  N  L      E
Sbjct: 692  NELKSEKFEASLKPIEV----GNMQNNEDPHSIKTSVPNANAVENADSVNKNLGMGAAAE 747

Query: 1276 PKIKQMVKEEASGILEPLAGGIAAETETKDGEHVPFRSRPTENSQQEDKEIQEETLHKN 1452
               K  V  ++ G +     G+  ++     +   F+     N++ +  E + + LH +
Sbjct: 748  SNWKPAVSNKSGGAMH----GVQNDSNEHSVQGNEFQEGHPPNTETKLPESETDKLHND 802


>ref|XP_007131392.1| hypothetical protein PHAVU_011G009900g [Phaseolus vulgaris]
            gi|561004392|gb|ESW03386.1| hypothetical protein
            PHAVU_011G009900g [Phaseolus vulgaris]
          Length = 1288

 Score = 65.9 bits (159), Expect = 5e-08
 Identities = 93/419 (22%), Positives = 139/419 (33%), Gaps = 6/419 (1%)
 Frame = +1

Query: 214  VTGHQSYPQLHPHQQMPQGAPQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVHG 393
            VTGH SYPQ  PH  M  G PQ P+                                +  
Sbjct: 443  VTGHHSYPQPLPHPNMQTGVPQHPMHMHPQNGPQPQAQHSVQ---------------MQN 487

Query: 394  QHPNMPPGQQPLHTXXXXXXXXXXXXXXLHPPYQQPVAPQVHGHAQQTQFFPAQASGLVN 573
            Q P   P  +P  +                PP QQ     V+ H QQ    P Q +    
Sbjct: 488  QFPPQIPTMRPNQSHAIFPNQQSSVQGQTTPPLQQQ---PVYSHNQQ----PGQINQRPT 540

Query: 574  SQP----HQSGPFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGLPSHQSQN 741
             QP     Q  PF Q Q +M   LRP G  P+                 S+ +   QSQN
Sbjct: 541  MQPVQQIPQQQPFAQHQMSMPSHLRPLG--PAHSFPKHVYSQSQGNIAPSNNIQHSQSQN 598

Query: 742  LPGRPLMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHPQSSSEQQPIYAH 921
              GRPL+ NH                       +G + P+    N  P    +    Y H
Sbjct: 599  AGGRPLVPNH-----------------------AGHLQPFAQSANTIPVRHGQNGAGYLH 635

Query: 922  QSGIPQSGTESAPFQSATKIQVSSVLAAVKTELKADALSQKPEIKEENGFLATSSQGADS 1101
            ++    +GT + P Q  +++Q     A    E   D   Q+ E                +
Sbjct: 636  ENQKSLAGTNN-PVQLPSELQSR---APETIERHGDVGEQQTESAAGKLGKNLDIVSGSA 691

Query: 1102 VELKIPSSESKLKSVGVDEKAGNAYESSEPISDVKEISKSSQLLEKD--NSLLAKKDLEE 1275
             ELK    E+ LK + V    GN   + +P S    +  ++ +   D  N  L      E
Sbjct: 692  NELKSEKFEASLKPIEV----GNMQNNEDPHSIKTSVPNANAVENADSVNKNLGMGAAAE 747

Query: 1276 PKIKQMVKEEASGILEPLAGGIAAETETKDGEHVPFRSRPTENSQQEDKEIQEETLHKN 1452
               K  V  ++ G +     G+  ++     +   F+     N++ +  E + + LH +
Sbjct: 748  SNWKPAVSNKSGGAMH----GVQNDSNEHSVQGNEFQEGHPPNTETKLPESETDKLHND 802


>emb|CDJ43054.1| hypothetical protein, conserved [Eimeria tenella]
          Length = 1375

 Score = 61.6 bits (148), Expect = 1e-06
 Identities = 112/432 (25%), Positives = 141/432 (32%), Gaps = 28/432 (6%)
 Frame = +1

Query: 226  QSYPQLHPHQQMPQGAPQQPLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVHGQHPN 405
            Q  PQ  PHQQ  Q A QQP                                    + P 
Sbjct: 498  QQQPQQQPHQQPQQQAQQQPQQQPQHQPQQQPQ-----------------------RQPQ 534

Query: 406  MPPGQQPLHTXXXXXXXXXXXXXXLHPPYQQP-VAPQVHGHAQQTQFFPAQASGLVNSQP 582
             PP QQP H                  P+QQP   PQ H   Q  Q    Q   L   QP
Sbjct: 535  QPPHQQPQHQ-----------------PHQQPQQQPQPHPRRQLQQQPQQQPQPLPQQQP 577

Query: 583  HQSG-PFLQQQPAMQPPLRPQGHPPSXXXXXXXXXXXXXXXXXSHGLPSHQSQNLP-GRP 756
             Q   P  +Q+P   P  +PQ HP                       P HQ Q  P  RP
Sbjct: 578  QQRPLPLPRQRPQPLPRHQPQPHPQHQPQQQPQQQPQQHLQQQ----PQHQPQPQPRQRP 633

Query: 757  LMANHGLQHHQFQQTPSGPVGPHAKPMQSGAVLPYPPRTNAHPQSSSEQQPIYAHQSGIP 936
                    H Q Q  P     PH +P          P+    PQ   +QQP   H+S  P
Sbjct: 634  --------HQQPQPQPRQRQRPHQQPQ---------PQPQPQPQQQQQQQP--QHESQQP 674

Query: 937  QSGTE------------SAPFQSATKIQVSSVLAAVKTELKADALSQKPEIKEENGFLA- 1077
            Q   E              P    T  +V       KTE  ++  + K E K E    A 
Sbjct: 675  QQQQEPQREVKVQQGHLPLPQPRRTTQEVQLPQEEAKTEEGSEEAAAKSEEKSEEAEAAE 734

Query: 1078 ---TSSQGADSVELKIPSSESKL--KSVG------VDEKAGNAYESSEPISDVKEISKSS 1224
               T ++   +      ++ES L   SVG      ++ +A    E  EP  +V    +  
Sbjct: 735  GKRTGTRRRKAAAHVKQNAESLLARHSVGAVPPFPLEAEAAATQEQQEP--EVDYAQQQQ 792

Query: 1225 QLLEKDNSLLAKKDLE-EPKIKQMVKEEASGILEPLAGGIAAETETKDGEHVPFRSRPTE 1401
             LL      LAKKDL   P+  Q     AS IL  L   +        GE     SR  E
Sbjct: 793  NLLR-----LAKKDLGCLPQELQQDVSTASSILSALRREV--------GEQRDRLSRLEE 839

Query: 1402 NSQQEDKEIQEE 1437
             +  ++K I E+
Sbjct: 840  KAANDEKNISEK 851

