BLASTX nr result

ID: Sinomenium21_contig00024826 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00024826
         (1428 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007210196.1| hypothetical protein PRUPE_ppa017129mg [Prun...   161   6e-37
ref|XP_004301230.1| PREDICTED: uncharacterized protein LOC101310...   142   5e-31
ref|XP_006372627.1| hypothetical protein POPTR_0017s03370g [Popu...   134   8e-29
ref|XP_006441430.1| hypothetical protein CICLE_v10018551mg [Citr...   134   1e-28
emb|CAN81192.1| hypothetical protein VITISV_022847 [Vitis vinifera]   133   2e-28
ref|XP_006493429.1| PREDICTED: uncharacterized protein LOC102612...   133   2e-28
emb|CBI26413.3| unnamed protein product [Vitis vinifera]              131   6e-28
ref|XP_002305691.2| hypothetical protein POPTR_0004s06730g [Popu...   125   4e-26
ref|XP_007029359.1| Uncharacterized protein isoform 2 [Theobroma...   124   1e-25
ref|XP_007029358.1| Uncharacterized protein isoform 1 [Theobroma...   124   1e-25
ref|XP_007152541.1| hypothetical protein PHAVU_004G138800g [Phas...    92   7e-16
ref|XP_006847866.1| hypothetical protein AMTR_s00029p00086500 [A...    86   5e-14
ref|XP_002516352.1| hypothetical protein RCOM_1402790 [Ricinus c...    71   1e-09
ref|XP_004137638.1| PREDICTED: uncharacterized protein LOC101212...    69   7e-09

>ref|XP_007210196.1| hypothetical protein PRUPE_ppa017129mg [Prunus persica]
            gi|462405931|gb|EMJ11395.1| hypothetical protein
            PRUPE_ppa017129mg [Prunus persica]
          Length = 1056

 Score =  161 bits (408), Expect = 6e-37
 Identities = 141/489 (28%), Positives = 219/489 (44%), Gaps = 27/489 (5%)
 Frame = -1

Query: 1416 SKDIGIDSA----NEIQMTSKVLPTLDWEEDCNQKNTSYCNDISSKVISDVSNATILDLI 1249
            S ++GI S     N++ +     P  D E D      S  +D+ ++  SD+ ++ +LD +
Sbjct: 159  SDEVGIPSIGNFENQLLLKDSGFPIFD-EVDGIHTQVSCYSDMYTRGYSDMHDSFVLDSM 217

Query: 1248 SDGWTSDASASAGDYTEEKSTIKEXXXXXXXXXXXXXXXXXXXGKGLLFQENLSNVDVNA 1069
            S G  S  S +AG   +EK   KE                    KG    +   N  V+ 
Sbjct: 218  SIGSNSGDSINAGH--DEKHAEKEIFKIDISKPPGLSSG-----KGRFSCQRFLNDVVDN 270

Query: 1068 DDQTERTKCDSQGCSSSDFHLVLYGKRGRQGRKLCGSSTGMDRFNSGANVHGRCGKENNN 889
             D TE  +   QGC S+D  LV+  KR +Q  K+   +  + +F S  N+H R GKENN+
Sbjct: 271  YDHTEEARHGIQGCRSNDMQLVVPNKRSKQN-KVAPRTANVSKFGSNGNLHIRIGKENNH 329

Query: 888  SVWQKVQRNDVEGCDFQSNNAPHVSSRI-----------------XXXXXXXXXXXXXXX 760
            SVWQKVQRND   C  +   A  V SR+                                
Sbjct: 330  SVWQKVQRNDSSDCTGELKKASSVYSRLDLPLREAPLLKRTSNVADVNAFSKSEDKKQQK 389

Query: 759  XXXAEKMKRKPDAGQKQEHSCYSRKRPPACKTNSSGATRINIQQKEVLEIPSEVNQHKGN 580
               ++K+KRK     KQE++ YSRK   A      G  +  + Q ++L+I S++   K  
Sbjct: 390  DKVSKKLKRKTGPPLKQEYNFYSRKGSHASIAGLDGCAKARMDQNDILDISSQLKDKKSL 449

Query: 579  LDGSRSHCPIEPPGGGFDQSCRVDLSLSESIQDSQVCLDDTKPIDSVSKIFNMEGANKNS 400
               SRS  P   P GG+ QS +V+   SES+ + ++C ++    +SV         NKNS
Sbjct: 450  SLVSRSCSPPSCPRGGY-QSSKVECMTSESVHNMKLCQNEMDHFESVCV------GNKNS 502

Query: 399  PSSGTENTLDQKQFLGVHSDGNVYLP---INSVPARLQAEISHTENGKQD-HHSGPVLQK 232
                  ++L +   L V S   VYLP    N+    +Q E+S  E+ +Q+   SG +  K
Sbjct: 503  SVQRKWDSLSESNLLQVQSP--VYLPHLLCNATSQEVQKEVSLAESSRQNSSSSGSLKHK 560

Query: 231  WIPVVRKDAETTAANGSGNLLTSHLDESAGNESKVMDEEELSL--SAQSFVPLVDVSVES 58
            W+P+  K+   T++  SG+    H DE+A     + D  + ++  + Q+ V  V V    
Sbjct: 561  WMPIGSKNPGLTSSTRSGSSSLEHSDEAASKRWALKDPAKGNVVSNTQNLVSKVAVGCTG 620

Query: 57   MASSGDISC 31
              +S D++C
Sbjct: 621  Q-NSEDVTC 628


>ref|XP_004301230.1| PREDICTED: uncharacterized protein LOC101310807 [Fragaria vesca
            subsp. vesca]
          Length = 1194

 Score =  142 bits (357), Expect = 5e-31
 Identities = 126/445 (28%), Positives = 198/445 (44%), Gaps = 22/445 (4%)
 Frame = -1

Query: 1389 NEIQMTSKVLPTLDWEEDCNQKNTSYCNDISSKVISDVSNATILDLISDGWTSDASASAG 1210
            N+I +     P LD  E  +    S C+D+ +K  S++ ++ ILD IS G  SD S + G
Sbjct: 290  NQIILKDSAFPILDGVEGIHHTKASDCSDLYTKGYSEMHDSFILDSISIGSNSDGSINLG 349

Query: 1209 DYTEEKSTIKEXXXXXXXXXXXXXXXXXXXGKGLLFQENLSNVDVNADDQTERTKCDSQG 1030
               +EK   KE                    K    +++  N  VN  + TE  +  + G
Sbjct: 350  H--DEKHADKEIYNTDISEPPNSNSR-----KVYFTRQSSLNDFVNTYNHTEGARQCTHG 402

Query: 1029 CSSS-DFHLVLYGKRGRQGRKLCGSSTGMDRFNSGANVHGRCGKENNNSVWQKVQRNDVE 853
            CSSS D   V+  KR RQ  K+   S  + +  S  N+  R GKEN +SVWQKVQ+ND  
Sbjct: 403  CSSSTDMKYVVPNKRSRQN-KVGQRSANVPKSGSVGNM--RTGKENIHSVWQKVQKNDAN 459

Query: 852  GCDFQSNNAPHVSSR-----------------IXXXXXXXXXXXXXXXXXXAEKMKRKPD 724
             C  +   A  V SR                 +                  ++K+KR+  
Sbjct: 460  DCTGELKTASSVYSRLDLPLKEAPMINRTCNSVDIDVFLKSENRKQQKDKVSKKLKRRNA 519

Query: 723  AGQKQEHSCYSRKRPPACKTNSSGATRINIQQKEVLEIPSEVNQHKGNLDGSRSHCPIEP 544
               K+E+ CYSRK   A    S G+ ++ + Q ++ +I ++    KG    S S      
Sbjct: 520  PALKREYRCYSRKGSHASLAGSDGSLKLRMDQSDISDILTQAKDKKGLSLVSTSCSQPSC 579

Query: 543  PGGGFDQSCRVDLSLSESIQDSQVCLDDTKPIDSVSKIFNMEGANKNSPSSGTENTLDQK 364
            P  GF Q+ +V+   SES+Q  Q+C ++   +++V K  ++     N  + G ++   QK
Sbjct: 580  PTAGF-QTSKVECK-SESVQSMQLCPNEIGHLENVCKTVSV----MNDQNVGNDDGSMQK 633

Query: 363  QFLGVHSDGNVYLP---INSVPARLQAEISHTENGKQDH-HSGPVLQKWIPVVRKDAETT 196
                +     VYLP    ++    +Q +IS  E+ KQ+   SG + QKW+P+  KD+E  
Sbjct: 634  MSNLLQMQSLVYLPHLLHDAASQEVQRQISLAESSKQNRSSSGSLTQKWMPIGLKDSELA 693

Query: 195  AANGSGNLLTSHLDESAGNESKVMD 121
            ++  S +    H DE A     + D
Sbjct: 694  SSTRSESSSLEHSDEGASKRWTIKD 718


>ref|XP_006372627.1| hypothetical protein POPTR_0017s03370g [Populus trichocarpa]
            gi|550319256|gb|ERP50424.1| hypothetical protein
            POPTR_0017s03370g [Populus trichocarpa]
          Length = 1122

 Score =  134 bits (338), Expect = 8e-29
 Identities = 131/451 (29%), Positives = 199/451 (44%), Gaps = 13/451 (2%)
 Frame = -1

Query: 1320 TSYCNDISSKVISDVSNATIL-DLISDGWTSDASASAGDYTEEKSTIKEXXXXXXXXXXX 1144
            TS CND  SK  S  S+++++ D +S G  SD      D T +   +K            
Sbjct: 293  TSCCNDTQSKDFSYASDSSLVFDYLSIGSNSD------DGTNDSHHVKTYHEGSSRGSVL 346

Query: 1143 XXXXXXXXGKGLLFQENLSNVDVNADDQTERTKCDSQGCSSSDFHLVLYGKRGRQGRKLC 964
                     KG L  +N  N  V+   QTE +K   Q  S SD  L++ GK+G+Q + L 
Sbjct: 347  EAPGFNSK-KGSLSHKNSLNGAVDTYHQTEGSKHRGQNFSCSDAQLLMSGKKGKQIKTLP 405

Query: 963  GSSTGMDRFNSGANVHGRCGKENNNSVWQKVQRND-VEGCD--FQSNNAPHVSSRIXXXX 793
             SS    ++    N+HGR GKENN+SVW+KVQRND  + C    + ++A  +S       
Sbjct: 406  RSSASAHKYGGFENLHGRTGKENNHSVWKKVQRNDTADECSPKMKMSHACFLSD------ 459

Query: 792  XXXXXXXXXXXXXXAEKMKRKPDAGQKQEHSCYSRKRPPACKTNSSGATRINIQQKEVLE 613
                             +K           S    K+ P  K   +   +  +QQ E+ +
Sbjct: 460  ---------LTLKEGPSLKGNCTLSDVNSSSRTEGKKLPKDKAILNAHAKTGVQQHEIFD 510

Query: 612  IPSEVNQHKGNLDGSRSHCPIEPPGGGFDQSCRVDLSLSESIQDSQVCLDDTKPI----D 445
            + ++VN  KG    SR+H        GF  S  V+   SES+  +QV  D  +P+    D
Sbjct: 511  LTAQVNDKKGGKSISRTHSLNSCLTAGFHPS-GVECMNSESVNSTQVSPDALQPLQSTCD 569

Query: 444  SVSKIFNMEGANKNSPSSGTENTLDQKQFLGVHSDGNVYLP---INSVPARLQAEISHTE 274
            +VS   +    N  S  +   N+L+Q           VYLP    N VP +L+ E++  E
Sbjct: 570  TVSSTRHCHTENGGSLPAKLCNSLEQHAV----KVPPVYLPHLFFNKVP-QLEKEVTVAE 624

Query: 273  NGKQDHHSGPVLQKWIPVVRKDAETTAANGSGNLLTSHLDESAGNESKVMD-EEELSLSA 97
              KQ+H S  V+QKWIP+  KD E T +   GN      D  AG +  + + +++ +  +
Sbjct: 625  YCKQNHSSVTVMQKWIPIGVKDPELTTSARFGNSSPDPSDGPAGEDLTLRNVQDKANFDS 684

Query: 96   QSFVPLVDVSVESMASSGDISC-PALNDECQ 7
            Q  V    + + +   SG+  C P  +D  Q
Sbjct: 685  QDLVS--SLMLGTCQDSGNAVCFPQEDDRIQ 713


>ref|XP_006441430.1| hypothetical protein CICLE_v10018551mg [Citrus clementina]
            gi|557543692|gb|ESR54670.1| hypothetical protein
            CICLE_v10018551mg [Citrus clementina]
          Length = 1229

 Score =  134 bits (336), Expect = 1e-28
 Identities = 128/491 (26%), Positives = 218/491 (44%), Gaps = 29/491 (5%)
 Frame = -1

Query: 1425 SFVSKDIGIDSANEIQMTSKVLPTLDWEEDCNQKNTSYCNDISSKVISDVSNATILDLIS 1246
            SF  +    DS   +QM  +   T    E+ +    S  + I S   SD+++  + D +S
Sbjct: 317  SFAGEHPLTDSKMMVQMEDQGSVTDGGVEEQHPLRISCYDAIHSNGFSDMNDCRVRDSVS 376

Query: 1245 DGWTSDASASAGDYTE------EKSTIKEXXXXXXXXXXXXXXXXXXXGKGLLFQENLSN 1084
             G  SD S SA  YT+       KS+  E                    KG     NL +
Sbjct: 377  IGSNSDNSTSASFYTKPYGRESNKSSFSESVDSRSR-------------KGSFSPLNLLS 423

Query: 1083 VDVNADDQTERTKCDSQGCSSSDFHLVLYGKRGRQGRKLCGSSTGMDRFNSGANVHGRCG 904
              V+  D +E  +  +QG + SD  + + GK  ++ + + GSS  + +     N     G
Sbjct: 424  SVVDFCDYSEGKRYVNQGLNHSDMQVAVPGKWNKKAKMVPGSSNAL-KPRGARNSRISAG 482

Query: 903  KENNNSVWQKVQRNDVEGCDFQSNNAPHVSSRIXXXXXXXXXXXXXXXXXXAE------- 745
            KEN++ VWQKVQ+ND   C+ +S  A  V S+                            
Sbjct: 483  KENSHCVWQKVQKNDANKCNSESRKANAVCSQFLGTVKESSLLKRNSDMTYVNIPSKSED 542

Query: 744  ----------KMKRKPDAGQKQEHSCYSRKRPPACKTNSSGATRINIQQKEVLEIPSEVN 595
                      K+KRK   G K E++ YS++   + K +++  ++I  QQ E+ ++ +++N
Sbjct: 543  KKQLRDKAPRKLKRKISPGSKHEYNSYSQRAMYSSKASANARSKIGSQQNEIRDVSAQLN 602

Query: 594  QHKGNLDGSRSHCPIEPPGGGFDQSCRVDLSLSESIQDSQVCLDDTKPIDSVSKIFNMEG 415
                      S   +  P     QS +V+   SES   SQ C  + +  + VS   +   
Sbjct: 603  NQTRVSSAPSSCSDVGSPEFEL-QSSKVESLNSESSHSSQDCPKNLESTERVSGAVSALK 661

Query: 414  ANKNSPSSGTENTLDQKQFLGVHSDGNVYLP--INSVPARLQAEISHTENGKQDHHSGPV 241
             +++SP + +  +LD+   L V S   + LP  I +  A+ + + S  E+GKQDH SG  
Sbjct: 662  EHQDSPLAKSCYSLDKMNMLEVPSP--ICLPHLIFNEVAQTEKDESLAEHGKQDHISGSP 719

Query: 240  LQKWIPVVRKDAETTAANGSGNLLTSHLDESAGNE----SKVMDEEELSLSAQSFVPLVD 73
            +QKWIP+  K++++T +   G+L  +H D   G E     K  D++  S ++Q+ +  ++
Sbjct: 720  VQKWIPIGTKNSQSTFSASCGSLQLAHAD-GKGTEYWTLRKNFDKKSAS-NSQNLISSLN 777

Query: 72   VSVESMASSGD 40
            V + SM  + +
Sbjct: 778  VGMMSMGLNSE 788


>emb|CAN81192.1| hypothetical protein VITISV_022847 [Vitis vinifera]
          Length = 1239

 Score =  133 bits (335), Expect = 2e-28
 Identities = 120/450 (26%), Positives = 195/450 (43%), Gaps = 7/450 (1%)
 Frame = -1

Query: 1341 EDCNQKNTSYCNDISSKVISDVSNATILDLISDGWTSDASASAGDYTEEKSTIKEXXXXX 1162
            ED + +    C+D+SSK  SD+ ++ +L  +S G +S+ S +AG      +         
Sbjct: 402  EDKHGETIHCCDDMSSKGFSDMPDSLVLGSVSVGCSSEDSPNAGYDDSTDAGYNVSPSNE 461

Query: 1161 XXXXXXXXXXXXXXGKGLLFQENLSNVDVNADDQTERTKCDSQGCSSSDFHLVLYGKRGR 982
                                +++ SN  V++ +  +R K  S GCSSSD  L   GKR +
Sbjct: 462  QGSGISDSEAHQSTRNECFSRQSPSNGVVDSCNNADRMKLHSAGCSSSDIQLDARGKRDK 521

Query: 981  QGRKLCGSSTGMDRFNSGANVHGRCGKENNNSVWQKVQRNDVEGCDFQSN-NAPHVSSRI 805
            Q + +              N HG  GKEN      ++ +   E   F+ N N  +++S+ 
Sbjct: 522  QAKMVV------------ENXHGCVGKENVGCF--QLDKTLKEAPLFKRNCNNANIASK- 566

Query: 804  XXXXXXXXXXXXXXXXXXAEKMKRKPDAGQKQEHSCYSRKRPPACKTNSSGATRINIQQK 625
                                K K+    G KQE++C+SRKR  A K +S+   RINIQ+ 
Sbjct: 567  -------SEDKNRSXVKVHRKSKKNSSPGSKQEYNCHSRKRSLAMKASSNAPARINIQEN 619

Query: 624  EVLEIPSEVNQHKGNLDGSRSHCPIEPPGGGFDQSCRVDLSLSESIQDSQVCLDDTKPID 445
            E+   P   N  KG+   S+S+   + P     Q+ RV+   SE +   Q C  + +P +
Sbjct: 620  EMSVFPVLWNGQKGSGSISQSYSQNDCPEPEL-QTQRVESITSELVHSLQDCTGNLEPPE 678

Query: 444  SVSKIFNMEGANKNSPSSGTENTLDQKQFLGVH---SDGNVYLPINSVPARLQAEISHTE 274
              S I NM+       ++    +LD      +H   S  +++  I    A +  E+  +E
Sbjct: 679  RCSTISNMKDHITEGQNNSLLESLDSLNMSSLHEGQSAVHLHPLIGEEVAEVDKEVYLSE 738

Query: 273  NGKQDHHSGPVLQKWIPVVRKDAETTAANGSGNLLTSHLDESAGNESKVMDEEELSLSAQ 94
            N KQ+H S  V++KW PV +K++   +   S   L +H DE A       +  E   S+ 
Sbjct: 739  NSKQEHSSASVMKKWKPVAKKNSGFASLGRSDISLLAHADEPAAEGWTPKNSVEEKASSN 798

Query: 93   SFVPLVDVSVESMA---SSGDISCPALNDE 13
            S  P+     E M    S G+ +C +  D+
Sbjct: 799  SHKPISSNDSEIMCVDHSFGNANCSSPEDK 828


>ref|XP_006493429.1| PREDICTED: uncharacterized protein LOC102612440 [Citrus sinensis]
          Length = 1232

 Score =  133 bits (334), Expect = 2e-28
 Identities = 127/484 (26%), Positives = 213/484 (44%), Gaps = 27/484 (5%)
 Frame = -1

Query: 1425 SFVSKDIGIDSANEIQMTSKVLPTLDWEEDCNQKNTSYCNDISSKVISDVSNATILDLIS 1246
            SF  +    DS   +QM  +   T    E+ +    S  + I S   SD+++  + D +S
Sbjct: 317  SFAGEHPLTDSKMMVQMEDQGSVTDGGVEEQHPLRISCYDAIHSNGFSDMNDCRVRDSVS 376

Query: 1245 DGWTSDASASAGDYTE------EKSTIKEXXXXXXXXXXXXXXXXXXXGKGLLFQENLSN 1084
             G  SD S SA  YT+       KS+  E                    KG     NL +
Sbjct: 377  IGSNSDNSTSASFYTKPYGRESNKSSFSESVDSRSR-------------KGSFSPLNLLS 423

Query: 1083 VDVNADDQTERTKCDSQGCSSSDFHLVLYGKRGRQGRKLCGSSTGMDRFNSGANVHGRCG 904
              V+  D +E  +  +QG + SD  + +  K  ++ + + GSS  + +     N     G
Sbjct: 424  SVVDFCDYSEGKRYVNQGLNHSDMQVAVPRKWNKKAKMVPGSSNAL-KPRGARNSRISAG 482

Query: 903  KENNNSVWQKVQRNDVEGCDFQSNNAPHVSSRIXXXXXXXXXXXXXXXXXXAE------- 745
            KEN++ VWQKVQ+ND   C+ +S     V S+                            
Sbjct: 483  KENSHCVWQKVQKNDANKCNSESRKENAVCSQFLGAVKESSSLKRNSDMTDVNIPSKSED 542

Query: 744  ----------KMKRKPDAGQKQEHSCYSRKRPPACKTNSSGATRINIQQKEVLEIPSEVN 595
                      K+KRK   G K E++ YSR+   + K +S+  ++I  QQ E+L++ +++N
Sbjct: 543  KKQLRDKAPRKLKRKISPGSKHEYNSYSRRAMYSSKASSNARSKIGSQQNEILDVSAQLN 602

Query: 594  QHKGNLDGSRSHCPIEPPGGGFDQSCRVDLSLSESIQDSQVCLDDTKPIDSVSKIFNMEG 415
                      S   +  P     QS +V+   SES   SQ C  + +  + VS   +   
Sbjct: 603  NQTRVSSAPSSCSDVGAPEFEL-QSSKVESLNSESSHSSQDCPKNLESTERVSGAVSALK 661

Query: 414  ANKNSPSSGTENTLDQKQFLGVHSDGNVYLPINSVPARLQAEISHTENGKQDHHSGPVLQ 235
             +++SP + +  +LD+   L V S   +   I +  A+ + + S  E+GKQDH SG  +Q
Sbjct: 662  EHQDSPLAKSCYSLDKMNMLEVPSPICLPRLIFNEVAQTEKDESLAEHGKQDHISGSPVQ 721

Query: 234  KWIPVVRKDAETTAANGSGNLLTSHLDESAGNE----SKVMDEEELSLSAQSFVPLVDVS 67
            KWIP+  K +++T +   G+L  +H D   G E     K +D++  S ++Q+ +  ++V 
Sbjct: 722  KWIPIGTKGSQSTFSASCGSLQLAHAD-GKGTEYWTLRKNIDKKSAS-NSQNLISSLNVG 779

Query: 66   VESM 55
            + SM
Sbjct: 780  MMSM 783


>emb|CBI26413.3| unnamed protein product [Vitis vinifera]
          Length = 1067

 Score =  131 bits (330), Expect = 6e-28
 Identities = 117/449 (26%), Positives = 190/449 (42%), Gaps = 6/449 (1%)
 Frame = -1

Query: 1341 EDCNQKNTSYCNDISSKVISDVSNATILDLISDGWTSDASASAGDYTEEKSTIKEXXXXX 1162
            ED + +    C+D+SSK  SD+ ++ +L  +S G +S+ S +AG      +         
Sbjct: 265  EDKHGERIHCCDDMSSKGFSDMPDSLVLGSVSVGCSSEDSPNAGYDDSTDAGYNVSPSNE 324

Query: 1161 XXXXXXXXXXXXXXGKGLLFQENLSNVDVNADDQTERTKCDSQGCSSSDFHLVLYGKRGR 982
                                +++ SN  V++ +  +R K  S GCSSSD  L   GKR +
Sbjct: 325  QGSGISDSEAHQSTRNECFSRQSPSNGVVDSCNNADRMKLHSAGCSSSDIQLDARGKRDK 384

Query: 981  QGRKLCGSSTGMDRFNSGANVHGRCGKENNNSVWQKVQRNDVEGCDFQSNNAPHVSSRIX 802
            Q + +              N HG  GKEN           +        NNA +++S+  
Sbjct: 385  QAKMVV------------ENAHGCVGKENVGCFQLDKTLKEAPLLKRNCNNA-NIASK-- 429

Query: 801  XXXXXXXXXXXXXXXXXAEKMKRKPDAGQKQEHSCYSRKRPPACKTNSSGATRINIQQKE 622
                               K K+    G KQE++C+SRKR  A K +S+   RINIQ+ E
Sbjct: 430  ------SEDKNRSRVKVHRKSKKNSSPGSKQEYNCHSRKRSLAMKASSNAPARINIQENE 483

Query: 621  VLEIPSEVNQHKGNLDGSRSHCPIEPPGGGFDQSCRVDLSLSESIQDSQVCLDDTKPIDS 442
            +   P   N  KG+   S+S+   + P     Q+  V+   SE +   Q C  + +P + 
Sbjct: 484  MSVFPVLWNGQKGSGSISQSYSQNDCPEPEL-QTHGVESITSELVHSLQDCTGNLEPPER 542

Query: 441  VSKIFNMEGANKNSPSSGTENTLDQKQFLGVH---SDGNVYLPINSVPARLQAEISHTEN 271
             S I NM+       ++    +LD      +H   S  +++  +    A +  E+S +EN
Sbjct: 543  CSTISNMKDHITEGQNNSLLESLDSLNMSSLHEGQSAVHLHPLLGEEVAEVDKEVSLSEN 602

Query: 270  GKQDHHSGPVLQKWIPVVRKDAETTAANGSGNLLTSHLDESAGNESKVMDEEELSLSAQS 91
             KQ+H S  V++KW PV +K++   +   S   L +H DE A       +  E   S+ S
Sbjct: 603  SKQEHSSASVMKKWKPVAKKNSGFASLGRSDISLLAHADEPAAEGWTPKNSVEEKPSSNS 662

Query: 90   FVPLVDVSVESMA---SSGDISCPALNDE 13
              P+     E M    S G+ +C +  D+
Sbjct: 663  HKPISSNDSEIMCVDHSFGNANCSSPEDK 691


>ref|XP_002305691.2| hypothetical protein POPTR_0004s06730g [Populus trichocarpa]
            gi|550340470|gb|EEE86202.2| hypothetical protein
            POPTR_0004s06730g [Populus trichocarpa]
          Length = 1132

 Score =  125 bits (315), Expect = 4e-26
 Identities = 127/458 (27%), Positives = 197/458 (43%), Gaps = 23/458 (5%)
 Frame = -1

Query: 1317 SYCNDISSKVISDVSNAT-ILDLISDGWTSDASASAGDYTEEKSTIKEXXXXXXXXXXXX 1141
            S C+D  SK  S   +++ +LD +S G  SD   + G Y  +                  
Sbjct: 280  SCCDDKQSKDFSYAPDSSLVLDYVSIGSNSDDDPN-GSYRSKP------FHEASSRGSVL 332

Query: 1140 XXXXXXXGKGLLFQENLSNVDVNADDQTERTKCDSQGCSSSDFHLVLY--GKRGRQGRKL 967
                    KG L  +N  N  V+    TE +K  SQ  SSSD  L++    K+G+Q + L
Sbjct: 333  EAPGCNSRKGSLSYKNSFNGVVDTYHHTEGSKHGSQNFSSSDAQLLISRSSKKGKQIKAL 392

Query: 966  CGSSTGMDRFNSGANVHGRCGKENNNSVWQKVQRNDVEG----------CDFQSNNAPHV 817
               S G  ++    N+H R GKE N+SVW+KVQRN V+            D      P +
Sbjct: 393  -PRSAGAHKYGGFGNLHVRAGKEINHSVWKKVQRNGVDTETKISPVCFQSDMSLKETPSL 451

Query: 816  SSRIXXXXXXXXXXXXXXXXXXAE---KMKRKPDAGQKQEHSCYSRKRPPACKTNSSGAT 646
                                   +   K+KRK   G K ++SC+ R    + K + +   
Sbjct: 452  KRNCIVAEVNTVSRTENKKLLKDKVSKKLKRKNSLGSKLDYSCHGRGHS-SNKASFNTRA 510

Query: 645  RINIQQKEVLEIPSEVNQHKGNLDGSRSHCPIEPPGGGFDQSCRVDLSLSESIQDSQVCL 466
            +  ++Q E   + +EV+  KG    SR+H        GF  S RV+ + SES+   QV  
Sbjct: 511  KTGMRQDETFGLTAEVDDQKGGKSISRTHSMNTCLMVGFQPS-RVECANSESVNSLQVFP 569

Query: 465  DDTKPI----DSVSKIFNMEGANKNSPSSGTENTLDQKQFLGVHSDGNVYLP--INSVPA 304
            D  +P+    D+VS   +    N+ +  +   N LDQ     +     VYLP    +   
Sbjct: 570  DALQPLQSTYDAVSSPRHHHSENQGNSPAKLSNLLDQN---ALKVPPPVYLPHLFFNKGL 626

Query: 303  RLQAEISHTENGKQDHHSGPVLQKWIPVVRKDAETTAANGSGNLLTSHLDESAGNESKVM 124
            +++ EI+  E+ KQ+H SG V+QKWIP+  +D+E   +   GN L    D  A  +  + 
Sbjct: 627  QMEKEITLAEHCKQNHSSGSVMQKWIPIGVRDSELATSARFGNSLPDPSDRPAREDFTLR 686

Query: 123  D-EEELSLSAQSFVPLVDVSVESMASSGDISCPALNDE 13
            + +E  S  +Q  V      + +   SG+ SC    D+
Sbjct: 687  NVQENASFDSQDLVS--SSLLGTCQGSGNASCSPKEDD 722


>ref|XP_007029359.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508717964|gb|EOY09861.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 1182

 Score =  124 bits (311), Expect = 1e-25
 Identities = 136/455 (29%), Positives = 199/455 (43%), Gaps = 27/455 (5%)
 Frame = -1

Query: 1302 ISSKVISDVSNATILDLISDGWTSDASASAGDYTEEKSTIKEXXXXXXXXXXXXXXXXXX 1123
            I  +  SD+ ++ +LD +S G +S+ S SA    +      E                  
Sbjct: 345  IHQEDFSDLHDSLVLDSVSVGSSSEESMSASHIVKPFDNSHENSQSEAPGSNTK------ 398

Query: 1122 XGKGLLFQENLSNVDVNADDQTERTKCDSQGCSSSDFHLVLYGKRGRQGRKLCGSSTGMD 943
              KG  + +N         D T+  K      SS D  ++  GKRG+Q + + GSS+   
Sbjct: 399  --KGSFYHQNSLCSISETHDYTQGPK-HGLDFSSCDVQMIASGKRGKQFKSVPGSSSTC- 454

Query: 942  RFNSGANVHGRCGKENNNSVWQKVQRNDVEGC--------------DFQSNNAPHV---S 814
            +  S  N+HG  G EN++SVWQ+VQR+ VE C              D  + +AP +   S
Sbjct: 455  KLGSIGNLHGGMGTENSHSVWQRVQRHGVEKCNTELKKASPICSGSDVTAKDAPLLKRSS 514

Query: 813  SRIXXXXXXXXXXXXXXXXXXAEKMKRKPDAGQKQEHSCYSRKRPPACKTNSSGATRINI 634
            +                      K+KRK     KQE S  SRK     K N +   + + 
Sbjct: 515  NAANETTLSGTNDKRKLKDKVPRKLKRKVSPASKQEKSSCSRKGSHPNKVNLNAHAKTSS 574

Query: 633  QQK-EVLEIPSEVNQHKGNLDGSRSHCPIEPPGGGFDQSCRVDLSLSESIQDSQVCLDDT 457
             QK E+L++ + +N  +   + SRS   +     GF    RV+   SES+ + QV     
Sbjct: 575  MQKDEMLDVLTALNDQRVIKNVSRSCAQL-----GF---ARVETMKSESLNNLQVSPGSM 626

Query: 456  KPIDSV----SKIFNMEGANKNSPSSGTENTLDQKQFLGVHSDGNVYLP---INSVPARL 298
            +P +SV    S + N    N++S    +   LDQ     V +   VYLP   +N V AR 
Sbjct: 627  EPCESVCDAASGLNNQCIENQDSLLKKSCVPLDQPNLHEVRAP--VYLPHLMVNGV-ART 683

Query: 297  QAEISHTENGKQDHHSGPVLQKWIPVVRKDAETTAANGSGNLLTSHLD--ESAGNESKVM 124
            + E S  E GKQ H SG VLQKWIPV  KD   T +  S +L T H +  E+     K  
Sbjct: 684  EKEFSLAEYGKQSHSSGSVLQKWIPVGIKDPGFTTSVRSASLSTEHSNGPEAEDWTFKNK 743

Query: 123  DEEELSLSAQSFVPLVDVSVESMASSGDISCPALN 19
             EE+++  AQ+    VD    +M S G  S  A++
Sbjct: 744  FEEKVAPCAQNLSSSVDAG--TMCSIGKDSGHAIS 776


>ref|XP_007029358.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508717963|gb|EOY09860.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1222

 Score =  124 bits (311), Expect = 1e-25
 Identities = 136/455 (29%), Positives = 199/455 (43%), Gaps = 27/455 (5%)
 Frame = -1

Query: 1302 ISSKVISDVSNATILDLISDGWTSDASASAGDYTEEKSTIKEXXXXXXXXXXXXXXXXXX 1123
            I  +  SD+ ++ +LD +S G +S+ S SA    +      E                  
Sbjct: 350  IHQEDFSDLHDSLVLDSVSVGSSSEESMSASHIVKPFDNSHENSQSEAPGSNTK------ 403

Query: 1122 XGKGLLFQENLSNVDVNADDQTERTKCDSQGCSSSDFHLVLYGKRGRQGRKLCGSSTGMD 943
              KG  + +N         D T+  K      SS D  ++  GKRG+Q + + GSS+   
Sbjct: 404  --KGSFYHQNSLCSISETHDYTQGPK-HGLDFSSCDVQMIASGKRGKQFKSVPGSSSTC- 459

Query: 942  RFNSGANVHGRCGKENNNSVWQKVQRNDVEGC--------------DFQSNNAPHV---S 814
            +  S  N+HG  G EN++SVWQ+VQR+ VE C              D  + +AP +   S
Sbjct: 460  KLGSIGNLHGGMGTENSHSVWQRVQRHGVEKCNTELKKASPICSGSDVTAKDAPLLKRSS 519

Query: 813  SRIXXXXXXXXXXXXXXXXXXAEKMKRKPDAGQKQEHSCYSRKRPPACKTNSSGATRINI 634
            +                      K+KRK     KQE S  SRK     K N +   + + 
Sbjct: 520  NAANETTLSGTNDKRKLKDKVPRKLKRKVSPASKQEKSSCSRKGSHPNKVNLNAHAKTSS 579

Query: 633  QQK-EVLEIPSEVNQHKGNLDGSRSHCPIEPPGGGFDQSCRVDLSLSESIQDSQVCLDDT 457
             QK E+L++ + +N  +   + SRS   +     GF    RV+   SES+ + QV     
Sbjct: 580  MQKDEMLDVLTALNDQRVIKNVSRSCAQL-----GF---ARVETMKSESLNNLQVSPGSM 631

Query: 456  KPIDSV----SKIFNMEGANKNSPSSGTENTLDQKQFLGVHSDGNVYLP---INSVPARL 298
            +P +SV    S + N    N++S    +   LDQ     V +   VYLP   +N V AR 
Sbjct: 632  EPCESVCDAASGLNNQCIENQDSLLKKSCVPLDQPNLHEVRAP--VYLPHLMVNGV-ART 688

Query: 297  QAEISHTENGKQDHHSGPVLQKWIPVVRKDAETTAANGSGNLLTSHLD--ESAGNESKVM 124
            + E S  E GKQ H SG VLQKWIPV  KD   T +  S +L T H +  E+     K  
Sbjct: 689  EKEFSLAEYGKQSHSSGSVLQKWIPVGIKDPGFTTSVRSASLSTEHSNGPEAEDWTFKNK 748

Query: 123  DEEELSLSAQSFVPLVDVSVESMASSGDISCPALN 19
             EE+++  AQ+    VD    +M S G  S  A++
Sbjct: 749  FEEKVAPCAQNLSSSVDAG--TMCSIGKDSGHAIS 781


>ref|XP_007152541.1| hypothetical protein PHAVU_004G138800g [Phaseolus vulgaris]
            gi|561025850|gb|ESW24535.1| hypothetical protein
            PHAVU_004G138800g [Phaseolus vulgaris]
          Length = 1187

 Score = 91.7 bits (226), Expect = 7e-16
 Identities = 107/424 (25%), Positives = 173/424 (40%), Gaps = 26/424 (6%)
 Frame = -1

Query: 1284 SDVSNATILDLISDGWTSDASASAGDYTEEKSTIKEXXXXXXXXXXXXXXXXXXXGKGLL 1105
            +D+ +  ++D +S G  SD S +A D  ++ +                         G  
Sbjct: 341  NDIQDTLVIDSVSVGSRSDGSINADDIGKQSNKAN-------------CTTISDSQDGYF 387

Query: 1104 FQENLSNVDVNADDQTERTKCDSQGCSSSDFHLVLYGKRGRQGRKLCGSSTGMDRFNSGA 925
              +NL+N   N  +  E      Q C S+D       KR +Q R +  SS G+++F    
Sbjct: 388  LCQNLTNDIHNNCEHMEGVMHSGQNCISND-------KRVKQKRTMSNSS-GLNKFGGVG 439

Query: 924  NVHGRCGKENNNSVWQKVQRNDVEGC--DFQSNNA------------PHV---SSRIXXX 796
             +H R GKEN++SVWQKVQ+N  +GC  D +  N             P V    + +   
Sbjct: 440  ILHSRKGKENSHSVWQKVQKNSSDGCGSDLKKVNTTLSQLASIVEKDPSVIKECNSVGVH 499

Query: 795  XXXXXXXXXXXXXXXAEKMKRKPDAGQKQEHSCYSRKRPPACKTNSSGATRINIQQKEVL 616
                            +K K K D   K+  S YSRK     ++ S+   ++ +QQ ++L
Sbjct: 500  GVSKTEDKKQMKNKIGKKSKGKMDLVSKKGQSNYSRKNLHFNRSLSNDHGKVGVQQNDML 559

Query: 615  EIPSEVNQHKGNLDGSRSHCPIEPPGGGFDQSCRVDLSLSESIQDSQVCLDDTKPIDS-- 442
             I S+     G ++ S  +  +     G  Q+  V+   SE I  ++  L+++ P +S  
Sbjct: 560  HISSQEFDQHGLINDSGLNSDVHCLRDGV-QTVGVEQVTSEQIHSAEFHLEESNPQNSAC 618

Query: 441  --VSKIFNMEGANKNSPSSGTENTLDQKQFLGVHSDGNVYLPINSVPARLQAEISHTENG 268
              V K       +++S        ++Q       S  +  L  + V  + + E+S  +  
Sbjct: 619  HTVVKTKKESIDSQDSSLVMPSENVNQSNMSVELSPASCDLEGDEV-GQTEKEVSSADCN 677

Query: 267  KQDHHSGPVLQKWIPVVRKDAETTAANGSGNLL-TSHLDESAGN----ESKVMDEEELSL 103
             Q+  SG  L KWIPV +KD  T       N+L   + D S+ N    ES V  E   S 
Sbjct: 678  AQNQCSGTTLWKWIPVGKKD--TGLEKSESNILPPDYFDASSSNNFNYESSVEPEVVSSE 735

Query: 102  SAQS 91
            S  S
Sbjct: 736  SKDS 739


>ref|XP_006847866.1| hypothetical protein AMTR_s00029p00086500 [Amborella trichopoda]
            gi|548851171|gb|ERN09447.1| hypothetical protein
            AMTR_s00029p00086500 [Amborella trichopoda]
          Length = 1276

 Score = 85.5 bits (210), Expect = 5e-14
 Identities = 100/400 (25%), Positives = 152/400 (38%), Gaps = 75/400 (18%)
 Frame = -1

Query: 1056 ERTKCDSQGCSSSDFHLVLYGKRGRQGRKLCGSSTG-MDRFNSGANVHGRCGKENNNSVW 880
            ER K  +QGCSSS  H      + RQGRK  GSS G + R++ G  +HGR G++NN+SVW
Sbjct: 420  ERLKYSNQGCSSSKTHAFGLSGKARQGRKSNGSSLGSIPRYHHGVTIHGRMGRDNNHSVW 479

Query: 879  QKVQRNDVEGCDFQSNNA----PHVSSRIXXXXXXXXXXXXXXXXXXAEKMKRKPDAGQK 712
            QKVQ++  E C  ++ N     P   +                      + + KP     
Sbjct: 480  QKVQKSGNE-CVLEAKNPNRLWPQPDAASVPVRDDVFMSQYGKKGQRRNEQEVKPRTASI 538

Query: 711  QEHSCYSRKRPPACKTNSSGATRINIQQKEVLE-IPSEVNQHKGNLDGSRSHC------- 556
              H       P    +       ++  + EV+E   SE ++ K NL   + H        
Sbjct: 539  SSH----LDAPQGVPSAVDRTLPLSTGEDEVIESTMSERSKGKTNLGSKQEHTNHSRIGN 594

Query: 555  ----------------PIEPP------------GGGFDQSC-------------RVDLSL 499
                              E P            GGG   +C             ++D   
Sbjct: 595  GGSKSKLIRLSRTNGFQRESPEIAWHANYYRSFGGGSKSTCYAQSERVEAAVSDKMDRVN 654

Query: 498  SESIQDSQVCLDDTKPIDSV------------SKIFNMEGA--NKNSPSSGTENTLDQKQ 361
            S+SI  SQ   D+  P+ +V            SK+ N   +  N +   S  E   D+ +
Sbjct: 655  SDSILGSQANNDEIIPVGNVGAGDANMKIQAASKLVNSSSSTLNLSYQVSAIEGPGDKWR 714

Query: 360  FLGVHSDGNVYLPI---NSVPARLQAEISHTENGKQDHHSGPVLQKWIPVVRKDA----E 202
                 S G  +  +          + E S  E+ KQD  S    +KWIPV RKDA     
Sbjct: 715  ISHGDSPGTDHPSLTHQEKETLHSETETSSVEHAKQDISSSYTSKKWIPVGRKDAGAFKT 774

Query: 201  TTAANGSGNLLTSHLDESAGNESKVMDEEELSLSAQSFVP 82
             T    +GN+L +  D+S     +V + ++     ++F+P
Sbjct: 775  NTITESNGNVLNNDFDKSLSRNGEVNNTQK----EEAFLP 810


>ref|XP_002516352.1| hypothetical protein RCOM_1402790 [Ricinus communis]
            gi|223544518|gb|EEF46036.1| hypothetical protein
            RCOM_1402790 [Ricinus communis]
          Length = 951

 Score = 71.2 bits (173), Expect = 1e-09
 Identities = 79/310 (25%), Positives = 128/310 (41%), Gaps = 18/310 (5%)
 Frame = -1

Query: 1095 NLSNVDVNADDQTERTKCDSQGCSSSDFHLVLYGKRGRQGRKLCGSSTGMDRFNSGANVH 916
            NL +  ++  D+ + TK   Q    S+   ++ GK   Q + L  SST     NS     
Sbjct: 269  NLLDGIIDLFDKAKGTKHHIQSFGGSNVQFLVPGKGDEQIKTLPRSSTVYKFGNS----- 323

Query: 915  GRCGKENNNSVWQKVQRNDVEGCDFQSNNAPHVS----------------SRIXXXXXXX 784
             R GKEN +SVWQKVQR+D + C+ +    P  S                +         
Sbjct: 324  -RIGKENIHSVWQKVQRDDRDDCNCELKKVPTCSQVNVALEGAPLLKNNCNVALVNTLSG 382

Query: 783  XXXXXXXXXXXAEKMKRKPDAGQKQEHSCYSRKRPPACKTNSSGATRINIQQKEVLEIPS 604
                        +K++++   G KQ ++C + +   + K   +G    NI+Q E+L   +
Sbjct: 383  PEDKRQPKTKVLKKLQKEGGLGSKQGYNCNNGRGCNSIKARLNGHAMANIKQNEILGTSA 442

Query: 603  EVNQHKGNLDGSRSHCPIEPPGGGFDQSCRVDLSLSESIQDSQVCLDDTKPIDSVSKIFN 424
            EVN  +      + H        GF  + +V+   S S   +QV  D+ + ++S S    
Sbjct: 443  EVNNEERVKCLPKHHNQSSGSQDGFYNN-KVERVNSGSANMAQVFSDELELLESTS---- 497

Query: 423  MEGANKNSPSSGTENTLDQKQFLGVHSDGNVYLP--INSVPARLQAEISHTENGKQDHHS 250
                  NS S    +   + Q         VYLP  +    +++  EIS  E  +++H S
Sbjct: 498  ------NSVSGDINHHTSEVQ-------PPVYLPHLVGIKVSQINKEIS-LEYSRKNHSS 543

Query: 249  GPVLQKWIPV 220
               LQKWIP+
Sbjct: 544  VSTLQKWIPI 553


>ref|XP_004137638.1| PREDICTED: uncharacterized protein LOC101212209 [Cucumis sativus]
          Length = 1174

 Score = 68.6 bits (166), Expect = 7e-09
 Identities = 86/378 (22%), Positives = 147/378 (38%), Gaps = 50/378 (13%)
 Frame = -1

Query: 1098 ENLSNVDVNADDQTERTKCDSQGCSSSDFHLVLYGKRGRQGRKLCGSSTGMDRFNSGANV 919
            +N +  +V+ + + E+     +GC+ S+   VL GK+ +Q +KL GSS  M+R+    + 
Sbjct: 367  QNSARDEVDLNAEVEKANLGIRGCTVSETCSVLPGKKTKQNKKLTGSSR-MNRYGGLGSS 425

Query: 918  HGRCGKENNNSVWQKVQRNDVEGCDFQSNNA------------PHVSSRIXXXXXXXXXX 775
              R GKEN ++VWQKVQR+   GC  Q +              P V  ++          
Sbjct: 426  QRRTGKENRHTVWQKVQRSSSGGCSEQLDQVSPISKQFKGICNPVVGVQMPKVKDKKTGN 485

Query: 774  XXXXXXXXAEKMKRKPDAGQKQEHSCYSRKRPPACKTNSSGATRINIQQKEVLEIPS--- 604
                      ++KRK  +GQ++ +      RP      S+ ++ ++    E L++ S   
Sbjct: 486  KKQLKEKCPRRLKRKNTSGQEKIY------RPTRNSCGSNTSSMVHKPPNEKLDVRSMGF 539

Query: 603  EVNQHKGNLDGSRSHCPIEPPGGGFDQSCRVDLSLSESIQDSQVCLDD---TKPI----- 448
            ++ +  G+            P   F        + SES++  QV LD+    K I     
Sbjct: 540  DIRRSSGD------------PRSCFQNDSTDKCTNSESVESKQVHLDELISNKLINDGLS 587

Query: 447  ------DSVSKIFNMEGANKNSP---------------------SSGTENTLDQKQFLGV 349
                  DS S   +   +N+++P                     SS  ++     Q   V
Sbjct: 588  SQKVENDSSSLPKSCNSSNQSNPVEVKSPVYLPHLFFQKVGNDSSSLPKSCNSSNQSNPV 647

Query: 348  HSDGNVYLPINSVPARLQAEISHTENGKQDHHSGPVLQKWIPVVRKDAETTAANGSGNLL 169
                +VYLP     A   + +   E  K D  S   LQ W+P        + A GS ++ 
Sbjct: 648  EVKSSVYLPHLFFQATKGSSLD--ERSKHDTQSRSPLQNWLP--------SGAEGSRSIT 697

Query: 168  TSHLDESAGNESKVMDEE 115
             +  D S+  ++     E
Sbjct: 698  LARPDFSSLRDANTQPAE 715

