BLASTX nr result

ID: Atropa21_contig00027387 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atropa21_contig00027387
         (1281 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006350032.1| PREDICTED: uncharacterized protein LOC102601...   532   e-148
ref|XP_006350037.1| PREDICTED: uncharacterized protein LOC102601...   526   e-146
ref|XP_004251799.1| PREDICTED: uncharacterized protein LOC101256...   519   e-144
ref|XP_004243407.1| PREDICTED: uncharacterized protein LOC101260...   381   e-103
ref|XP_006348849.1| PREDICTED: uncharacterized protein LOC102594...   252   2e-64
ref|XP_006348850.1| PREDICTED: uncharacterized protein LOC102594...   243   1e-61
ref|XP_002327318.1| predicted protein [Populus trichocarpa] gi|5...   115   5e-23
ref|XP_006376346.1| hypothetical protein POPTR_0013s12230g [Popu...   110   2e-21
gb|EXB78097.1| hypothetical protein L484_004798 [Morus notabilis]     107   8e-21
ref|XP_006446765.1| hypothetical protein CICLE_v10015391mg [Citr...   104   9e-20
ref|XP_004303170.1| PREDICTED: uncharacterized protein LOC101303...   104   9e-20
ref|XP_006469032.1| PREDICTED: uncharacterized protein LOC102611...   101   8e-19
gb|EOX92817.1| Uncharacterized protein isoform 1 [Theobroma caca...    98   6e-18
ref|XP_004487168.1| PREDICTED: protein gar2-like [Cicer arietinum]     89   5e-15
ref|XP_004149372.1| PREDICTED: uncharacterized protein LOC101205...    87   1e-14
gb|EOX92819.1| Uncharacterized protein isoform 3, partial [Theob...    85   7e-14
ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus c...    82   4e-13
ref|XP_002325580.2| hypothetical protein POPTR_0019s11960g [Popu...    81   8e-13
gb|EOX92821.1| Uncharacterized protein isoform 5 [Theobroma caca...    79   4e-12
gb|EOX92820.1| Uncharacterized protein isoform 4, partial [Theob...    79   4e-12

>ref|XP_006350032.1| PREDICTED: uncharacterized protein LOC102601397 isoform X1 [Solanum
            tuberosum] gi|565366720|ref|XP_006350033.1| PREDICTED:
            uncharacterized protein LOC102601397 isoform X2 [Solanum
            tuberosum] gi|565366722|ref|XP_006350034.1| PREDICTED:
            uncharacterized protein LOC102601397 isoform X3 [Solanum
            tuberosum] gi|565366724|ref|XP_006350035.1| PREDICTED:
            uncharacterized protein LOC102601397 isoform X4 [Solanum
            tuberosum] gi|565366726|ref|XP_006350036.1| PREDICTED:
            uncharacterized protein LOC102601397 isoform X5 [Solanum
            tuberosum]
          Length = 421

 Score =  532 bits (1370), Expect = e-148
 Identities = 292/398 (73%), Positives = 320/398 (80%), Gaps = 30/398 (7%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVAAADLSLNPYAHTEIN 1100
            +AMYQDTARYVENQVQTVG SV RFYSDV+LDLHPQFNIDPVKVAAADLSLNPYAHTEI+
Sbjct: 25   DAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQFNIDPVKVAAADLSLNPYAHTEIS 84

Query: 1099 KKLKANLKGHH-RGINKELFDDTHVIKGKSKSGGVYRRQNIGIKEIVRDSYPSSKKSDAL 923
            KKLKA LKG H   INKEL DDT VIKGKSKSGGVYRRQ++GIKEIVRD++P SKKSDAL
Sbjct: 85   KKLKAKLKGGHPMVINKELIDDTQVIKGKSKSGGVYRRQSVGIKEIVRDNHPPSKKSDAL 144

Query: 922  CLVSGNGIKLSSDSKVRGGFEVASDHMT--SPFASVKEHDFAEAGKEVSNHIINTNVPPA 749
            CLVSGN IKLSSDSKVRGGFEVASDHMT  SP ASVK    AE GKEVSNHII T+V  A
Sbjct: 145  CLVSGNAIKLSSDSKVRGGFEVASDHMTMTSPLASVKGRSSAETGKEVSNHIIKTDVSAA 204

Query: 748  -------ASDTSLSVEYVGQHQADLRNTSSVGDMQSESRADRGTYEELAGETGSNISSNT 590
                   ASD SLSV+ VGQ+QADLRNTSSVGD+QS+S ADRGT +ELAG+TG  ISSNT
Sbjct: 205  GISINVAASDRSLSVDCVGQNQADLRNTSSVGDLQSDSHADRGTCKELAGDTGLKISSNT 264

Query: 589  HNDNIASEGIN--------------------ESCKEISDRFPSATPEKYDLIESDMEIVE 470
             ++NIASE IN                    ESCKE SD+  S  PEKYDLIESD+EIVE
Sbjct: 265  GDNNIASEEINNIAKISSNTGDNNITGEEINESCKERSDKSCSPPPEKYDLIESDVEIVE 324

Query: 469  HYDQSKLEETCVLVEADRLHVPQGSVKRKSYKKKLQQVFSMKKKTTRKEYEQLGALHGDQ 290
            HYD+SKLEETCVLVEA++LHVPQ SVK+KSYKKKL+QVFSMKKK+TRKEYEQLGALHGDQ
Sbjct: 325  HYDESKLEETCVLVEAEKLHVPQESVKQKSYKKKLRQVFSMKKKSTRKEYEQLGALHGDQ 384

Query: 289  QPNIEAEDDKVIQVLAANLNTKKLXXXXXXXXXEWELL 176
            QPN+E E +K +QVL+ N N KKL         EWELL
Sbjct: 385  QPNLEPE-EKPMQVLSKNSNMKKLSSADDHSESEWELL 421


>ref|XP_006350037.1| PREDICTED: uncharacterized protein LOC102601397 isoform X6 [Solanum
            tuberosum]
          Length = 420

 Score =  526 bits (1354), Expect = e-146
 Identities = 291/398 (73%), Positives = 319/398 (80%), Gaps = 30/398 (7%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVAAADLSLNPYAHTEIN 1100
            +AMYQDTARYVENQVQTVG SV RFYSDV+LDLHPQFNIDPVKVAAADLSLNPYAHTEI+
Sbjct: 25   DAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQFNIDPVKVAAADLSLNPYAHTEIS 84

Query: 1099 KKLKANLKGHH-RGINKELFDDTHVIKGKSKSGGVYRRQNIGIKEIVRDSYPSSKKSDAL 923
            KKLKA LKG H   INKEL DDT VIKGKSKSGGVYRRQ++GIKEIVRD++P SKKSDAL
Sbjct: 85   KKLKAKLKGGHPMVINKELIDDTQVIKGKSKSGGVYRRQSVGIKEIVRDNHPPSKKSDAL 144

Query: 922  CLVSGNGIKLSSDSKVRGGFEVASDHMT--SPFASVKEHDFAEAGKEVSNHIINTNVPPA 749
            CLVSGN IKLSSDSKVRGGFEVASDHMT  SP ASVK    AE GKEVSNHII T+V  A
Sbjct: 145  CLVSGNAIKLSSDSKVRGGFEVASDHMTMTSPLASVKGRSSAETGKEVSNHIIKTDVSAA 204

Query: 748  -------ASDTSLSVEYVGQHQADLRNTSSVGDMQSESRADRGTYEELAGETGSNISSNT 590
                   ASD SLSV+ VGQ+QADLRNTSSVGD+QS+S  DRGT +ELAG+TG  ISSNT
Sbjct: 205  GISINVAASDRSLSVDCVGQNQADLRNTSSVGDLQSDSH-DRGTCKELAGDTGLKISSNT 263

Query: 589  HNDNIASEGIN--------------------ESCKEISDRFPSATPEKYDLIESDMEIVE 470
             ++NIASE IN                    ESCKE SD+  S  PEKYDLIESD+EIVE
Sbjct: 264  GDNNIASEEINNIAKISSNTGDNNITGEEINESCKERSDKSCSPPPEKYDLIESDVEIVE 323

Query: 469  HYDQSKLEETCVLVEADRLHVPQGSVKRKSYKKKLQQVFSMKKKTTRKEYEQLGALHGDQ 290
            HYD+SKLEETCVLVEA++LHVPQ SVK+KSYKKKL+QVFSMKKK+TRKEYEQLGALHGDQ
Sbjct: 324  HYDESKLEETCVLVEAEKLHVPQESVKQKSYKKKLRQVFSMKKKSTRKEYEQLGALHGDQ 383

Query: 289  QPNIEAEDDKVIQVLAANLNTKKLXXXXXXXXXEWELL 176
            QPN+E E +K +QVL+ N N KKL         EWELL
Sbjct: 384  QPNLEPE-EKPMQVLSKNSNMKKLSSADDHSESEWELL 420


>ref|XP_004251799.1| PREDICTED: uncharacterized protein LOC101256948 [Solanum
            lycopersicum]
          Length = 421

 Score =  519 bits (1336), Expect = e-144
 Identities = 284/398 (71%), Positives = 316/398 (79%), Gaps = 30/398 (7%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVAAADLSLNPYAHTEIN 1100
            +AMYQDTARYVENQVQTVG SV RFYSDV+LDLHPQFNIDPVKVAAADLSLNPYAHTEI+
Sbjct: 25   DAMYQDTARYVENQVQTVGASVKRFYSDVVLDLHPQFNIDPVKVAAADLSLNPYAHTEIS 84

Query: 1099 KKLKANLKGHH-RGINKELFDDTHVIKGKSKSGGVYRRQNIGIKEIVRDSYPSSKKSDAL 923
            KKLKA LKG H R INKEL DDT VIKGKSKSGGVYRRQ++G+KEIVRD++P SKKSDAL
Sbjct: 85   KKLKAQLKGGHPRVINKELIDDTQVIKGKSKSGGVYRRQSVGMKEIVRDNHPPSKKSDAL 144

Query: 922  CLVSGNGIKLSSDSKVRGGFEVASDHMT--SPFASVKEHDFAEAGKEVSNHIINTNVPPA 749
            CLVSGN IKLSSDSKVRGGFEVASDHMT  SP ASVK     E GKEVSNHII T VP A
Sbjct: 145  CLVSGNTIKLSSDSKVRGGFEVASDHMTMTSPLASVKGLKSTETGKEVSNHIIKTEVPAA 204

Query: 748  -------ASDTSLSVEYVGQHQADLRNTSSVGDMQSESRADRGTYEELAGETGSNISSNT 590
                   ASDTSLSV+ VGQ+QADLRNT SVGD+QS+S  DRGT +ELAG+TG  ISSNT
Sbjct: 205  GISINIAASDTSLSVDCVGQNQADLRNTFSVGDLQSDSHVDRGTRKELAGDTGLKISSNT 264

Query: 589  HNDNIASEGIN--------------------ESCKEISDRFPSATPEKYDLIESDMEIVE 470
             ++NIAS+ +N                    ESCK  SD+  S  P+KYDLIESD+EIVE
Sbjct: 265  GDNNIASKEVNNIAKISSNTDDNNIAGEEIKESCKARSDKSCSPPPDKYDLIESDVEIVE 324

Query: 469  HYDQSKLEETCVLVEADRLHVPQGSVKRKSYKKKLQQVFSMKKKTTRKEYEQLGALHGDQ 290
             YD+ KLEETCVLVEA++LHVPQGSVKRKSYKKKL+QVFSMKKK+TR EYEQLGAL+GDQ
Sbjct: 325  RYDEPKLEETCVLVEAEKLHVPQGSVKRKSYKKKLRQVFSMKKKSTRTEYEQLGALYGDQ 384

Query: 289  QPNIEAEDDKVIQVLAANLNTKKLXXXXXXXXXEWELL 176
            QPN++ E +K +QVL+ N N KKL         EWELL
Sbjct: 385  QPNLQPE-EKQMQVLSKNSNPKKLSSADDHSESEWELL 421


>ref|XP_004243407.1| PREDICTED: uncharacterized protein LOC101260247 [Solanum
            lycopersicum]
          Length = 374

 Score =  381 bits (978), Expect = e-103
 Identities = 219/370 (59%), Positives = 270/370 (72%), Gaps = 2/370 (0%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVAAADLSLNPYAHTEIN 1100
            EAMYQDT +YVENQ+ TVG +V RF S+VM D+HPQ NIDPVKVAAADLSLNPYAH EI+
Sbjct: 25   EAMYQDTVKYVENQMNTVGTNVKRFCSEVMQDVHPQCNIDPVKVAAADLSLNPYAHYEID 84

Query: 1099 KKLKANLKGHHRGINKELFDDTHVIKGKSKSGGVYRRQNIGIKEIVRDSYPSSKKSDALC 920
            KKLKANLKG  RG + +L DDT VIKGKSKSGGVY+RQN+GIKEIVRDS+  +KK +A+C
Sbjct: 85   KKLKANLKGSARGFSNKLNDDTQVIKGKSKSGGVYKRQNVGIKEIVRDSH-LTKKPNAIC 143

Query: 919  LVSGNGIKLSSDSKVRGGFEVASDH--MTSPFASVKEHDFAEAGKEVSNHIINTNVPPAA 746
            L SG+ +KLSS ++VRGGFE+ASDH  +TS  ASVK  D  E   +VSNH+I TNV  + 
Sbjct: 144  LASGDALKLSSSAEVRGGFELASDHVTLTSALASVKGSDSGEVASKVSNHVIQTNV--ST 201

Query: 745  SDTSLSVEYVGQHQADLRNTSSVGDMQSESRADRGTYEELAGETGSNISSNTHNDNIASE 566
            +DTS++ E      + + +  SVG  Q+++       +ELA  T    SS+  N N+A+E
Sbjct: 202  ADTSITSE-----ASVMMSVESVGKKQTDT-----CTKELACNTRFKTSSDVRN-NLANE 250

Query: 565  GINESCKEISDRFPSATPEKYDLIESDMEIVEHYDQSKLEETCVLVEADRLHVPQGSVKR 386
             I+ES +E SD   S    KYD IESD+EIVE +D+ +L ETCVLVE DR+HVPQG VK+
Sbjct: 251  EIDESHEEKSDNLLS----KYDSIESDLEIVEKFDEFQLNETCVLVEEDRIHVPQGPVKQ 306

Query: 385  KSYKKKLQQVFSMKKKTTRKEYEQLGALHGDQQPNIEAEDDKVIQVLAANLNTKKLXXXX 206
            KSYKKKL+  FS KK+ TRKEYEQLGAL+GDQQ  +E+E DKV+ VLA N NTK L    
Sbjct: 307  KSYKKKLRDAFSTKKRLTRKEYEQLGALYGDQQIKVESE-DKVMPVLAMNSNTKML-SAN 364

Query: 205  XXXXXEWELL 176
                 EWE+L
Sbjct: 365  DHPESEWEIL 374


>ref|XP_006348849.1| PREDICTED: uncharacterized protein LOC102594335 isoform X1 [Solanum
            tuberosum]
          Length = 260

 Score =  252 bits (644), Expect = 2e-64
 Identities = 145/245 (59%), Positives = 179/245 (73%), Gaps = 2/245 (0%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVAAADLSLNPYAHTEIN 1100
            EAMYQDT +YVENQV TVG +V RF S+VM D+HPQ NIDPVKVAAADLS+NPYAH EI+
Sbjct: 25   EAMYQDTVKYVENQVNTVGTNVKRFCSEVMQDVHPQCNIDPVKVAAADLSINPYAHYEID 84

Query: 1099 KKLKANLKGHHRGINKELFDDTHVIKGKSKSGGVYRRQNIGIKEIVRDSYPSSKKSDALC 920
            KKLKANLKG  R  + +L DDT VIKGKSKSGGVY+RQN+GIKEIVRDS+P +KK +A+C
Sbjct: 85   KKLKANLKGSARRFSNKLNDDTQVIKGKSKSGGVYKRQNVGIKEIVRDSHP-AKKPNAIC 143

Query: 919  LVSGNGIKLSSDSKVRGGFEVASDH--MTSPFASVKEHDFAEAGKEVSNHIINTNVPPAA 746
            L SG+ +KLSS ++VRGGFE+ASDH  +TS  ASVK  D  EA  +V +H I TNV  +A
Sbjct: 144  LASGDALKLSSSAEVRGGFEMASDHVTLTSALASVKGSDSGEAASKVRDHFIQTNV--SA 201

Query: 745  SDTSLSVEYVGQHQADLRNTSSVGDMQSESRADRGTYEELAGETGSNISSNTHNDNIASE 566
            +DTS++ E      +   +  SV   Q+++       +ELA  T   ISSN  N N+A+E
Sbjct: 202  ADTSITSE-----ASVTMSVESVRKKQTDT-----CTKELACNTRYKISSNVRN-NLANE 250

Query: 565  GINES 551
             INES
Sbjct: 251  EINES 255


>ref|XP_006348850.1| PREDICTED: uncharacterized protein LOC102594335 isoform X2 [Solanum
            tuberosum] gi|565364274|ref|XP_006348851.1| PREDICTED:
            uncharacterized protein LOC102594335 isoform X3 [Solanum
            tuberosum]
          Length = 251

 Score =  243 bits (620), Expect = 1e-61
 Identities = 140/241 (58%), Positives = 175/241 (72%), Gaps = 2/241 (0%)
 Frame = -3

Query: 1267 QDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVAAADLSLNPYAHTEINKKLK 1088
            +DT +YVENQV TVG +V RF S+VM D+HPQ NIDPVKVAAADLS+NPYAH EI+KKLK
Sbjct: 20   EDTVKYVENQVNTVGTNVKRFCSEVMQDVHPQCNIDPVKVAAADLSINPYAHYEIDKKLK 79

Query: 1087 ANLKGHHRGINKELFDDTHVIKGKSKSGGVYRRQNIGIKEIVRDSYPSSKKSDALCLVSG 908
            ANLKG  R  + +L DDT VIKGKSKSGGVY+RQN+GIKEIVRDS+P +KK +A+CL SG
Sbjct: 80   ANLKGSARRFSNKLNDDTQVIKGKSKSGGVYKRQNVGIKEIVRDSHP-AKKPNAICLASG 138

Query: 907  NGIKLSSDSKVRGGFEVASDH--MTSPFASVKEHDFAEAGKEVSNHIINTNVPPAASDTS 734
            + +KLSS ++VRGGFE+ASDH  +TS  ASVK  D  EA  +V +H I TNV  +A+DTS
Sbjct: 139  DALKLSSSAEVRGGFEMASDHVTLTSALASVKGSDSGEAASKVRDHFIQTNV--SAADTS 196

Query: 733  LSVEYVGQHQADLRNTSSVGDMQSESRADRGTYEELAGETGSNISSNTHNDNIASEGINE 554
            ++ E      +   +  SV   Q+++       +ELA  T   ISSN  N N+A+E INE
Sbjct: 197  ITSE-----ASVTMSVESVRKKQTDT-----CTKELACNTRYKISSNVRN-NLANEEINE 245

Query: 553  S 551
            S
Sbjct: 246  S 246


>ref|XP_002327318.1| predicted protein [Populus trichocarpa]
            gi|566200863|ref|XP_006376347.1| hypothetical protein
            POPTR_0013s12230g [Populus trichocarpa]
            gi|550325623|gb|ERP54144.1| hypothetical protein
            POPTR_0013s12230g [Populus trichocarpa]
          Length = 418

 Score =  115 bits (287), Expect = 5e-23
 Identities = 116/405 (28%), Positives = 178/405 (43%), Gaps = 37/405 (9%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVAAADLSLNPYAHTEIN 1100
            E M ++  +YVENQ+QTV  +V +FYSDVM DL    +  P   A + L ++  A  ++ 
Sbjct: 25   EIMCEEAVKYVENQMQTVSGNVRKFYSDVMQDLCSPDSEVPANGAVSKLPVDLGA-ADVG 83

Query: 1099 KKLKANLKGHHRGINKELFDDTHVIKGKSK---SGGVYR---RQNIGIKEIVRDSYPSSK 938
              LK +  G      K   DD  ++ G SK     G  R   R+ I I+ I R     S 
Sbjct: 84   VHLKPD-DGAKETCEKA--DDLRLLTGYSKMTTDHGPDRLPVRERISIRRISRQHSKGSL 140

Query: 937  KSDALCLVSGN------------GIKLSSDSKVRGGFEVASDH---------------MT 839
             + +   + GN            GI   S SK   G+   S+H               +T
Sbjct: 141  SNKSNLDMHGNSNCKNVSPKETSGITTPS-SKHLIGYSTISEHSDQNLEASCDWNARLIT 199

Query: 838  SPFASVKEH-DFAEAGKEVSN---HIINTNVPPAASDTSLSVEYVGQHQADLRNTSSVGD 671
                 V EH    ++ KE+ N   H+++ +    + D   ++   G+H+   R  SS+  
Sbjct: 200  PGSVEVTEHFSIEKSKKEIENTREHMLDISFYKPSLDMG-NITETGRHEGTDRRPSSINL 258

Query: 670  MQSESRADRGTYEELAGETGSNISSNTHNDNIASEGINESCKEISDRFPSATPEKYDLIE 491
            ++  + A       L   T    + N   +  A E   E     SD +   + +   LIE
Sbjct: 259  LEESNAAGVCLNNGLVSMTDFYANGNMQTNKFAYE---EDFVSNSDEWGIDSDKDGTLIE 315

Query: 490  SDMEIVEHYDQSKLEETCVLVEADRLHVPQGSVKRKSYKKKLQQVFSMKKKTTRKEYEQL 311
             DMEI++  D+++LEETCVL+  D L   +   K K YKKK++ VFS +K++ RKEYEQL
Sbjct: 316  EDMEIIQQVDKAQLEETCVLMNGDELDASREG-KNKPYKKKIRDVFSSRKRSVRKEYEQL 374

Query: 310  GALHGDQQPNIEAEDDKVIQVLAANLNTKKLXXXXXXXXXEWELL 176
             A+     P    E+ K   +   ++   K          EWEL+
Sbjct: 375  -AVQFRSDPKSNQEESKTSLMATPSIKEAKRSSSHDPSESEWELV 418


>ref|XP_006376346.1| hypothetical protein POPTR_0013s12230g [Populus trichocarpa]
            gi|550325622|gb|ERP54143.1| hypothetical protein
            POPTR_0013s12230g [Populus trichocarpa]
          Length = 416

 Score =  110 bits (274), Expect = 2e-21
 Identities = 115/405 (28%), Positives = 177/405 (43%), Gaps = 37/405 (9%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVAAADLSLNPYAHTEIN 1100
            E M ++  +YVENQ+QTV  +V +FYSDVM DL    +  P   A + L ++  A  ++ 
Sbjct: 25   EIMCEEAVKYVENQMQTVSGNVRKFYSDVMQDLCSPDSEVPANGAVSKLPVDLGA-ADVG 83

Query: 1099 KKLKANLKGHHRGINKELFDDTHVIKGKSK---SGGVYR---RQNIGIKEIVRDSYPSSK 938
              LK +  G      K   DD  ++ G SK     G  R   R+ I I+ I R     S 
Sbjct: 84   VHLKPD-DGAKETCEKA--DDLRLLTGYSKMTTDHGPDRLPVRERISIRRISRQHSKGSL 140

Query: 937  KSDALCLVSGN------------GIKLSSDSKVRGGFEVASDH---------------MT 839
             + +   + GN            GI   S SK   G+   S+H               +T
Sbjct: 141  SNKSNLDMHGNSNCKNVSPKETSGITTPS-SKHLIGYSTISEHSDQNLEASCDWNARLIT 199

Query: 838  SPFASVKEH-DFAEAGKEVSN---HIINTNVPPAASDTSLSVEYVGQHQADLRNTSSVGD 671
                 V EH    ++ KE+ N   H+++ +    + D   ++   G+H+   R  SS+  
Sbjct: 200  PGSVEVTEHFSIEKSKKEIENTREHMLDISFYKPSLDMG-NITETGRHEGTDRRPSSINL 258

Query: 670  MQSESRADRGTYEELAGETGSNISSNTHNDNIASEGINESCKEISDRFPSATPEKYDLIE 491
            ++  +         L   T    + N   +  A E   E     SD +   + +   LIE
Sbjct: 259  LEESNGVCLN--NGLVSMTDFYANGNMQTNKFAYE---EDFVSNSDEWGIDSDKDGTLIE 313

Query: 490  SDMEIVEHYDQSKLEETCVLVEADRLHVPQGSVKRKSYKKKLQQVFSMKKKTTRKEYEQL 311
             DMEI++  D+++LEETCVL+  D L   +   K K YKKK++ VFS +K++ RKEYEQL
Sbjct: 314  EDMEIIQQVDKAQLEETCVLMNGDELDASREG-KNKPYKKKIRDVFSSRKRSVRKEYEQL 372

Query: 310  GALHGDQQPNIEAEDDKVIQVLAANLNTKKLXXXXXXXXXEWELL 176
             A+     P    E+ K   +   ++   K          EWEL+
Sbjct: 373  -AVQFRSDPKSNQEESKTSLMATPSIKEAKRSSSHDPSESEWELV 416


>gb|EXB78097.1| hypothetical protein L484_004798 [Morus notabilis]
          Length = 443

 Score =  107 bits (268), Expect = 8e-21
 Identities = 111/388 (28%), Positives = 163/388 (42%), Gaps = 57/388 (14%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVAAADLSLNPYAHTEIN 1100
            E MYQDT +YVENQVQTVG SV RFYSDVM DL P  + D  KV+         +   I+
Sbjct: 25   EIMYQDTVKYVENQVQTVGASVKRFYSDVMQDLLPPSSQDSEKVSLCGFIGKQDSDDGIS 84

Query: 1099 KKLKANLKGHHRGINKELFDDTHVIKGKSKSGGVYRRQNIGIK-----------EIVR-- 959
            KK     K      + E    T  +K  S S  VY   +I ++           E V+  
Sbjct: 85   KKPNVAKKEKPAKADDEQLIRT--LKVTSDSKDVYLAPSIHVRCDVDNMCRPSGECVKGA 142

Query: 958  -DSYPSSKKSDALCLVSGNGIKLS---SDSKV----RGGFEVASDHMTSPFASVKEHDFA 803
              +  S KK   + + S + + ++   SD K+             H++ P +S  E  F 
Sbjct: 143  CSNLRSRKKCRDVSVHSSSNLSVNENRSDKKLIPPETSCAITREKHLSRPLSSYSE--FV 200

Query: 802  EAGKEVSNHIINTNVPPAASDTSLS---------VEYVGQHQADLRNT------------ 686
                E+S     T   P+ ++ + S         +E   +  ADL ++            
Sbjct: 201  NEIHEISLDQTGTTKAPSVNEDTSSDSIVESCDEIENSSECMADLSSSFHASSEIILVKS 260

Query: 685  ----------SSVGDMQSESRAD---RGTYEELAGETGSNISSNTHNDNIASEGINESCK 545
                       S G +  ++  D   + +   LA   GS+ +    ND  A E +  S  
Sbjct: 261  VGYDGNEMDVPSGGGLSEQANGDYTSKCSSNSLASTGGSSQNEEARNDKYADEDVFVSLP 320

Query: 544  EISDRFPSATPEKYDLIESDMEIVEHYDQSKLEETCVLVEADRLHV-PQGSVKRKSYKKK 368
               D +     E     E   E ++  D+ KLEETCVLV  D LH+ PQ   K + YKKK
Sbjct: 321  RKFDDWNLNITESEIATEHGTETIQQRDKVKLEETCVLVNEDELHILPQRGGKWRPYKKK 380

Query: 367  LQQVFSMKKKTTRK-EYEQLGALHGDQQ 287
            ++     + ++ RK EYEQL   +GD +
Sbjct: 381  IRDALYSRMRSARKEEYEQLVLQYGDNK 408


>ref|XP_006446765.1| hypothetical protein CICLE_v10015391mg [Citrus clementina]
            gi|567908905|ref|XP_006446766.1| hypothetical protein
            CICLE_v10015391mg [Citrus clementina]
            gi|557549376|gb|ESR60005.1| hypothetical protein
            CICLE_v10015391mg [Citrus clementina]
            gi|557549377|gb|ESR60006.1| hypothetical protein
            CICLE_v10015391mg [Citrus clementina]
          Length = 416

 Score =  104 bits (259), Expect = 9e-20
 Identities = 105/374 (28%), Positives = 166/374 (44%), Gaps = 43/374 (11%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVA-AADLSLNPYAHTEI 1103
            E MYQDT +YVENQVQTVG +V +FYSDV+ DL P  ++D VK A A++L L   A   I
Sbjct: 25   EIMYQDTVKYVENQVQTVGSTVKKFYSDVIEDLLPPPSVDLVKGAVASNLPLEQNADVGI 84

Query: 1102 NKKLKANLKGHHRGINKELFDD-----THVIKGKSKSGGVYR--------RQNIG--IKE 968
             KK K  +K     +N E   +     T + KG        R        + ++G  +K 
Sbjct: 85   YKKPKIGIKEEAMNVNNEQLSESSLATTDLDKGAGGGQSFCRFHIEDTSFQPSLGDTLKG 144

Query: 967  IVRDSYP------SSKKSDALCLVSGNGIKLSSDSKVRGGFEVASDHMTSPFASVKEH-D 809
            +  D+Y       S     ++C+   +       S++ G    A  HM           +
Sbjct: 145  VFSDAYSKEYDIRSGHNQSSICMQKISKEDNLPPSEMSG----AGPHMERGLRRASSSCE 200

Query: 808  FAEAGKEVSNHIINTNVPPAASDTSLSVEYVGQH-QADLRNTSSVGDMQSESRADRGTYE 632
              +  +EVS+  +  +  P  ++ +    +   + + +  +  + G + S   A      
Sbjct: 201  LLDKIQEVSDDQVVVDPTPVTTEVASCKSFEEIYDELEKASKGASGALTSSPAAKNCDES 260

Query: 631  ELAGETGSNISSNTH----NDNIAS---EGINESCKEISDRFP----------SATPEKY 503
            E A  + S++S+  +    ND + S     +NE  +     FP           AT    
Sbjct: 261  ENAHSSCSSLSAELNGICTNDGVVSLVGSFVNEDVQ--PSEFPDPGRSDYSTVDATESNI 318

Query: 502  DLIESDMEIVEHYDQSKLEETCVLVEADRL-HVPQGSVKRKSYKKKLQQVFSMKKKTTRK 326
            D +E   E V+  D  ++EETCVLV  D L  VP    K + YKKK+Q   S + ++TRK
Sbjct: 319  D-VEQGYETVQRVDNIQVEETCVLVNGDELCFVPCREGKHRPYKKKIQDAISSRMRSTRK 377

Query: 325  -EYEQLGALHGDQQ 287
             EY+QL   + + +
Sbjct: 378  HEYKQLAVWYNEDE 391


>ref|XP_004303170.1| PREDICTED: uncharacterized protein LOC101303722 [Fragaria vesca
            subsp. vesca]
          Length = 389

 Score =  104 bits (259), Expect = 9e-20
 Identities = 99/358 (27%), Positives = 162/358 (45%), Gaps = 24/358 (6%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVAAADLSLNPYAHTEIN 1100
            E MY+DT ++VE+QVQTVGESV +FY+DVM DL    ++D   V+A    +  Y+  + +
Sbjct: 28   ENMYEDTVKFVEDQVQTVGESVKKFYADVMQDLLCDSSLDRDDVSAGGFPVEHYSDVDNS 87

Query: 1099 KKLKANLKGHHRGINKELFDDTHVIKGKSKS---GGVYRRQNIGIKEIVRDSYPSSKKSD 929
            K      K H +   +E+  D+ VI    K     G++ RQ +         Y S  KS 
Sbjct: 88   KSKIRKKKEHVKAGVEEVKGDSEVISAVLKDVDHTGLFHRQRV---------YDSCTKS- 137

Query: 928  ALCLVSGNGIKLSSDSKVRGGFEVASDHMTSPFASVKEH---DFAEAGKEVSNHIINTNV 758
                 SGN  KL+  S+   G    +  +      +K+         GK+ S   +++  
Sbjct: 138  -----SGNCAKLAC-SRQDHGVRSCNKKIVVRETPIKDRLPGANTAVGKDFSRESLSSCS 191

Query: 757  PPAASDTSLSVEYVGQHQADLRNTSSVGDMQSESRADRGTYEELAGETGSNISSNTHNDN 578
              +  D   S +   Q    +  +     M+ +S ++       +  TG ++S N  + +
Sbjct: 192  EFSNEDRDTSCD---QPDEVITPSKPPEGMRCDSMSESCVVANASQCTGDDVSVNCQSSD 248

Query: 577  IA----------SEGINESCKEISDRFPSATPE-KYDLIESDM-----EIVEHYDQSKLE 446
            +           +E ++ S   +S      +     D IES++     EI++  D+ KLE
Sbjct: 249  MIVLDNSDGKRWNELLDSSIGGLSTELNGGSINPSMDAIESNIGTHGTEIIQQSDKPKLE 308

Query: 445  ETCVLVEADRLHVPQGSVKR-KSYKKKLQQVFSMKKKTTRK-EYEQLGALHGDQQPNI 278
            ETCV+V  + LH    +V   K YKKK+ + F+ +  + RK EYEQL   HG    +I
Sbjct: 309  ETCVMVSGEDLHFVHHTVANYKPYKKKIPKAFTSRTSSARKQEYEQLALWHGHHTKSI 366


>ref|XP_006469032.1| PREDICTED: uncharacterized protein LOC102611541 isoform X1 [Citrus
            sinensis] gi|568829444|ref|XP_006469033.1| PREDICTED:
            uncharacterized protein LOC102611541 isoform X2 [Citrus
            sinensis] gi|568829446|ref|XP_006469034.1| PREDICTED:
            uncharacterized protein LOC102611541 isoform X3 [Citrus
            sinensis] gi|568829448|ref|XP_006469035.1| PREDICTED:
            uncharacterized protein LOC102611541 isoform X4 [Citrus
            sinensis]
          Length = 416

 Score =  101 bits (251), Expect = 8e-19
 Identities = 108/375 (28%), Positives = 166/375 (44%), Gaps = 44/375 (11%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVA-AADLSLNPYAHTEI 1103
            E MYQDT +YVENQVQTVG +V +FYSDV+ DL P  ++D VK A A++L L   A   I
Sbjct: 25   EIMYQDTVKYVENQVQTVGSTVKKFYSDVIEDLLPPPSVDLVKGAVASNLPLEQNADVGI 84

Query: 1102 NKKLKANLKGHHRGINKELFDD-----THVIKGKSKSGGVYR--------RQNIG--IKE 968
             KK K  +K     +N E   +     T + KG        R        + ++G  +K 
Sbjct: 85   YKKPKIGIKEEAMKVNNEQLSESSLATTDLDKGAGGGQSFCRFHIEDTSFQPSLGNTLKG 144

Query: 967  IVRDSYP------SSKKSDALCLVSGNGIKLSSDSKVRGGFEVASDHMTSPFASVKEH-D 809
            +  D+YP      S     ++C+   +       S++ G    A  HM           +
Sbjct: 145  VFSDAYPKEYDIRSGHNQSSICMQKISKEDNLPPSEMSG----AGPHMERGLRRASSSCE 200

Query: 808  FAEAGKEVSNHIINTNVPPAASDTSLSVEYVGQHQADLRNTS--SVGDMQSESRADRGTY 635
              +  +EVS+  +  + P + +    S +   +   +L   S  + G + S   A     
Sbjct: 201  LLDKIQEVSDDQVVVD-PTSVTTEVASCKSFEEIYDELEKASKGASGALTSSPAAKNCDE 259

Query: 634  EELAGETGSNISSNTH----NDNIAS---EGINESCKEISDRFP----------SATPEK 506
             E A  + S++S+  +    ND + S     +NE  +     FP           AT   
Sbjct: 260  SESAHSSCSSLSAELNGICTNDGVVSLVGSFVNEDVQ--PSEFPDPGRSDYSTVDATESN 317

Query: 505  YDLIESDMEIVEHYDQSKLEETCVLVEADRL-HVPQGSVKRKSYKKKLQQVFSMKKKTTR 329
             D +E   E V+  D  ++EETCVLV  D L  VP    K +  KKK+Q   S + ++TR
Sbjct: 318  ID-VEQGYETVQRVDNIQVEETCVLVNGDELCFVPCREDKHRPCKKKIQDAISSRMRSTR 376

Query: 328  K-EYEQLGALHGDQQ 287
            K EY+QL   + + +
Sbjct: 377  KHEYKQLAVWYNEDE 391


>gb|EOX92817.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508700922|gb|EOX92818.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 397

 Score = 98.2 bits (243), Expect = 6e-18
 Identities = 102/352 (28%), Positives = 162/352 (46%), Gaps = 23/352 (6%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYS----DVMLDLHPQFNIDPVK-VAAADLSLNPYA 1115
            E MYQDT +YVEN+VQTVG SV +FYS    DVM DL    +++P+K VAA+DL +  YA
Sbjct: 28   EVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQDLLLPSSLEPMKAVAASDLPVEIYA 87

Query: 1114 HTEINKKLKANLKGHH-RGINKELFDDTHVIKGKSKS-----GGVYRRQNIGIKEIVRDS 953
             T   KK    LK    +G +++L +D+ VI   +++               I E    S
Sbjct: 88   ET--LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNENAAHVPSSCQLHMVDNIFESCSGS 145

Query: 952  YPSSKKSDALCLVSGNGIKLSSDSKVRGGFEVASDHMTSPFASVKEHDF---------AE 800
            +     SD L     N   L+     +   E      TS  A   E++F         A 
Sbjct: 146  FVERASSDLLSGEHNNRCTLN-----KTNVEHLLPAETSSEAGCVENEFGRMSSFCGNAN 200

Query: 799  AGKEVSNHIINTNVPPAASDTSLSVEYVGQHQADLRNTS-SVGDMQSESRADRGTYEELA 623
            A  EVS H I   + P + +     + + +   ++++ S SV ++  +     G  E+  
Sbjct: 201  ANHEVSCHQIPATLTPVSVEED-DCDSIEESSNEIKSASDSVPEILPDGLHLVGIVEK-- 257

Query: 622  GETGSNISSNTHNDNIASEGINESCKEISDRFPSATPEKYDLIESDMEIVEHYDQSKLEE 443
                + +     +  I SE  N       D   S+T  +      ++E V+  D+ +++E
Sbjct: 258  ----NEMEMRCSSSIIESEESNGKLNWTKDASGSSTVGR-----KEIETVQQLDKIRVDE 308

Query: 442  TCVLVEADRLHV-PQGSVKRKSYKKKLQQVFSMKKKTTR-KEYEQLGALHGD 293
            +C +V    LH  PQ   K K+Y++K++   S + ++ R KEYEQL   +GD
Sbjct: 309  SCFMVNGAELHFHPQREGKHKTYQRKIRDAISSRMRSARKKEYEQLPLWYGD 360


>ref|XP_004487168.1| PREDICTED: protein gar2-like [Cicer arietinum]
          Length = 365

 Score = 88.6 bits (218), Expect = 5e-15
 Identities = 93/349 (26%), Positives = 151/349 (43%), Gaps = 23/349 (6%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVAAADLSLNPYAHTEIN 1100
            + ++++T +Y+ENQ++ VGESV +FYSDVM ++ PQ +      +A  +S+  Y    + 
Sbjct: 28   DTLFEETFQYIENQMEIVGESVKKFYSDVMQEILPQSSF-----SAPVMSVEQYTGAGLT 82

Query: 1099 KK-LKANLKGHHRGINKELFDDTHVIKGKSKSGGVYRRQNIGIKEIVRDSYPSSKKSDAL 923
            KK  +A+ +   R   ++  +++ V              +IG   +  +S    K+    
Sbjct: 83   KKSFQASREITIRAYTEQSTENSSV------------NHDIGNDAVYAES--CGKQCVES 128

Query: 922  CLVSGNGIKLSSDSKVRGGFEVASDHMTSPFASVKEHDFAEAGKEVSNHIINTNVPPAAS 743
              V  N   LSSD   +    VAS+  T    S  +   +    E++N  +N N     S
Sbjct: 129  AEVKSN---LSSDENQQNMKMVASNTTTEVALSKTDTCISSQSCEIAN--VNQNHEATVS 183

Query: 742  DTSLSVEYVGQHQADLRNTSSVGDMQSESRADRGTYEELAGETGSNISSNTHNDNIASEG 563
             T           A++ N +SV D  +ES          A E   +   N  N +  S  
Sbjct: 184  KTD---------YAEVTNFASVEDCCNESENASTEQNSNAMELVESTEENEINTSYFSSD 234

Query: 562  INESCKEIS------------DRFPSATPEKYDL--------IESDMEIVEHYDQSKLEE 443
              E   E+S                 + PE   L        +E D +I+   D+ + +E
Sbjct: 235  AFEDAHELSTIGAMQLDDCSHSTITVSHPESSSLDIENFDAAMEKDHKIIHQDDELQFDE 294

Query: 442  TCVLVEADRLH-VPQGSVKRKSYKKKLQQVFSMKKKTTRK-EYEQLGAL 302
            TCV++  D    VP+  V  K+ KKK +Q FS+ KK+ RK EYE+L  L
Sbjct: 295  TCVMITKDEYQSVPEAIVNLKTSKKKWRQPFSLSKKSARKQEYEELALL 343


>ref|XP_004149372.1| PREDICTED: uncharacterized protein LOC101205697 [Cucumis sativus]
          Length = 379

 Score = 87.0 bits (214), Expect = 1e-14
 Identities = 88/332 (26%), Positives = 141/332 (42%), Gaps = 8/332 (2%)
 Frame = -3

Query: 1267 QDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVAAADLSLNPYAHTEINKKLK 1088
            QDT +YVENQV+ VG SV RFYSDVM D  P   +   KVA  + +L  Y +  I KK  
Sbjct: 29   QDTVKYVENQVEVVGASVKRFYSDVMQDFLPPSELSDEKVAVCNSALENYENVVICKKPT 88

Query: 1087 ANLKGHHRGINKELFDDTHVIKGKSKSGGVYR----RQNIGIKEIVRDSYPSSKKSDALC 920
              +K      ++E  ++   +   +K     +      +     +V   Y ++ ++    
Sbjct: 89   MGMKIERSKFSEEKSNENSKVTADAKRDIACKLPRGHNHANYLYLVSSPYSAANRAQ--- 145

Query: 919  LVSGNGIKLSSDSKVRGGFEV-ASDHMTSPFASVKEHDFAEAGKEVSNHIINTNVPPAAS 743
             + G   K   D  +    ++   +  T    S+ E       K+  N   ++       
Sbjct: 146  -IDGYSRK-KDDENIHHKIDLDGRESTTRGCKSLTETSPTNLEKKYEND-ASSCCTILNR 202

Query: 742  DTSLSVEYVGQHQADL-RNTSSVGDMQSESRADRGTYEELAGETGSNISSNTHNDNIASE 566
             +  S E  G  +  L ++T     MQS +  +  T   L     S I        + S 
Sbjct: 203  KSEASSELAGNMETMLVKDTRCNSVMQSANETEIKTDNILPDTPSSAIVDTEKETRLLSY 262

Query: 565  GINESCKEISDRFPSATPEKYDLIESDMEIVEHYDQSKL-EETCVLVEADRLHVPQGSVK 389
            G  +S  E+  R  S + +  +L E     ++  D++KL EE CVLV+ D LH       
Sbjct: 263  G--DSSAELDGRSDSWSLDDIEL-EQGTHNIQQADETKLDEEACVLVKGDDLHFDFNEEV 319

Query: 388  RKSYKKKLQQVFSMKKKTTRK-EYEQLGALHG 296
            ++ + KK+   FS  KK+ RK EY++L   HG
Sbjct: 320  KQRHYKKIAGAFSFTKKSKRKQEYKELAMKHG 351


>gb|EOX92819.1| Uncharacterized protein isoform 3, partial [Theobroma cacao]
          Length = 343

 Score = 84.7 bits (208), Expect = 7e-14
 Identities = 93/334 (27%), Positives = 150/334 (44%), Gaps = 22/334 (6%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYS----DVMLDLHPQFNIDPVK-VAAADLSLNPYA 1115
            E MYQDT +YVEN+VQTVG SV +FYS    DVM DL    +++P+K VAA+DL +  YA
Sbjct: 28   EVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQDLLLPSSLEPMKAVAASDLPVEIYA 87

Query: 1114 HTEINKKLKANLKGHH-RGINKELFDDTHVIKGKSKS-----GGVYRRQNIGIKEIVRDS 953
             T   KK    LK    +G +++L +D+ VI   +++               I E    S
Sbjct: 88   ET--LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNENAAHVPSSCQLHMVDNIFESCSGS 145

Query: 952  YPSSKKSDALCLVSGNGIKLSSDSKVRGGFEVASDHMTSPFASVKEHDF---------AE 800
            +     SD L     N   L+     +   E      TS  A   E++F         A 
Sbjct: 146  FVERASSDLLSGEHNNRCTLN-----KTNVEHLLPAETSSEAGCVENEFGRMSSFCGNAN 200

Query: 799  AGKEVSNHIINTNVPPAASDTSLSVEYVGQHQADLRNTS-SVGDMQSESRADRGTYEELA 623
            A  EVS H I   + P + +     + + +   ++++ S SV ++  +     G  E+  
Sbjct: 201  ANHEVSCHQIPATLTPVSVEED-DCDSIEESSNEIKSASDSVPEILPDGLHLVGIVEK-- 257

Query: 622  GETGSNISSNTHNDNIASEGINESCKEISDRFPSATPEKYDLIESDMEIVEHYDQSKLEE 443
                + +     +  I SE  N       D   S+T  +      ++E V+  D+ +++E
Sbjct: 258  ----NEMEMRCSSSIIESEESNGKLNWTKDASGSSTVGR-----KEIETVQQLDKIRVDE 308

Query: 442  TCVLVEADRLHV-PQGSVKRKSYKKKLQQVFSMK 344
            +C +V    LH  PQ   K K+Y++K++   S +
Sbjct: 309  SCFMVNGAELHFHPQREGKHKTYQRKIRDAISSR 342


>ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus communis]
            gi|223535579|gb|EEF37247.1| hypothetical protein
            RCOM_0553590 [Ricinus communis]
          Length = 490

 Score = 82.4 bits (202), Expect = 4e-13
 Identities = 105/442 (23%), Positives = 163/442 (36%), Gaps = 107/442 (24%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVAAADLSLNPYAHTEIN 1100
            E MYQDT +YVENQVQTVG SV RFYSDVM DL P  ++D  K A  D+ L  YA   I 
Sbjct: 25   EVMYQDTVKYVENQVQTVGSSVKRFYSDVMQDLLPPSSVDAAKGAGVDVPLELYADLGIY 84

Query: 1099 KKLKANLKGHHRGI-NKELFDDTHVIKGKSKSGGVYRRQNIGIKEIVRDSYPSSK----- 938
             K K  +K     + ++E   +   I    KS        +G   +V + +P S+     
Sbjct: 85   MKPKVGVKEKQGKVDDRERLTEDPKITTDKKSMDPLTFHRLG---LVENRFPLSQGNSAG 141

Query: 937  -------------KSDALCLVSGNGIKLSSDSKVRGGFEVASDHMTSPFASVKEHDFAEA 797
                         KS+     + N   +S D K+     +    + + F+     +  ++
Sbjct: 142  GASRQHGKRSLSNKSNPYTRKNSNRENMSVDKKLEAISCLDKGLIRASFSERSNENLGDS 201

Query: 796  GKEVSNHIINTNVPPAASDTSLSVEYVGQHQ---------------ADLRNTSSVGDMQS 662
            G        ++ +P    DTSL      + Q                DL   SS+ ++ +
Sbjct: 202  GGGAPKQYGDSCLP---KDTSLGTNGNSERQNIFLHEKARVVIPLYNDLTRASSICELSN 258

Query: 661  ESR-----------------------ADRGTYE------------------ELAGETGSN 605
            E+                         D   YE                  E     G +
Sbjct: 259  ENHKDCVDQQAKITTPGSVEMTGHDSVDESKYEIENASEQIPDIPDMVNSTESGASKGMD 318

Query: 604  ISSNTHNDNIASEGINESCKEISDRFPSAT----PEKYDLIESDMEIVEH---------- 467
            ++ ++H    A     + C      FP+ +      K    +SD + V +          
Sbjct: 319  MTCSSHGSLSAEAHAADDCMSHGADFPADSFVNGNGKGQSSDSDEDFVSNSGSDDCNTDV 378

Query: 466  --YDQSKLEETCVLVEADRLHVPQGSV---------------KRKSYKKKLQQVFSMKKK 338
               D S   E  ++ + D+  + +  +               K KSYKKK++ VFS +K+
Sbjct: 379  YKIDFSISHEMEIIQQVDKAKLEESCILVNRDECHYLPQSERKSKSYKKKIRDVFSPRKR 438

Query: 337  TTRKEYEQLGALHG-DQQPNIE 275
            + RK +EQL    G D  PN E
Sbjct: 439  SMRK-HEQLSICPGSDSNPNQE 459


>ref|XP_002325580.2| hypothetical protein POPTR_0019s11960g [Populus trichocarpa]
            gi|550317324|gb|EEE99961.2| hypothetical protein
            POPTR_0019s11960g [Populus trichocarpa]
          Length = 442

 Score = 81.3 bits (199), Expect = 8e-13
 Identities = 97/362 (26%), Positives = 152/362 (41%), Gaps = 29/362 (8%)
 Frame = -3

Query: 1267 QDTARYVENQVQTVGESVTRFYSDVMLDLHPQFNIDPVKVAAADLSLNPYAHTEINKKLK 1088
            Q+  +YVENQ+QTV  +V +FYSDVM DL    + DP   A +   ++  A   I  K +
Sbjct: 61   QEAVKYVENQMQTVSNNVRKFYSDVMQDLCSPDSEDPANGAVSKFPVDSGADVGIYMKPE 120

Query: 1087 ANLKGH-HRGINKELFDDTHVIKGKSKSGGVYRRQNIGIKEIVRD---------SYPSSK 938
              ++    +  + E   +   +   S S  +  R+ I ++ I R          S   + 
Sbjct: 121  DGMEEKCGKADDPEQLAEDPKMTADSGSDCLPLRRRITVRRISRQHSKGSLSNKSNLDTD 180

Query: 937  KSDALCLVSGNGI--------KLSSDSKVRGGFEVASDHMTSPFAS-----VKEHDFAEA 797
            K+     VS N I        K SS+ ++      AS   T+  A+     V +H   E 
Sbjct: 181  KNSNCNNVSPNEISGTTTLSSKFSSNVELSDQNLEASCDQTARLATPGCVEVTDHFSMEE 240

Query: 796  G----KEVSNHI--INTNVPPAASDTSLSVEYVGQHQADLRNTSSVGDMQSESRADRGTY 635
                 K  S H+  I+ N P   S   +++   G+H+      SS   ++  +       
Sbjct: 241  SKNEIKNASKHVPEISFNKP---SLDMVNITETGRHEGTDSRPSSRNLLEESNGV--CIS 295

Query: 634  EELAGETGSNISSNTHNDNIASEGINESCKEISDRFPSATPEKYDLIESDMEIVEHYDQS 455
             E      S  + N   +  A E   E     SD +   + E   +I+  MEI+   D++
Sbjct: 296  NEFVSMIESAANGNMQTNKFAYE---EDFVSNSDEWGIESDEDGTIIDEGMEII-RADKA 351

Query: 454  KLEETCVLVEADRLHVPQGSVKRKSYKKKLQQVFSMKKKTTRKEYEQLGALHGDQQPNIE 275
            +LEE CVLV  D  H      K + Y KK++ VF  +K++  KEYEQL A       + E
Sbjct: 352  RLEEVCVLVNVDEFHHVPREGKNRPY-KKIRDVFRSRKRSVMKEYEQLAAQCSSDSKSKE 410

Query: 274  AE 269
             E
Sbjct: 411  EE 412


>gb|EOX92821.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508700926|gb|EOX92822.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 334

 Score = 79.0 bits (193), Expect = 4e-12
 Identities = 91/324 (28%), Positives = 144/324 (44%), Gaps = 22/324 (6%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYS----DVMLDLHPQFNIDPVK-VAAADLSLNPYA 1115
            E MYQDT +YVEN+VQTVG SV +FYS    DVM DL    +++P+K VAA+DL +  YA
Sbjct: 28   EVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQDLLLPSSLEPMKAVAASDLPVEIYA 87

Query: 1114 HTEINKKLKANLKGHH-RGINKELFDDTHVIKGKSKS-----GGVYRRQNIGIKEIVRDS 953
             T   KK    LK    +G +++L +D+ VI   +++               I E    S
Sbjct: 88   ET--LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNENAAHVPSSCQLHMVDNIFESCSGS 145

Query: 952  YPSSKKSDALCLVSGNGIKLSSDSKVRGGFEVASDHMTSPFASVKEHDF---------AE 800
            +     SD L     N   L+     +   E      TS  A   E++F         A 
Sbjct: 146  FVERASSDLLSGEHNNRCTLN-----KTNVEHLLPAETSSEAGCVENEFGRMSSFCGNAN 200

Query: 799  AGKEVSNHIINTNVPPAASDTSLSVEYVGQHQADLRNTS-SVGDMQSESRADRGTYEELA 623
            A  EVS H I   + P + +     + + +   ++++ S SV ++  +     G  E+  
Sbjct: 201  ANHEVSCHQIPATLTPVSVEED-DCDSIEESSNEIKSASDSVPEILPDGLHLVGIVEK-- 257

Query: 622  GETGSNISSNTHNDNIASEGINESCKEISDRFPSATPEKYDLIESDMEIVEHYDQSKLEE 443
                + +     +  I SE  N       D   S+T  +      ++E V+  D+ +++E
Sbjct: 258  ----NEMEMRCSSSIIESEESNGKLNWTKDASGSSTVGR-----KEIETVQQLDKIRVDE 308

Query: 442  TCVLVEADRLHV-PQGSVKRKSYK 374
            +C +V    LH  PQ   K K+Y+
Sbjct: 309  SCFMVNGAELHFHPQREGKHKTYQ 332


>gb|EOX92820.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
          Length = 341

 Score = 79.0 bits (193), Expect = 4e-12
 Identities = 91/324 (28%), Positives = 144/324 (44%), Gaps = 22/324 (6%)
 Frame = -3

Query: 1279 EAMYQDTARYVENQVQTVGESVTRFYS----DVMLDLHPQFNIDPVK-VAAADLSLNPYA 1115
            E MYQDT +YVEN+VQTVG SV +FYS    DVM DL    +++P+K VAA+DL +  YA
Sbjct: 28   EVMYQDTVKYVENRVQTVGASVKKFYSGMMQDVMQDLLLPSSLEPMKAVAASDLPVEIYA 87

Query: 1114 HTEINKKLKANLKGHH-RGINKELFDDTHVIKGKSKS-----GGVYRRQNIGIKEIVRDS 953
             T   KK    LK    +G +++L +D+ VI   +++               I E    S
Sbjct: 88   ET--LKKPNVGLKEDAIQGDSEQLTEDSEVIADVNENAAHVPSSCQLHMVDNIFESCSGS 145

Query: 952  YPSSKKSDALCLVSGNGIKLSSDSKVRGGFEVASDHMTSPFASVKEHDF---------AE 800
            +     SD L     N   L+     +   E      TS  A   E++F         A 
Sbjct: 146  FVERASSDLLSGEHNNRCTLN-----KTNVEHLLPAETSSEAGCVENEFGRMSSFCGNAN 200

Query: 799  AGKEVSNHIINTNVPPAASDTSLSVEYVGQHQADLRNTS-SVGDMQSESRADRGTYEELA 623
            A  EVS H I   + P + +     + + +   ++++ S SV ++  +     G  E+  
Sbjct: 201  ANHEVSCHQIPATLTPVSVEED-DCDSIEESSNEIKSASDSVPEILPDGLHLVGIVEK-- 257

Query: 622  GETGSNISSNTHNDNIASEGINESCKEISDRFPSATPEKYDLIESDMEIVEHYDQSKLEE 443
                + +     +  I SE  N       D   S+T  +      ++E V+  D+ +++E
Sbjct: 258  ----NEMEMRCSSSIIESEESNGKLNWTKDASGSSTVGR-----KEIETVQQLDKIRVDE 308

Query: 442  TCVLVEADRLHV-PQGSVKRKSYK 374
            +C +V    LH  PQ   K K+Y+
Sbjct: 309  SCFMVNGAELHFHPQREGKHKTYQ 332

