BLASTX nr result

ID: Mentha25_contig00005920 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha25_contig00005920
         (1302 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU19258.1| hypothetical protein MIMGU_mgv1a006213mg [Mimulus...   410   e-112
gb|EPS68415.1| hypothetical protein M569_06358 [Genlisea aurea]       281   3e-73
ref|XP_006346218.1| PREDICTED: uncharacterized protein LOC102582...   275   4e-71
ref|XP_004244123.1| PREDICTED: uncharacterized protein LOC101249...   267   7e-69
ref|XP_007020848.1| Uncharacterized protein isoform 4 [Theobroma...   258   5e-66
ref|XP_007020845.1| Uncharacterized protein isoform 1 [Theobroma...   257   7e-66
ref|XP_002317597.1| hypothetical protein POPTR_0011s14260g [Popu...   255   3e-65
ref|XP_002522945.1| conserved hypothetical protein [Ricinus comm...   254   6e-65
ref|XP_007020846.1| Uncharacterized protein isoform 2 [Theobroma...   252   3e-64
ref|XP_002278240.2| PREDICTED: uncharacterized protein LOC100255...   250   8e-64
ref|XP_006452328.1| hypothetical protein CICLE_v10008166mg [Citr...   243   2e-61
emb|CAN63914.1| hypothetical protein VITISV_004851 [Vitis vinifera]   236   1e-59
gb|EXB50302.1| hypothetical protein L484_017840 [Morus notabilis]     229   2e-57
ref|XP_006858203.1| hypothetical protein AMTR_s00062p00174310 [A...   224   8e-56
ref|XP_007020849.1| Uncharacterized protein isoform 5 [Theobroma...   219   2e-54
ref|XP_004491393.1| PREDICTED: uncharacterized protein LOC101503...   219   3e-54
ref|XP_004512818.1| PREDICTED: uncharacterized protein LOC101490...   213   2e-52
ref|XP_007214595.1| hypothetical protein PRUPE_ppa024431mg [Prun...   205   3e-50
ref|XP_003518524.1| PREDICTED: uncharacterized protein LOC100798...   205   3e-50
gb|AAM96977.1| unknown protein [Arabidopsis thaliana] gi|2319842...   200   1e-48

>gb|EYU19258.1| hypothetical protein MIMGU_mgv1a006213mg [Mimulus guttatus]
          Length = 452

 Score =  410 bits (1053), Expect = e-112
 Identities = 237/434 (54%), Positives = 286/434 (65%), Gaps = 8/434 (1%)
 Frame = -2

Query: 1301 LGIDFDLESXXXXXXXXXXXXXPAKRPSTAPNLRPTEEEDDDDFVSLARVSEPPRPLKRL 1122
            LG+D DL+S             PAKR S A +L PT EED+DDF S  RVS+PPR  KRL
Sbjct: 13   LGLDLDLDSEPHPAPPPNPIPQPAKRASIAASL-PTIEEDNDDFESPVRVSDPPRAFKRL 71

Query: 1121 RRVTTAK--PPAKEPKVEYEDKRCNVXXXXXXXXXXDCFKAGSPPTNSVCSSSKPSLQWN 948
            RR  TA+  P  + PK+   D RC+V          +    GS P+NS  SSSKPSL   
Sbjct: 72   RRGPTARVTPETRNPKLR--DGRCHVDDEIEGFSSEEDCPRGSIPSNSGGSSSKPSLFGQ 129

Query: 947  KVRTSDSGTQWISRKGKEVPSASASINVGTRGSNVIFPKLTSSPLRRFQLIDSDSDNPST 768
               T++SG+QW SRKGK V SASAS+ V  RGS++IFP+LT SPLRRFQLIDSDSD+P  
Sbjct: 130  SAVTTESGSQWRSRKGKGVSSASASVTVEKRGSSLIFPQLTVSPLRRFQLIDSDSDDPPL 189

Query: 767  IEDTHKVLPSAILSPEEKISDFCKQTTFGNQGSKT-SVGKHQIKDLWKDFDQKKSSSIPT 591
                         SP+EK SD  K     N G+K  SVGK++ +DLW+DF  +KS+ +PT
Sbjct: 190  NS-----------SPKEKQSDSLKHGASRNLGAKKESVGKYEKEDLWRDFCSEKSTRVPT 238

Query: 590  PAFDEFCEEYFTGLKNKSMPKVDCKQTDSGMHMDKTSCLPALFYFFHNDSRIQKLVRERL 411
            P FDEFCEEYFT  K K+ P+ + K T++G  +++ S   A  YFFH DSRIQKLVR+RL
Sbjct: 239  PVFDEFCEEYFTKAKTKNKPETNLKNTNNGKKLEEGSLPSAHCYFFHTDSRIQKLVRDRL 298

Query: 410  PHFFPLEVANNQEDT-QENISVIDYMGQFGGEQNARQTNRKQNAEKNIKRSKKNTKTSLV 234
            P+FFPL   NNQE T Q+N  VIDYMGQFG E N+R+T RK +AEK   RSK+N K    
Sbjct: 299  PYFFPLGAVNNQEYTQQQNSPVIDYMGQFGHEDNSRKTVRKNSAEKGPTRSKRNAKK--- 355

Query: 233  DGVSEHSENWINPKSCAGVQKNAGGRKVQAVSG----SSGRWYTGQDGRRAYVSKNGQEL 66
               S+ SENW+NPKS AG+QKNAG R+VQAVS     SSG WYTG DGRR YVSK GQEL
Sbjct: 356  ---SQDSENWVNPKSGAGLQKNAGSRRVQAVSDSSTKSSGHWYTGSDGRRVYVSKKGQEL 412

Query: 65   TGQIAYRQYRKESG 24
            TG+IAY  Y+KESG
Sbjct: 413  TGKIAYMNYKKESG 426


>gb|EPS68415.1| hypothetical protein M569_06358 [Genlisea aurea]
          Length = 418

 Score =  281 bits (720), Expect = 3e-73
 Identities = 171/406 (42%), Positives = 215/406 (52%), Gaps = 6/406 (1%)
 Frame = -2

Query: 1226 RPSTAPNLRP-----TEEEDDDDFVSLARVSEPPRPLKRLRRVTTAKPPAKEPKVEYEDK 1062
            +P   PN  P       EED   F    R S  P+  KRLRR +  +P  +         
Sbjct: 25   QPCPGPNSTPRCDGIASEEDGGGFEPYIRHSGSPQTFKRLRRGSKIRPSYESTDEGSRQP 84

Query: 1061 RCNVXXXXXXXXXXDCFKAGSPPTNSVCSSSKPSLQWNKVRTSDSGTQWISRKGKEVPSA 882
                          +  + G PPTNSVCSSSKPSL+ ++   S+   +W  +KGKE  SA
Sbjct: 85   GDAEDEEIEDFSEEEDMRPGVPPTNSVCSSSKPSLRLHRAVISEFRNRWELKKGKEASSA 144

Query: 881  SASINVGTRGSNVIFPKLTSSPLRRFQLIDSDSDNPSTIEDTHKV-LPSAILSPEEKISD 705
            SA  N+    SN+  P  ++SPLR+FQLIDSDSD+PS IE       P  ILSP+EK  +
Sbjct: 145  SAPTNIQRHDSNLTIPLSSASPLRKFQLIDSDSDDPSEIEGRRSSRAPYMILSPDEKQKN 204

Query: 704  FCKQTTFGNQGSKTSVGKHQIKDLWKDFDQKKSSSIPTPAFDEFCEEYFTGLKNKSMPKV 525
                                  D W +    KSS IPTP FDE C+EYF   +++S  KV
Sbjct: 205  ---------------------GDPWNELCSGKSSHIPTPVFDEVCDEYFKNAEHQSRAKV 243

Query: 524  DCKQTDSGMHMDKTSCLPALFYFFHNDSRIQKLVRERLPHFFPLEVANNQEDTQENISVI 345
            D K ++ G+     S   AL YFFH DSRIQKLVR+RLP+FFPL  A   E  Q NI+ +
Sbjct: 244  DWKDSNIGISRPMDSAPSALSYFFHKDSRIQKLVRDRLPYFFPLGAAKGNEHVQPNITGL 303

Query: 344  DYMGQFGGEQNARQTNRKQNAEKNIKRSKKNTKTSLVDGVSEHSENWINPKSCAGVQKNA 165
            D    F               E+     KKN K S    VS+ S+ W++PKS   V KNA
Sbjct: 304  DSRAHF---------------EEVPSIMKKNKKRSRNKPVSQESDIWVDPKSGGTVPKNA 348

Query: 164  GGRKVQAVSGSSGRWYTGQDGRRAYVSKNGQELTGQIAYRQYRKES 27
            G R+VQA S S+G WYT  DGR+ YVS NGQELTGQIAY+QY+KE+
Sbjct: 349  GNRRVQAASKSTGHWYTNSDGRKVYVSSNGQELTGQIAYKQYKKEN 394


>ref|XP_006346218.1| PREDICTED: uncharacterized protein LOC102582285 [Solanum tuberosum]
          Length = 463

 Score =  275 bits (702), Expect = 4e-71
 Identities = 181/438 (41%), Positives = 243/438 (55%), Gaps = 12/438 (2%)
 Frame = -2

Query: 1301 LGIDFDLESXXXXXXXXXXXXXPAKRPSTAPNLRPTEEEDDD----DFVSLARVSEPPRP 1134
            LG+DFDL+S                 P  + +LR   E DDD      V+  +VS+PP  
Sbjct: 22   LGLDFDLDSEPQSTVL----------PKPSVSLRTINEVDDDFEFPKLVTDPQVSDPPSS 71

Query: 1133 LKRLRRVTTAKPPAKEPKVEYEDKRCNVXXXXXXXXXXDCFKAGSPPTNS-VCSSSKPSL 957
            LKRLRR + +K      K++  +  CNV          +      P  +S VCSSSK  L
Sbjct: 72   LKRLRRGSISKSEPAAQKLKLGETWCNVDDDIEDFSSQEDEPKDHPKCHSSVCSSSKIPL 131

Query: 956  QWNKVRTSDSGTQWISRKGKEVPSASASINVGTRGSNVIFPKLTSSPLRRFQLIDSDSDN 777
            Q  +V +S S ++   RK +    +S   ++ T  SN++FP+LT SPLR+FQLIDSDSD 
Sbjct: 132  QGQRVLSSQSVSRCTGRKKEASNVSSIHQSMETNPSNLVFPELTISPLRKFQLIDSDSDE 191

Query: 776  PSTIEDTHKVLP--SAILSPEEKISDF---CKQTTFGNQGSKTSVGKHQIKDLWKDFDQK 612
            PS  E   +      + LS   + SD    C++ T        S G  + KDLW+DF   
Sbjct: 192  PSKSEFVERESDHVDSPLSGNRQHSDADLSCQRKT------GPSAGTLKTKDLWEDFCSD 245

Query: 611  KSSSIPTPAFDEFCEEYFTGLKN-KSMPKVDCKQTDSGMHMDKTSCLPALFYFFHNDSRI 435
             + +I TPA DE CEEYF  +K+ K         T+S M   +   LPA  YFFH D RI
Sbjct: 246  TTFNIHTPALDEVCEEYFKSVKDGKRTQTTKSGLTESNMR-PQGPLLPAHCYFFHKDPRI 304

Query: 434  QKLVRERLPHFFPLEVANNQEDTQENISVIDYMGQFGGEQNARQTNRKQNAEKNIKRSKK 255
            QKL+R+RLP+FFPL       + Q++ SVIDYMGQF  E  +++T +K     + ++S+K
Sbjct: 305  QKLIRDRLPNFFPLGAYKIPGENQDDASVIDYMGQFCHEGGSKRTAQKSADVTDSRKSRK 364

Query: 254  NTKTSLVDGVSEHSENWINPKSCAGVQKNAGGRKVQAV-SGSSGRWYTGQDGRRAYVSKN 78
            N K       S+ SE W+NPKS AG+ K+AG R+VQAV S S+G WYT  DGR+ YV+ N
Sbjct: 365  NVKQPNNVEESQGSERWVNPKSSAGIPKDAGRRRVQAVGSKSAGHWYTNGDGRKVYVANN 424

Query: 77   GQELTGQIAYRQYRKESG 24
            GQE +GQ AYR YRKESG
Sbjct: 425  GQEFSGQSAYRCYRKESG 442


>ref|XP_004244123.1| PREDICTED: uncharacterized protein LOC101249283 [Solanum
            lycopersicum]
          Length = 463

 Score =  267 bits (683), Expect = 7e-69
 Identities = 177/446 (39%), Positives = 238/446 (53%), Gaps = 13/446 (2%)
 Frame = -2

Query: 1301 LGIDFDLESXXXXXXXXXXXXXPAKRPSTAPNLRPTEE--EDDDDF-----VSLARVSEP 1143
            LG+DFDL+S                 P  + NLR  +E  +DDDDF     V+  +VS+P
Sbjct: 22   LGLDFDLDSEPQSTVL----------PKPSVNLRTIKEVVDDDDDFEFPKLVTDPQVSDP 71

Query: 1142 PRPLKRLRRVTTAKPPAKEPKVEYEDKRCNVXXXXXXXXXXDCFKAGSPPTNS-VCSSSK 966
               LKRLRR + +K      K++  +  CNV          +      P  +S V SSSK
Sbjct: 72   TSSLKRLRRGSISKSEPVAQKLKLGETWCNVDDDIEDFSSQEDEPKDHPKCHSSVRSSSK 131

Query: 965  PSLQWNKVRTSDSGTQWISRKGKEVPSASASINVGTRGSNVIFPKLTSSPLRRFQLIDSD 786
              LQ  +V +S S ++   RK +    +S   +  T  SN++FP+LT SPLRRFQLIDSD
Sbjct: 132  IPLQGQRVISSQSVSRCTGRKKEASNVSSVHQSKETNPSNLVFPELTISPLRRFQLIDSD 191

Query: 785  SDNPSTIE----DTHKVLPSAILSPEEKISDFCKQTTFGNQGSKTSVGKHQIKDLWKDFD 618
            SD PS  E    ++  V     ++ +   +D   Q   G    KT       KDLW+DF 
Sbjct: 192  SDEPSKSEFVERESDHVDSPLNVNRQHSDADLSYQRKTGPSALKT-------KDLWEDFC 244

Query: 617  QKKSSSIPTPAFDEFCEEYFTGLKNKSMPKVDCKQTDSGMHMDKTSCLPALFYFFHNDSR 438
               + +I TPA DE CEEYF  +K+    +             +   LPA  YFFH D R
Sbjct: 245  SDTTFNIHTPALDEVCEEYFKSVKDGKRTQTTKGGLTESYMRPQGPLLPAHCYFFHKDPR 304

Query: 437  IQKLVRERLPHFFPLEVANNQEDTQENISVIDYMGQFGGEQNARQTNRKQNAEKNIKRSK 258
            IQKLVR+RLP+FFPL   N      ++ SVIDYMGQF  E  +++T +K     N ++S+
Sbjct: 305  IQKLVRDRLPNFFPLGADNLPGGNLDDASVIDYMGQFSHEGGSKRTAQKSADGTNSRKSR 364

Query: 257  KNTKTSLVDGVSEHSENWINPKSCAGVQKNAGGRKVQAVSGSSGRWYTGQDGRRAYVSKN 78
            KN K       S+ SE W+NPKS AG+ K+AG R+VQAV  S+G WYT  DGR+ YV  N
Sbjct: 365  KNVKQPNNVEESQGSERWVNPKSSAGIPKDAGRRRVQAVGKSAGHWYTNGDGRKVYVDNN 424

Query: 77   GQELTGQIAYRQYRKE-SGMSRNLKK 3
            GQE +G+ AY  YRKE  G +++ KK
Sbjct: 425  GQEFSGRSAYICYRKEKGGFNKSTKK 450


>ref|XP_007020848.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508720476|gb|EOY12373.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 452

 Score =  258 bits (658), Expect = 5e-66
 Identities = 174/456 (38%), Positives = 242/456 (53%), Gaps = 23/456 (5%)
 Frame = -2

Query: 1301 LGIDFDLESXXXXXXXXXXXXXPAKRPSTAPNLRPTEEEDDD-----DFVSLARVSEPPR 1137
            LG+D D ++              A  P ++ +   TE+ DD+     +        EPPR
Sbjct: 11   LGLDLDPDTEPRSPTGNHPGPILA--PDSSASFDATEDGDDEFGPEQEVKDSDTPPEPPR 68

Query: 1136 PLKRLRRVTTAKPPAKEPKVEYEDKRCNVXXXXXXXXXXDCFKA------GSPPTNSVCS 975
             LKRLRR        K+     E ++  V           C          S   +SVC 
Sbjct: 69   VLKRLRRAGDKSSATKK-----ESEKPLVWNDGDDEIEEFCSSQEKNDVDSSTQNHSVCG 123

Query: 974  SSKPSLQWNKVRTSDSGTQWISRKGKEVPSASASINVGTRGSNVIFPKLTSSPLRRFQLI 795
            SSK SL+   V T+ S  Q  SRK ++V  A A+ ++  R   +IFPKL  SPLRRF+L+
Sbjct: 124  SSKISLKGLGVLTTQSSGQCSSRKKEQVSDAPATASLEARHGGLIFPKLNISPLRRFKLL 183

Query: 794  DSDSDN---PSTIEDTHKVLPSAILSPEEKISDFCKQTTFGNQGSKTSVGKHQIKDLWKD 624
            DSDSD    PS  +DT K   +  + P  K     +Q+T  N+  K SV   Q +DLWKD
Sbjct: 184  DSDSDGSEGPSDCDDTSK--GACKIDPPSKE----QQSTISNKKRKASVVTPQNEDLWKD 237

Query: 623  FDQKKSSSIPTPAFDEFCEEYFTGLKN-KSMPKVDCKQTDSGMHMDKTSCLPALFYFFHN 447
            F    +S IPTPAFDE  +EYF  +K+  +  K++ ++ +  +++D     PA  YFFH+
Sbjct: 238  FTPINTSHIPTPAFDEVFKEYFQSVKDTNAAQKLENQKFEELLNLDDP-LPPAHCYFFHD 296

Query: 446  DSRIQKLVRERLPHFFPLEVANNQEDTQENISVIDYMGQFGGEQNARQTNRKQNAEKNIK 267
            D RIQKLVR RLP F PL +  N  + Q N+SVIDYM QF   ++++Q   ++   K   
Sbjct: 297  DPRIQKLVRSRLPFFSPLHMVKNGGNQQHNVSVIDYMSQFSNGESSKQRGSQKGGGKKCS 356

Query: 266  RSK-KNTKTSLVDGVSEHSENWINPKSCAGVQKNAGGRKVQAVSGSSGRWYTGQDGRRAY 90
             S+ K +K S  +  +  SE W++ KS A + KNAG R+V A    +G WYT  +GR+ Y
Sbjct: 357  MSRRKKSKNSKAEETA--SEGWVDLKSSAAIPKNAGKRRVHASDQPAGHWYTSPEGRKVY 414

Query: 89   VSKNGQELTGQIAYRQYRKESG-------MSRNLKK 3
            VS++GQEL+GQ+AYR YRKESG         RN KK
Sbjct: 415  VSRSGQELSGQMAYRHYRKESGAGFRKSKKKRNAKK 450


>ref|XP_007020845.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508720473|gb|EOY12370.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 453

 Score =  257 bits (657), Expect = 7e-66
 Identities = 174/457 (38%), Positives = 242/457 (52%), Gaps = 24/457 (5%)
 Frame = -2

Query: 1301 LGIDFDLESXXXXXXXXXXXXXPAKRPSTAPNLRPTEEEDDD-----DFVSLARVSEPPR 1137
            LG+D D ++              A  P ++ +   TE+ DD+     +        EPPR
Sbjct: 11   LGLDLDPDTEPRSPTGNHPGPILA--PDSSASFDATEDGDDEFGPEQEVKDSDTPPEPPR 68

Query: 1136 PLKRLRRVTTAKPPAKEPKVEYEDKRCNVXXXXXXXXXXDCFKA-------GSPPTNSVC 978
             LKRLRR        K+     E ++  V           C           S   +SVC
Sbjct: 69   VLKRLRRAGDKSSATKK-----ESEKPLVWNDGDDEIEEFCSSQEKNADVDSSTQNHSVC 123

Query: 977  SSSKPSLQWNKVRTSDSGTQWISRKGKEVPSASASINVGTRGSNVIFPKLTSSPLRRFQL 798
             SSK SL+   V T+ S  Q  SRK ++V  A A+ ++  R   +IFPKL  SPLRRF+L
Sbjct: 124  GSSKISLKGLGVLTTQSSGQCSSRKKEQVSDAPATASLEARHGGLIFPKLNISPLRRFKL 183

Query: 797  IDSDSDN---PSTIEDTHKVLPSAILSPEEKISDFCKQTTFGNQGSKTSVGKHQIKDLWK 627
            +DSDSD    PS  +DT K   +  + P  K     +Q+T  N+  K SV   Q +DLWK
Sbjct: 184  LDSDSDGSEGPSDCDDTSK--GACKIDPPSKE----QQSTISNKKRKASVVTPQNEDLWK 237

Query: 626  DFDQKKSSSIPTPAFDEFCEEYFTGLKN-KSMPKVDCKQTDSGMHMDKTSCLPALFYFFH 450
            DF    +S IPTPAFDE  +EYF  +K+  +  K++ ++ +  +++D     PA  YFFH
Sbjct: 238  DFTPINTSHIPTPAFDEVFKEYFQSVKDTNAAQKLENQKFEELLNLDDP-LPPAHCYFFH 296

Query: 449  NDSRIQKLVRERLPHFFPLEVANNQEDTQENISVIDYMGQFGGEQNARQTNRKQNAEKNI 270
            +D RIQKLVR RLP F PL +  N  + Q N+SVIDYM QF   ++++Q   ++   K  
Sbjct: 297  DDPRIQKLVRSRLPFFSPLHMVKNGGNQQHNVSVIDYMSQFSNGESSKQRGSQKGGGKKC 356

Query: 269  KRSK-KNTKTSLVDGVSEHSENWINPKSCAGVQKNAGGRKVQAVSGSSGRWYTGQDGRRA 93
              S+ K +K S  +  +  SE W++ KS A + KNAG R+V A    +G WYT  +GR+ 
Sbjct: 357  SMSRRKKSKNSKAEETA--SEGWVDLKSSAAIPKNAGKRRVHASDQPAGHWYTSPEGRKV 414

Query: 92   YVSKNGQELTGQIAYRQYRKESG-------MSRNLKK 3
            YVS++GQEL+GQ+AYR YRKESG         RN KK
Sbjct: 415  YVSRSGQELSGQMAYRHYRKESGAGFRKSKKKRNAKK 451


>ref|XP_002317597.1| hypothetical protein POPTR_0011s14260g [Populus trichocarpa]
            gi|222860662|gb|EEE98209.1| hypothetical protein
            POPTR_0011s14260g [Populus trichocarpa]
          Length = 497

 Score =  255 bits (652), Expect = 3e-65
 Identities = 173/472 (36%), Positives = 233/472 (49%), Gaps = 39/472 (8%)
 Frame = -2

Query: 1301 LGIDFDLESXXXXXXXXXXXXXPAKRPSTAPNL---------RPTEEEDDDDFVSLARVS 1149
            LG+D D+ES                 P+++ N          + T+ E++++ +    + 
Sbjct: 11   LGLDLDIESEPRIPTHHFQTSTLNPAPNSSSNTPSDDQNGGPQVTDSEEEEEEIGPDVMD 70

Query: 1148 EPPRP-------LKRLRRVTTAKPPAKEPKVEYEDKRCN-----VXXXXXXXXXXDCFKA 1005
              P P       L+RLRR   A   +K  KVE E   C+     +               
Sbjct: 71   SDPEPGPGPTRVLRRLRR-GPATQKSKVRKVELEGFCCDHGDDDIEEFSSQEDLGVRDAK 129

Query: 1004 GSPPTNSVCSSSKPSLQWNKVRTSDSGTQWISRKGKEVPSASASINVGTRGSNVIFPKLT 825
             S    SVCSSSK  L+   V TS S +     K ++   AS S ++ T  S ++FPKLT
Sbjct: 130  VSTQFTSVCSSSKVPLKGCGVLTSQSPSLLKGNKKEQASIASVSSSLETGHSGLMFPKLT 189

Query: 824  SSPLRRFQLIDSDSDNPSTIEDTHKVLPSAILSPEEKISDFCKQTTFGNQGSKTSVGKHQ 645
             SPLRRFQLIDSDSD  S   D          S +++      Q T   + +KT +G+H+
Sbjct: 190  ISPLRRFQLIDSDSDEASISADASGKTQKTDSSSKKQ------QPTTSERKNKTLLGEHR 243

Query: 644  IKDLWKDFDQKKSSSIPTPAFDEFCEEYFTGL---KNKSMPKVDCKQTDSG--MHMDKTS 480
             +DLWKDF   KS  + TP  DE C EYF  L   KNK+       QT      H D  S
Sbjct: 244  NEDLWKDFCPIKSYPVQTPVLDEMCNEYFQSLQDNKNKAHKLQSNLQTGDSTRFHQDPNS 303

Query: 479  CL-------------PALFYFFHNDSRIQKLVRERLPHFFPLEVANNQEDTQENISVIDY 339
             +             PA  YFFH D RIQ+LV  RLP+FFPL + NN+ +     S IDY
Sbjct: 304  MVDFQQCWNLADPLPPAHHYFFHEDLRIQRLVHSRLPYFFPLGIVNNKGNQLITESAIDY 363

Query: 338  MGQFGGEQNARQTNRKQNAEKNIKRSKKNTKTSLVDGVSEHSENWINPKSCAGVQKNAGG 159
            M QF  E + +Q  ++ N+EK   R +  +K S    VS  SE W++PKS   + K+AG 
Sbjct: 364  MSQFNREASRKQGTQRTNSEKGSTRGRNKSKKSNAGEVSLASEGWVDPKSSTAIPKDAGK 423

Query: 158  RKVQAVSGSSGRWYTGQDGRRAYVSKNGQELTGQIAYRQYRKESGMSRNLKK 3
            R+V A     G WYT  +GR+ Y+SKNGQEL+GQIAYR Y+K+SG  R  KK
Sbjct: 424  RRVHASDQGDGHWYTSPEGRKVYISKNGQELSGQIAYRHYKKDSGGFRRSKK 475


>ref|XP_002522945.1| conserved hypothetical protein [Ricinus communis]
            gi|223537757|gb|EEF39375.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 477

 Score =  254 bits (649), Expect = 6e-65
 Identities = 174/430 (40%), Positives = 228/430 (53%), Gaps = 35/430 (8%)
 Frame = -2

Query: 1187 EDDDDFVSLARVSEP------PRPLKRLRRVTTAKPPAKEPKVEYEDKRC-NVXXXXXXX 1029
            EDDDDF      S+P      PR  KRLRR    +    E K E E   C N        
Sbjct: 46   EDDDDFGLEVVDSDPETGPSSPRVFKRLRRGPAVEESRME-KREQEKVFCDNGDDEIEEF 104

Query: 1028 XXXDCFKAGSPPT---NSVCSSSKPSLQWNKVRTSDSGTQWISRKGKEVPS-ASASINVG 861
               + F   + P+   NSVCSSSK  L    V  +   ++ +  K KE  S A +S  +G
Sbjct: 105  SSQEDFIRDAYPSAEYNSVCSSSKIPLHGCGVSLTTQSSKQLKEKKKERASDAPSSSCLG 164

Query: 860  TRGSNVIFPKLTSSPLRRFQLIDSDSDNPSTIEDTHKVLPSAILSPEEKISDFCKQTTFG 681
            T  + +IFP LT SPLRRFQLIDSDS+ PST  D  + +    LS +E+  + C++    
Sbjct: 165  TGNNGLIFPNLTISPLRRFQLIDSDSEEPSTRNDVSRKISGTDLSSKERQPNSCEKKR-- 222

Query: 680  NQGSKTSVGKHQIKDLWKDFDQKKSSSIPTPAFDEFCEEYFTGLKN-KSMPKVDC---KQ 513
                  S  KHQ +DLWKDF  KKS  +PTP  DE CEEYF  L++  S  K+     K 
Sbjct: 223  ----NPSAEKHQSEDLWKDFCPKKSFHVPTPVLDEVCEEYFQSLRDTNSAKKLGTNLPKD 278

Query: 512  TDSGMHMDKTSCL-------------PALFYFFHNDSRIQKLVRERLPHFFPLEVANNQE 372
               G H+D  +               PA  YF H+DSRIQ LVR RLP+F PL + NN+E
Sbjct: 279  GGVGCHLDANTIAGFEQSWNLADPLPPAYNYFCHDDSRIQSLVRSRLPNFSPLCIINNRE 338

Query: 371  DTQENISVIDYMGQFGGEQNARQTNRKQNAEKNIKRSKKNTKTSLVDGVSEHSENWINPK 192
            + Q +  VI+YM QF GE + +    + N  K+  R +  +K S+V      S+ WI+PK
Sbjct: 339  NHQPSEPVINYMSQFNGEASKKGGTCRNN-NKDSTRGRSKSKKSIVKEALPASQVWIDPK 397

Query: 191  SCAGVQKNAGGRKVQAVSGSSGRWYTGQDGRRAYVSKNGQELTGQIAYRQYRK------- 33
              A + K+AG R+V A   ++G WYT  +GR+ YVS++GQELTGQ+AYR YRK       
Sbjct: 398  RSASIPKDAGKRRVHANGQAAGHWYTSPEGRKVYVSRSGQELTGQMAYRHYRKASSGTSS 457

Query: 32   ESGMSRNLKK 3
            ESG  R  KK
Sbjct: 458  ESGGYRKSKK 467


>ref|XP_007020846.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508720474|gb|EOY12371.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 447

 Score =  252 bits (643), Expect = 3e-64
 Identities = 167/440 (37%), Positives = 236/440 (53%), Gaps = 16/440 (3%)
 Frame = -2

Query: 1301 LGIDFDLESXXXXXXXXXXXXXPAKRPSTAPNLRPTEEEDDD-----DFVSLARVSEPPR 1137
            LG+D D ++              A  P ++ +   TE+ DD+     +        EPPR
Sbjct: 11   LGLDLDPDTEPRSPTGNHPGPILA--PDSSASFDATEDGDDEFGPEQEVKDSDTPPEPPR 68

Query: 1136 PLKRLRRVTTAKPPAKEPKVEYEDKRCNVXXXXXXXXXXDCFKA------GSPPTNSVCS 975
             LKRLRR        K+     E ++  V           C          S   +SVC 
Sbjct: 69   VLKRLRRAGDKSSATKK-----ESEKPLVWNDGDDEIEEFCSSQEKNDVDSSTQNHSVCG 123

Query: 974  SSKPSLQWNKVRTSDSGTQWISRKGKEVPSASASINVGTRGSNVIFPKLTSSPLRRFQLI 795
            SSK SL+   V T+ S  Q  SRK ++V  A A+ ++  R   +IFPKL  SPLRRF+L+
Sbjct: 124  SSKISLKGLGVLTTQSSGQCSSRKKEQVSDAPATASLEARHGGLIFPKLNISPLRRFKLL 183

Query: 794  DSDSDN---PSTIEDTHKVLPSAILSPEEKISDFCKQTTFGNQGSKTSVGKHQIKDLWKD 624
            DSDSD    PS  +DT K   +  + P  K     +Q+T  N+  K SV   Q +DLWKD
Sbjct: 184  DSDSDGSEGPSDCDDTSK--GACKIDPPSKE----QQSTISNKKRKASVVTPQNEDLWKD 237

Query: 623  FDQKKSSSIPTPAFDEFCEEYFTGLKN-KSMPKVDCKQTDSGMHMDKTSCLPALFYFFHN 447
            F    +S IPTPAFDE  +EYF  +K+  +  K++ ++ +  +++D     PA  YFFH+
Sbjct: 238  FTPINTSHIPTPAFDEVFKEYFQSVKDTNAAQKLENQKFEELLNLDDP-LPPAHCYFFHD 296

Query: 446  DSRIQKLVRERLPHFFPLEVANNQEDTQENISVIDYMGQFGGEQNARQTNRKQNAEKNIK 267
            D RIQKLVR RLP F PL +  N  + Q N+SVIDYM QF   ++++Q   ++   K   
Sbjct: 297  DPRIQKLVRSRLPFFSPLHMVKNGGNQQHNVSVIDYMSQFSNGESSKQRGSQKGGGKKCS 356

Query: 266  RSK-KNTKTSLVDGVSEHSENWINPKSCAGVQKNAGGRKVQAVSGSSGRWYTGQDGRRAY 90
             S+ K +K S  +  +  SE W++ KS A + KNAG R+V A    +G WYT  +GR+ Y
Sbjct: 357  MSRRKKSKNSKAEETA--SEGWVDLKSSAAIPKNAGKRRVHASDQPAGHWYTSPEGRKVY 414

Query: 89   VSKNGQELTGQIAYRQYRKE 30
            VS++GQEL+GQ+AYR YRK+
Sbjct: 415  VSRSGQELSGQMAYRHYRKK 434


>ref|XP_002278240.2| PREDICTED: uncharacterized protein LOC100255618 [Vitis vinifera]
          Length = 470

 Score =  250 bits (639), Expect = 8e-64
 Identities = 156/399 (39%), Positives = 222/399 (55%), Gaps = 22/399 (5%)
 Frame = -2

Query: 1133 LKRLRRVTTAKPPAKEPKVEYEDKRCNVXXXXXXXXXXDCFKAGSPPT---NSVCSSSKP 963
            LKRLRR      P +  + E  +  CNV          + F+    P+   +SVCSSSK 
Sbjct: 63   LKRLRR-----GPGRVHRRELAEAWCNVDEEIEEFSSQEGFRRDEHPSTQYHSVCSSSKF 117

Query: 962  SLQWNKVRTSDSGTQWISRKGKEVPSASASINVGTRGSNVIFPKLTSSPLRRFQLIDSDS 783
             L+ + V TS S +   + K ++  +  AS ++ T  S ++FPKLT SPLRRFQL+DSD 
Sbjct: 118  PLRASGVLTSRSASHRKAGKREQASNHPASSSLETSSSKLMFPKLTISPLRRFQLLDSDD 177

Query: 782  DNPSTIEDTHKVLPSAILSPEEKISDFCKQTTFG-NQGSKTSVGKHQIKDLWKDFDQKKS 606
            D+PS IED ++   +   S + + S+  + +    ++ +KT V   Q  DLWKDF   +S
Sbjct: 178  DDPSVIEDANQEAKNTHPSAKVRQSNHRQYSCASEDKSTKTFVSMPQNVDLWKDFWPNRS 237

Query: 605  SSIPTPAFDEFCEEYFTGLKNKSMP----KVDCKQTDSGMHMDKTS-------------C 477
              IPTPA DE CEEYF  +K+K++        C   +   + +K +              
Sbjct: 238  VGIPTPALDEVCEEYFRSVKDKNVTVKLGSDGCISNEKRSYQNKNNRKTVQHQLDLADPL 297

Query: 476  LPALFYFFHNDSRIQKLVRERLPHFFPLEVANNQEDTQENISVIDYMGQFG-GEQNARQT 300
             PA  YFFH D RIQKLVR RLP+F PL V +N  + Q   SVIDYM QF  GE + +Q 
Sbjct: 298  PPAHRYFFHADPRIQKLVRSRLPNFSPLGVVSNT-NMQHGASVIDYMSQFSHGEASKKQV 356

Query: 299  NRKQNAEKNIKRSKKNTKTSLVDGVSEHSENWINPKSCAGVQKNAGGRKVQAVSGSSGRW 120
            N+  +  ++  +++KN +    D     S +W+NPKSCA + K AG  +V A   S+ RW
Sbjct: 357  NQDVSIGRSTMQARKNARKFNADEALNASGSWVNPKSCASIPKKAGKGQVHANGQSASRW 416

Query: 119  YTGQDGRRAYVSKNGQELTGQIAYRQYRKESGMSRNLKK 3
            YT  DGR+ YV+K+GQELTG +AYR Y+K++G+     K
Sbjct: 417  YTSPDGRKVYVTKSGQELTGSMAYRHYKKDNGVRSKKSK 455


>ref|XP_006452328.1| hypothetical protein CICLE_v10008166mg [Citrus clementina]
            gi|568842498|ref|XP_006475183.1| PREDICTED:
            uncharacterized protein LOC102619494 [Citrus sinensis]
            gi|557555554|gb|ESR65568.1| hypothetical protein
            CICLE_v10008166mg [Citrus clementina]
          Length = 477

 Score =  243 bits (619), Expect = 2e-61
 Identities = 165/409 (40%), Positives = 218/409 (53%), Gaps = 34/409 (8%)
 Frame = -2

Query: 1148 EPPRPLKRLRRVTTAKPPAKEPKV-------EYEDKRCN------VXXXXXXXXXXDCFK 1008
            EP R LKRLRR      PA    V       E E   C+      +             +
Sbjct: 66   EPTRVLKRLRRGVVRPAPALTNPVSSSVKTQELERSSCDGNGDDDIEDFSSQEDLLVRDE 125

Query: 1007 AGSPPTNSVCSSSKPSLQWNKVRTSDSGTQWISRKGKEVPSASASINVGTRGSNVIFPKL 828
                  NSVCSSSK  L+   V T+ S +   +RK +    A +S ++ T  S ++FPKL
Sbjct: 126  HQPAQYNSVCSSSKIPLRGCGVLTTQSSSVSKTRKRELASDAPSSASMETSHSGLLFPKL 185

Query: 827  TSSPLRRFQLIDSDSDN--PSTIED----THKVLPSAILSPEEKISDFCKQTTFGNQGSK 666
            T SPLRRFQL+DSDSD+  P   ED    +HK+ P     P + +       T  +Q  K
Sbjct: 186  TVSPLRRFQLLDSDSDSDHPYVSEDIKKGSHKIEP-----PSKGLG-----LTASDQKRK 235

Query: 665  TSVGKHQIKDLWKDFDQKKSSSIPTPAFDEFCEEYFTGLKNKSMPKVD--------CKQT 510
              V + Q +DLWKDF   KS  IPTPA DE CEEYF   KNK+   +D        C  T
Sbjct: 236  VLVDRPQNEDLWKDFCPAKSFHIPTPALDEVCEEYFQSFKNKNAASIDAYLGNSRECHAT 295

Query: 509  DSGMHM-----DKTSCLPALF-YFFHNDSRIQKLVRERLPHFFPLEVANNQEDTQENISV 348
             S   +     D TS LP    YFFH+D RIQKLVR RLP+F PL +  + E+ Q    V
Sbjct: 296  ASTSEIFEQCWDSTSPLPPSHGYFFHDDPRIQKLVRSRLPNFSPLGIVASIENQQPCAPV 355

Query: 347  IDYMGQFG-GEQNARQTNRKQNAEKNIKRSKKNTKTSLVDGVSEHSENWINPKSCAGVQK 171
            I+YM QF  GE +  +  +K N++K+  R +  +K S        SE W++PKS +   K
Sbjct: 356  INYMSQFSNGESSKPKGTQKINSKKSSTRGRNKSKKS------NASEGWVDPKSSSTAPK 409

Query: 170  NAGGRKVQAVSGSSGRWYTGQDGRRAYVSKNGQELTGQIAYRQYRKESG 24
            +AG R+V A + S+G WYT  +GR+ Y+S++GQEL+GQ AYRQYRKE+G
Sbjct: 410  DAGKRRVHATTQSAGHWYTSPEGRKVYISRSGQELSGQTAYRQYRKENG 458


>emb|CAN63914.1| hypothetical protein VITISV_004851 [Vitis vinifera]
          Length = 510

 Score =  236 bits (603), Expect = 1e-59
 Identities = 140/344 (40%), Positives = 199/344 (57%), Gaps = 19/344 (5%)
 Frame = -2

Query: 977  SSSKPSLQWNKVRTSDSGTQWISRKGKEVPSASASINVGTRGSNVIFPKLTSSPLRRFQL 798
            SSSK  L+ + V TS S +   + K ++  +  AS ++ T  S ++FPKLT SPLRRFQL
Sbjct: 153  SSSKFPLRASGVLTSRSASHRKAGKREQASNHPASSSLETSSSKLMFPKLTISPLRRFQL 212

Query: 797  IDSDSDNPSTIEDTHKVLPSAILSPEEKISDFCKQTTFG-NQGSKTSVGKHQIKDLWKDF 621
            +DSD D+PS IED ++   +   S + + S+  + +    ++ +KT V   Q  DLWKDF
Sbjct: 213  LDSDDDDPSVIEDANQEAKNTHPSAKVRQSNHRQYSCASEDKSTKTFVSMPQNVDLWKDF 272

Query: 620  DQKKSSSIPTPAFDEFCEEYFTGLKNKSMP----KVDCKQTDSGMHMDKTS--------- 480
               +S  IPTPA DE CEEYF  +K+K++        C   +   + +K +         
Sbjct: 273  WPNRSVGIPTPALDEVCEEYFRSVKDKNVTVKLGSDGCISNEKRSYQNKNNRKTVQHQLD 332

Query: 479  ----CLPALFYFFHNDSRIQKLVRERLPHFFPLEVANNQEDTQENISVIDYMGQFG-GEQ 315
                  PA  YFFH D RIQKLVR RLP+F PL V +N  + Q   SVIDYM QF  GE 
Sbjct: 333  LADPLPPAHRYFFHADPRIQKLVRSRLPNFSPLGVVSNT-NMQHGASVIDYMSQFSHGEA 391

Query: 314  NARQTNRKQNAEKNIKRSKKNTKTSLVDGVSEHSENWINPKSCAGVQKNAGGRKVQAVSG 135
            + +Q N+  +  ++  +++KN +    D     S +W+NPKSCA + K AG  +V A   
Sbjct: 392  SKKQVNQDVSIGRSTMQARKNARKFNADEALNASGSWVNPKSCASIPKKAGKGQVHANGQ 451

Query: 134  SSGRWYTGQDGRRAYVSKNGQELTGQIAYRQYRKESGMSRNLKK 3
            S+ RWYT  DGR+ YV+K+GQELTG +AYR Y+K++G+     K
Sbjct: 452  SASRWYTSPDGRKVYVTKSGQELTGSMAYRHYKKDNGVRSKKSK 495


>gb|EXB50302.1| hypothetical protein L484_017840 [Morus notabilis]
          Length = 523

 Score =  229 bits (584), Expect = 2e-57
 Identities = 170/488 (34%), Positives = 230/488 (47%), Gaps = 89/488 (18%)
 Frame = -2

Query: 1220 STAPNLRPTEEEDDDDFVSLARV-----SEPPRPLKRLRR-------VTTAKPPAKEPKV 1077
            ST+P L+     D D    +A       SEPPR LKRLRR        T  +    E  +
Sbjct: 36   STSPTLQDDAGGDTDFGPRVAESDPESRSEPPRVLKRLRRGPPQLRETTALRSCVAEDDI 95

Query: 1076 EYEDKRCNVXXXXXXXXXXDCFKAGSPPTN--SVCSSSKPSLQWNKVRTSDSGTQWISRK 903
            E    + +V                 PPT   S+CSSSK  L      T  S ++W +R 
Sbjct: 96   EEFSSQEDVLEELH------------PPTQYRSMCSSSKIPLHGCGAITKQS-SEWKARN 142

Query: 902  GKEVPSASASINVGTRGSNVIFPKLTSSPLRRFQLIDSDSDNPSTIEDTHKVL----PSA 735
             + V +A+AS +     S  +FPKLT SPLR+FQLIDSDSD PST E   KV+    P  
Sbjct: 143  KEPVSTATASASAEISHSERLFPKLTISPLRKFQLIDSDSDEPSTSE---KVMIMGDPQI 199

Query: 734  ILSPEEKISDFCKQTTFGNQGSKTSVGKHQIKDLWKDFDQKKSSSIPTPAFDEFCEEYFT 555
              S +++ S+  +  T   Q    S    +  DLWKDF   KS  IPTPA DE C +YF 
Sbjct: 200  DQSSKKQQSNHGQSATTSGQKRNASDCMPKSADLWKDFCPVKSFRIPTPALDEMCNQYFH 259

Query: 554  GLKNKSMP-----------KVDCKQTDSGMHMDK-----TSCLPALFYFFHNDSRIQKLV 423
             +K+K+                 ++T +G  +++        LPA  YF H+D RI+KLV
Sbjct: 260  SVKDKNASVKLGSDKSVKSSSGFRETTNGQSIEQPWNTANLILPAHRYFLHHDPRIRKLV 319

Query: 422  RERLPHFFPLEVANNQEDTQENISVIDYMGQFGG--------------EQNARQTNRKQN 285
            R RLP+FFPL +  N E+ Q   +VIDYMGQFG               E+N+++   K N
Sbjct: 320  RNRLPNFFPLGIDENNENQQNGAAVIDYMGQFGNREPSKRQATQQVDPERNSKRGRTKAN 379

Query: 284  AEKNIKRSKKNTKTSLVDGVSEHSENWINPK----------------------------- 192
            AE + K+    ++     GV   SE W++PK                             
Sbjct: 380  AENSSKKQSNTSRRLNEGGVLHASEGWVDPKRGKVAGKKSVKKSSKNRRNTAQASSAGEG 439

Query: 191  ------------SCAGVQKNAGGRKVQAVSGSSGRWYTGQDGRRAYVSKNGQELTGQIAY 48
                        S    +KN+G +   +   S G+WYTG +GR+ YV+KNGQELTGQIAY
Sbjct: 440  LHDSGSWLDPRSSVTSNKKNSGKQGSHSNGQSVGQWYTGPNGRKVYVNKNGQELTGQIAY 499

Query: 47   RQYRKESG 24
            RQY+K+ G
Sbjct: 500  RQYKKDKG 507


>ref|XP_006858203.1| hypothetical protein AMTR_s00062p00174310 [Amborella trichopoda]
            gi|548862306|gb|ERN19670.1| hypothetical protein
            AMTR_s00062p00174310 [Amborella trichopoda]
          Length = 540

 Score =  224 bits (570), Expect = 8e-56
 Identities = 155/444 (34%), Positives = 211/444 (47%), Gaps = 36/444 (8%)
 Frame = -2

Query: 1226 RPSTAPNLRPTEEEDDDDFVSLARVSEPPRPLKRLRRVTTAKPPAKEPKVEYEDKRCNVX 1047
            R + A    P       D V   + SEP   +  L R+    P     KV+ +  R N  
Sbjct: 77   RSNEAEVFYPKRPNSSLDLVKELQSSEPEPAVHVLNRLRRG-PSQSASKVKCKLSRDNED 135

Query: 1046 XXXXXXXXXDCFKAGSPPTNS---VCSSSKPSLQWNKVRTSDSGTQWISRKGKEVPSASA 876
                     D   A   P+      CSSS+ SL    V TS       S K      AS 
Sbjct: 136  DIEDISSEEDYPNADDYPSTQNHFACSSSRLSLHGRGVLTSQLTNDRRSEKPSVASDASL 195

Query: 875  SINVGTRGSNVIFPKLTSSPLRRFQLIDSDSDNPSTIEDTHKVLPSAILSPEEKISDFCK 696
              +     +   FP++T SP+R+FQL+DSDSD+PS+ +D    +   + S + K+S    
Sbjct: 196  LSSFDGNSNKKAFPRITISPIRKFQLLDSDSDDPSSSKDVPTSVKK-VASAQVKVSHSVL 254

Query: 695  QTTFGNQGSKTSVGKHQIKDLWKDFDQKKSSSIPTPAFDEFCEEYFTGLKNKSMPKVDCK 516
            +      G    + + Q   LWKDF  K+S  + TPA DEFC+EYF+ +  ++   V C+
Sbjct: 255  EIHEQKGGKNLKIPQSQ--SLWKDFSAKESVKLKTPALDEFCKEYFSTVNARN--PVQCQ 310

Query: 515  QTDSGMHMDKT----SCL---------------------------PALFYFFHNDSRIQK 429
            + DS     K     SCL                           PA  YF+H+D RI+ 
Sbjct: 311  REDSNSSTSKLFVSDSCLIDGFDHIQENAAHKIVHRHDNVGDPLPPAYGYFYHDDQRIRD 370

Query: 428  LVRERLPHFFPLEVANNQEDTQENISVIDYMGQFG--GEQNARQTNRKQNAEKNIKRSKK 255
            LVR RLP+F PL  AN   + + +  +IDYM QFG  G QN  ++   +  E + K+ +K
Sbjct: 371  LVRRRLPYFCPLGAANFGGNCRSDEVLIDYMSQFGQRGGQNQPRSTLNEGNEGSSKKKRK 430

Query: 254  NTKTSLVDGVSEHSENWINPKSCAGVQKNAGGRKVQAVSGSSGRWYTGQDGRRAYVSKNG 75
                       + S+ W+NPKS     K+AG R+V A   SSG WYTG+DGR+ YV+KNG
Sbjct: 431  TQSKGKAKRAPQTSDGWVNPKSEVNPPKDAGKRRVSADGVSSGHWYTGEDGRKVYVTKNG 490

Query: 74   QELTGQIAYRQYRKESGMSRNLKK 3
            QELTGQ AYR YRKESGM    KK
Sbjct: 491  QELTGQTAYRHYRKESGMGYKRKK 514


>ref|XP_007020849.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508720477|gb|EOY12374.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 429

 Score =  219 bits (558), Expect = 2e-54
 Identities = 153/418 (36%), Positives = 217/418 (51%), Gaps = 16/418 (3%)
 Frame = -2

Query: 1301 LGIDFDLESXXXXXXXXXXXXXPAKRPSTAPNLRPTEEEDDD-----DFVSLARVSEPPR 1137
            LG+D D ++              A  P ++ +   TE+ DD+     +        EPPR
Sbjct: 11   LGLDLDPDTEPRSPTGNHPGPILA--PDSSASFDATEDGDDEFGPEQEVKDSDTPPEPPR 68

Query: 1136 PLKRLRRVTTAKPPAKEPKVEYEDKRCNVXXXXXXXXXXDCFKA------GSPPTNSVCS 975
             LKRLRR        K+     E ++  V           C          S   +SVC 
Sbjct: 69   VLKRLRRAGDKSSATKK-----ESEKPLVWNDGDDEIEEFCSSQEKNDVDSSTQNHSVCG 123

Query: 974  SSKPSLQWNKVRTSDSGTQWISRKGKEVPSASASINVGTRGSNVIFPKLTSSPLRRFQLI 795
            SSK SL+   V T+ S  Q  SRK ++V  A A+ ++  R   +IFPKL  SPLRRF+L+
Sbjct: 124  SSKISLKGLGVLTTQSSGQCSSRKKEQVSDAPATASLEARHGGLIFPKLNISPLRRFKLL 183

Query: 794  DSDSDN---PSTIEDTHKVLPSAILSPEEKISDFCKQTTFGNQGSKTSVGKHQIKDLWKD 624
            DSDSD    PS  +DT K   +  + P  K     +Q+T  N+  K SV   Q +DLWKD
Sbjct: 184  DSDSDGSEGPSDCDDTSK--GACKIDPPSKE----QQSTISNKKRKASVVTPQNEDLWKD 237

Query: 623  FDQKKSSSIPTPAFDEFCEEYFTGLKN-KSMPKVDCKQTDSGMHMDKTSCLPALFYFFHN 447
            F    +S IPTPAFDE  +EYF  +K+  +  K++ ++ +  +++D     PA  YFFH+
Sbjct: 238  FTPINTSHIPTPAFDEVFKEYFQSVKDTNAAQKLENQKFEELLNLDDP-LPPAHCYFFHD 296

Query: 446  DSRIQKLVRERLPHFFPLEVANNQEDTQENISVIDYMGQF-GGEQNARQTNRKQNAEKNI 270
            D RIQKLVR RLP F PL +  N  + Q N+SVIDYM QF  GE + ++ ++K   +K  
Sbjct: 297  DPRIQKLVRSRLPFFSPLHMVKNGGNQQHNVSVIDYMSQFSNGESSKQRGSQKGGGKKCS 356

Query: 269  KRSKKNTKTSLVDGVSEHSENWINPKSCAGVQKNAGGRKVQAVSGSSGRWYTGQDGRR 96
               +K +K S  +  +  SE W++ KS A + KNAG R+V A    +G WYT  +GR+
Sbjct: 357  MSRRKKSKNSKAEETA--SEGWVDLKSSAAIPKNAGKRRVHASDQPAGHWYTSPEGRK 412


>ref|XP_004491393.1| PREDICTED: uncharacterized protein LOC101503265 [Cicer arietinum]
          Length = 501

 Score =  219 bits (557), Expect = 3e-54
 Identities = 165/473 (34%), Positives = 224/473 (47%), Gaps = 73/473 (15%)
 Frame = -2

Query: 1223 PSTAPNLRPTEEEDDDDFVSLARVSEPPRPLKRLRRVTTAKPPAKEPK-VEYEDKRCNVX 1047
            PST+PN  P  +  D D       + P   LKRLRR   +      P  ++ +D   ++ 
Sbjct: 23   PSTSPNHDPLPQVPDSDPDPETLPNPPLHILKRLRRGPPSSSKTDPPSCIDVDDD--DIE 80

Query: 1046 XXXXXXXXXDCFKAGSPPTNSVCSSSKPSLQWNKVRTSDSGTQWISRKGKEVPSASASIN 867
                       F   S   +SVCSSSK SL+   V T  S      +K K+     AS+ 
Sbjct: 81   EFSSQEDPVQGFAHSSVRNHSVCSSSKVSLKGVGVLTPHSFINSNEKKRKQDSDIPASVG 140

Query: 866  VGTRGSNVIFPKLTSSPLRRFQLIDSDSDNPSTI--EDT---HKVLPSAILSPEEKISDF 702
            + T     +  KL +SPLRRF+L+DSD D+   +  ED    +KV PS+ L P       
Sbjct: 141  LETGQRGFLLRKLAASPLRRFKLLDSDDDDDDDLVCEDVTWENKVGPSSSLGP------L 194

Query: 701  CKQTT---FGNQGSKTSVGKHQIKDLWKDFDQKKSSSIPTPAFDEFCEEYFTGLKNKSMP 531
            C ++T      Q  KT    ++ +DLWKD    K+ S+PTP F+E  EEYF   KN  +P
Sbjct: 195  CNRSTPLISLEQDRKTQFDVNRNQDLWKDLSPVKNFSVPTPVFNEVFEEYFRSAKNVEVP 254

Query: 530  K--VDCKQT--------DSGMHMDKT------SCLPALFYFFHNDSRIQKLVRERLPHFF 399
            K  +D  +         +SG   D+          PA  YFFH D RIQ+LVR RL +F 
Sbjct: 255  KSRIDISENHNATYGGFNSGWQKDEQVWEAAGPLPPAHRYFFHEDPRIQQLVRSRLCNFT 314

Query: 398  PLEVANNQEDTQENISVIDYMGQF----------------GGEQNARQTNRKQNAEK--- 276
            PL V  N+ + Q+N+S IDY+GQF                 G  + R   +  N E+   
Sbjct: 315  PLGV--NRVNQQQNVSHIDYLGQFDNGGVSKTPVVRKGRASGSSSRRSKAKNLNVEQIFN 372

Query: 275  -----------------NIKRSKKNTKTSLVDGVSEH------------SENWINPKSCA 183
                                R K   ++S    VS+             S NW+ PKSC 
Sbjct: 373  ASEGWVDPKIISPFSSGTSSRKKATKRSSTKSSVSKSKNGQSKLNPSNVSGNWVEPKSCT 432

Query: 182  GVQKNAGGRKVQAVSGSSGRWYTGQDGRRAYVSKNGQELTGQIAYRQYRKESG 24
             + ++AG R+VQA S S+G WYTG DGR+ YV+K+GQELTG+ AYR YRKESG
Sbjct: 433  SMPRDAGKRRVQASSQSAGHWYTGSDGRKVYVNKSGQELTGRNAYRNYRKESG 485


>ref|XP_004512818.1| PREDICTED: uncharacterized protein LOC101490882 [Cicer arietinum]
          Length = 492

 Score =  213 bits (541), Expect = 2e-52
 Identities = 159/439 (36%), Positives = 218/439 (49%), Gaps = 65/439 (14%)
 Frame = -2

Query: 1145 PPRPLKRLRR----VTTAKPPAKEPKVEYEDKRCNVXXXXXXXXXXDCFKAGSPPTNSVC 978
            P R LKRLRR         PP+     + ED                 F   S   +S+ 
Sbjct: 50   PRRTLKRLRRGLRSSAQTNPPSCFDAADEEDD----IEEFSQEDPVQVFAHSSVRNHSIF 105

Query: 977  SSSKPSLQWNKVRTSDSGTQWISRKGKEVPSASASINVGTRGSNVIFPKLTSSPLRRFQL 798
             SSK SL+   V T     +   +K K+   + AS+ + T  S  +FPKL +SP R FQL
Sbjct: 106  RSSKVSLKGAGVLTPHPSRE---KKRKQSSDSPASVVLETGQSCFVFPKLAASPPRIFQL 162

Query: 797  IDSDSDN--PSTIEDTHKVLPSAILSPEEKISDFCKQTTFGNQGSKTSVGKHQIKDLWKD 624
            +DSD D+     +++ +KV PS+   P   + +     T   Q  KT    ++  DLWKD
Sbjct: 163  LDSDDDDLVGKDVDNENKVGPSSSTRP---VCNRSTPLTSSEQDRKTQSDVNKNHDLWKD 219

Query: 623  FDQKKSSSIPTPAFDEFCEEYFTGLKNKSMPKVDC----------KQTDSGMHMDKT--- 483
                K+ S+PTPAF++ CEEYF   KN  +PK             +  +SG   D+    
Sbjct: 220  LSPVKNFSVPTPAFNKVCEEYFHSAKNTQVPKSGIGISENHNETFRGVNSGYQKDEQIWD 279

Query: 482  ---SCLPALFYFFHNDSRIQKLVRERLPHFFPLEVANNQEDTQENISVIDYMGQFGGEQN 312
                  PA  YFFH D RIQ+LVR RL +F PL V   + D Q+++S IDY+GQF   + 
Sbjct: 280  AVGPPPPAHRYFFHEDPRIQQLVRSRLHNFSPLGVI--KVDQQQDVSHIDYLGQFDNRRA 337

Query: 311  AR-----------QTNRKQNAEK-NI----------------------KRSKKN-TKTSL 237
            ++            T+RK  ++K NI                      K +K+N TK S+
Sbjct: 338  SKTPSVPKGRGNGSTSRKSKSKKLNIEETFNDFEGWVNPEIISSSSRNKTTKRNSTKRSV 397

Query: 236  VDGVSEHSE--------NWINPKSCAGVQKNAGGRKVQAVSGSSGRWYTGQDGRRAYVSK 81
                +E S+        NW+ PKSC+ + K+A  R+VQA S S+G WYTG DGR+ YVSK
Sbjct: 398  SKSKNEKSKLNSSNVSGNWVEPKSCSSMPKDASERRVQASSQSAGHWYTGSDGRKVYVSK 457

Query: 80   NGQELTGQIAYRQYRKESG 24
            +GQELTG+ AYRQYRKESG
Sbjct: 458  SGQELTGRNAYRQYRKESG 476


>ref|XP_007214595.1| hypothetical protein PRUPE_ppa024431mg [Prunus persica]
            gi|462410460|gb|EMJ15794.1| hypothetical protein
            PRUPE_ppa024431mg [Prunus persica]
          Length = 528

 Score =  205 bits (522), Expect = 3e-50
 Identities = 151/435 (34%), Positives = 211/435 (48%), Gaps = 64/435 (14%)
 Frame = -2

Query: 1145 PPRPLKRLRRVTTAKPPAKEPKVEYEDKRCNVXXXXXXXXXXDCFKAGSPPTNSVCSSSK 966
            P RPLKRL+R    K   +EP     +   ++                     +V SSSK
Sbjct: 68   PVRPLKRLKRGLALK---REPATPIRNIDDDIEEFSSPEDIIRADAYRPTQYQTVSSSSK 124

Query: 965  PSLQWNKVRTSDSGTQWISRKGKEVPSASASINVGTRGSNVIFPKLTSSPLRRFQLIDSD 786
              L  + V TS S    + RK K     SAS+ +      ++FPKLT+SPLRRFQLIDSD
Sbjct: 125  IPLHGSGVLTSQSSCHSMGRKRKPASDVSASVGMEANRQGLMFPKLTTSPLRRFQLIDSD 184

Query: 785  SDNPSTIEDTHKVLPSAILSPEEKISDFCKQTTFGNQGSKTSVGKH-QIKDLWKDFDQKK 609
            SD+PS   +  +V  +   S +++  + C   +      K SV +     DLWKDF   K
Sbjct: 185  SDDPSVRGNGSRVTCNVDPSSKKQHFNSCHSASTSETKKKLSVPQDGGDVDLWKDFSPIK 244

Query: 608  SSSIPTPAFDEFCEEYFTGLKNKSMPKV---DCKQTDSGMHMDKTSCL------------ 474
              SIPTPA DE C+E+    K+K+  K+    C  T+  +  + T C+            
Sbjct: 245  KFSIPTPALDEVCQEFLQSAKDKTTQKLGRDSCLHTNE-IFQETTCCVQDVEQLWNVADP 303

Query: 473  --PALFYFFHNDSRIQKLVRERLPHFFPLEVANNQEDTQENISVIDYMGQF-GGEQNARQ 303
              PA  YFFH+D  I+KLV  RLP+FFPL + N + + Q   SVIDYMGQF  GE + ++
Sbjct: 304  LPPAHHYFFHDDPNIRKLVCSRLPNFFPLGI-NIRGNQQNGSSVIDYMGQFSNGEASKQK 362

Query: 302  TNRKQNAEKNIKRSKKNTKTSLVDGV---------------------------------- 225
             N+K + +++ KR  K+  +++ +G+                                  
Sbjct: 363  VNQKIHLDQSSKRRNKSNISNVEEGLHASGGWMNPKGKAAQKGSVNKSSRKVRNRSAKSN 422

Query: 224  ---SEH-SENWINPKSCAG---VQKNA---GGRKVQAVSG-SSGRWYTGQDGRRAYVSKN 78
                EH S NW+ P+S A    +Q NA   G     + SG ++G WYTG  GR+ YVSK 
Sbjct: 423  FGNGEHTSGNWVEPRSNASTKRIQANAQPSGQWSTPSASGQAAGHWYTGPGGRKVYVSKT 482

Query: 77   GQELTGQIAYRQYRK 33
            GQE+TG  AYR YRK
Sbjct: 483  GQEVTGSAAYRLYRK 497


>ref|XP_003518524.1| PREDICTED: uncharacterized protein LOC100798619 [Glycine max]
          Length = 533

 Score =  205 bits (522), Expect = 3e-50
 Identities = 164/478 (34%), Positives = 212/478 (44%), Gaps = 104/478 (21%)
 Frame = -2

Query: 1145 PPRPL-KRLRRVTTAKPPAKEPKVEYEDKRCNVXXXXXXXXXXDCFKAGSPPTNSVCSSS 969
            PPRPL KR RR     PP  +   + E+                 F +     +SVCSSS
Sbjct: 44   PPRPLLKRFRR--DLPPPCLDADDDIEE-----FSSQEDPDQGHAFPSAWNRNHSVCSSS 96

Query: 968  KPSLQWNKVRT--SDSGTQWISRKGKEVPS-ASASINVGTRGSNVIFPKLTSSPLRRFQL 798
            K SL  + V T  S S +    RK KE+ +   AS  + TR S ++FPKL +SPLRRFQL
Sbjct: 97   KVSLNGSGVLTPHSCSNSSSRDRKRKELSNDVPASSRLETRKSGLMFPKLNTSPLRRFQL 156

Query: 797  IDSDSDNPSTIEDTHKVLPSAILSPEEKI-----------SDFCKQTTF----------- 684
            IDSDSD+        KV P++ L  ++K             DF                 
Sbjct: 157  IDSDSDDADVDVGADKVNPNSHLEQDKKTLVDLNGNEDLWKDFSPVKNVYVNRFQLLSDS 216

Query: 683  ------------GNQGSKTSVGKHQI--KDLWKDFDQKKSSSIPTPAFDEFCEEYFTGLK 546
                         N  S     K  +  +DLWKDF   K+ S+PTPAF+E CEEYF    
Sbjct: 217  DDLDVDVGAANKANLDSNLEKNKKTLLDEDLWKDFSPVKNVSVPTPAFNEMCEEYFRSAY 276

Query: 545  NKSMPKVDCK--------------QTDSGMHMDKTSCLPALFYFFHNDSRIQKLVRERLP 408
             K +   D                Q D           PA  YFFH D RI++LV  RL 
Sbjct: 277  CKEVGGGDVSKSFNERNPGVSSSCQRDQQQQESTDPVHPAHSYFFHEDPRIRRLVCSRLQ 336

Query: 407  HFFPLEVANNQEDTQENISVIDYMGQFG--------GEQNARQTNR-------------- 294
            +F PL   N     Q N+S IDYM QFG        G QN R +N               
Sbjct: 337  NFNPLGTINTVNQ-QPNVSHIDYMRQFGNGGASNMQGVQNGRVSNSTRGKNKSSNLNVEG 395

Query: 293  ----------------------------KQNAEKNIKRSKKNTKTSLVDGVSEHSENWIN 198
                                        K+N+ KN   SK N KT+  +  ++ SE W+ 
Sbjct: 396  YFDASGGWMDPKFVSPFSHGKSSRKKATKRNSTKN-SVSKSNNKTNKSNPSNQSSEGWVE 454

Query: 197  PKSCAGVQKNAGGRKVQAVSGSSGRWYTGQDGRRAYVSKNGQELTGQIAYRQYRKESG 24
            P+SC  + K+AG R+VQA   S+G W+T  +GR+ YV+K+G+ELTG+ AYRQYRKESG
Sbjct: 455  PRSCTSLPKDAGKRRVQASGQSAGHWFTSPEGRKVYVNKSGEELTGRNAYRQYRKESG 512


>gb|AAM96977.1| unknown protein [Arabidopsis thaliana] gi|23198428|gb|AAN15741.1|
            unknown protein [Arabidopsis thaliana]
          Length = 458

 Score =  200 bits (509), Expect = 1e-48
 Identities = 146/428 (34%), Positives = 218/428 (50%), Gaps = 27/428 (6%)
 Frame = -2

Query: 1223 PSTAPNLRPTEEEDDDDFVSLARVSEPPRPLKRLRR-VTTAKPPAKEPK-VEYEDKRCNV 1050
            P     +  ++ E + DF S          LKRLRR +   K   K+ + V  ED+  ++
Sbjct: 44   PELGLTVSDSDREPEPDFTSPV--------LKRLRRGINPNKCSVKDDRSVAVEDRDDDI 95

Query: 1049 XXXXXXXXXXDCFKAGSPPTNSVCSSSKPSLQWNKVRTSDSGTQWISRKGKEVPSASASI 870
                          A +    S CSS  P L  + V ++        RK  +V +++AS 
Sbjct: 96   EEFSSPEDFPTDAPASTRSHFSSCSSRVP-LHGSGVLSNQPSISRGKRKQSDVQASAAS- 153

Query: 869  NVGTRGSNVIFPKLTSSPLRRFQLIDSDS--DNPSTIEDTHKVLPSAILSPEEKISDFCK 696
              G      +F   + SPLRRFQL+DSDS  D+PST  D        +    +K   F K
Sbjct: 154  --GISSVASLFQMSSRSPLRRFQLLDSDSEDDHPSTSRD--------LSGATKKHDSFSK 203

Query: 695  QTTFGNQGSKTSVGKHQ-------IKDLWKDFDQKKSSSIPTPAFDEFCEEYFTGLKNKS 537
                 NQ S  S  K +       IKDLWKDF    SS I TPAFD+ C++YF  +K  S
Sbjct: 204  -----NQPSIASKPKRKEPGSIPCIKDLWKDFSPA-SSKIQTPAFDDVCQDYFISIKTTS 257

Query: 536  MPKVD----CKQTDSGMH----MDKTSCL--------PALFYFFHNDSRIQKLVRERLPH 405
              +         ++SG H      +T           P+  +F H+D RI+ L R+RLP+
Sbjct: 258  TAQKQSSAVASSSNSGNHNLTGFQQTELFHDFSHPSPPSHRFFLHSDPRIRNLARQRLPN 317

Query: 404  FFPLEVANNQEDTQENISVIDYMGQFGGEQNARQTNRKQNAEKNIKRSKKNTKTSLVDGV 225
            FFPL + N++E +Q  + ++DYM QFG + +++  +   ++ K+ +R K  +K S     
Sbjct: 318  FFPLGIVNDRE-SQREVFLVDYMNQFGSKGSSKAGD---SSSKSCRRGKTKSKVSKSQES 373

Query: 224  SEHSENWINPKSCAGVQKNAGGRKVQAVSGSSGRWYTGQDGRRAYVSKNGQELTGQIAYR 45
            + +SE W+NPK+ A   K+AG R+V A SGS+G W+T  +GR+ Y+SK+GQE +GQ AYR
Sbjct: 374  AHNSEGWLNPKTRAAAPKDAGKRRVSADSGSAGHWFTSPEGRKVYISKSGQEFSGQSAYR 433

Query: 44   QYRKESGM 21
             Y+KE+G+
Sbjct: 434  CYKKENGV 441

