BLASTX nr result

ID: Mentha26_contig00001828 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00001828
         (868 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU43410.1| hypothetical protein MIMGU_mgv1a005781mg [Mimulus...   280   5e-73
ref|XP_002519031.1| conserved hypothetical protein [Ricinus comm...   223   1e-55
ref|XP_004232845.1| PREDICTED: uncharacterized protein LOC101266...   221   2e-55
ref|XP_002305687.2| hypothetical protein POPTR_0004s04000g [Popu...   221   3e-55
ref|XP_006377324.1| hypothetical protein POPTR_0011s04900g [Popu...   217   5e-54
ref|XP_003528449.1| PREDICTED: uncharacterized protein LOC100806...   217   5e-54
ref|XP_006347070.1| PREDICTED: uncharacterized protein LOC102601...   214   3e-53
ref|XP_003542764.1| PREDICTED: uncharacterized protein LOC100789...   213   1e-52
gb|EXB82799.1| hypothetical protein L484_012112 [Morus notabilis]     211   4e-52
emb|CBI35691.3| unnamed protein product [Vitis vinifera]              211   4e-52
ref|XP_007159304.1| hypothetical protein PHAVU_002G226800g [Phas...   207   5e-51
ref|XP_002268025.2| PREDICTED: uncharacterized protein LOC100249...   207   5e-51
ref|XP_007025688.1| WAPL protein, putative isoform 6, partial [T...   204   3e-50
ref|XP_007025687.1| WAPL protein, putative isoform 5, partial [T...   204   3e-50
ref|XP_007025685.1| WAPL protein, putative isoform 3 [Theobroma ...   204   3e-50
ref|XP_007025683.1| WAPL protein, putative isoform 1 [Theobroma ...   204   3e-50
ref|XP_007214611.1| hypothetical protein PRUPE_ppa001140mg [Prun...   204   3e-50
ref|XP_006449301.1| hypothetical protein CICLE_v10014178mg [Citr...   196   1e-47
ref|XP_004505031.1| PREDICTED: uncharacterized protein LOC101498...   195   2e-47
ref|XP_006467835.1| PREDICTED: uncharacterized protein LOC102612...   193   6e-47

>gb|EYU43410.1| hypothetical protein MIMGU_mgv1a005781mg [Mimulus guttatus]
          Length = 471

 Score =  280 bits (716), Expect = 5e-73
 Identities = 170/326 (52%), Positives = 201/326 (61%), Gaps = 53/326 (16%)
 Frame = +1

Query: 49   GFNSMEQSSYLNVEATQVCSQ----KWRVESSQARSCSGTSCTPNYVTX----------- 183
            G +  E+    ++E++Q+ +     K RVESSQA  CSGTS   N  T            
Sbjct: 86   GSSQNEKMGVCSMESSQLSADPLFLKQRVESSQAGLCSGTSWNSNNATHIISSDDSDTEF 145

Query: 184  -----KLRLTNSGITE-DCPDPFGFTDT-------------------------------- 249
                 +L   N+G+ E    DPF F +                                 
Sbjct: 146  GGAKRQLMCANTGVMEYGGGDPFAFDEDDFEPSKWELLSVNGKKPLSQDSRGYNKYDKNP 205

Query: 250  SDHIPVSQQESNNVEYHHSQETSSSSVVDEDKSNLLADCLLTAVKVLMNLSNDNPEGCRQ 429
            S   PVS Q+ +N +   SQE S S  VDEDKSNLL+DCLLTAVKVLMNL+NDNPEGC+Q
Sbjct: 206  SPTPPVSSQQESNNDCFRSQEASCSLSVDEDKSNLLSDCLLTAVKVLMNLTNDNPEGCQQ 265

Query: 430  IGSCGGLEILSSLITGHFPSFSLPLPXXXXXXXXXXXXXXXXTIYHCSNTPLTDQELDFL 609
            IG+CGGLEILSSLI GHFPSFSL LP                      ++PLTD+ELDFL
Sbjct: 266  IGTCGGLEILSSLIAGHFPSFSLSLPHFGDVREGGLS---------AKSSPLTDRELDFL 316

Query: 610  VAILGLLVNLVEKDGGNRSQLAAASVSIPHIVGLESEKQSNMIPILCSIFLANQGTSEDA 789
            VAILGLLVNLVEKDG NRS+LAAASVS+P++ GL+SE QS++I +LCS+FLANQGT E A
Sbjct: 317  VAILGLLVNLVEKDGCNRSRLAAASVSLPNLEGLDSEDQSDLISLLCSVFLANQGTGEAA 376

Query: 790  GEEKSLSWEDEESILQGEKEAEKMIV 867
            GEEK LSWEDEESILQGEKEAEKMIV
Sbjct: 377  GEEKQLSWEDEESILQGEKEAEKMIV 402


>ref|XP_002519031.1| conserved hypothetical protein [Ricinus communis]
            gi|223541694|gb|EEF43242.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 905

 Score =  223 bits (567), Expect = 1e-55
 Identities = 137/314 (43%), Positives = 173/314 (55%), Gaps = 59/314 (18%)
 Frame = +1

Query: 103  CSQKWRVESSQARSCSGTSCTPNYVTXKL-----------------RLTNSGITEDCPDP 231
            C  + R+ SS + SCSGT  + N  T                    + T   + ED  DP
Sbjct: 529  CQVRMRIHSSTSSSCSGTRRSTNSGTPSTSNGLRTKFGLPERTNCTKSTKYDLLEDSLDP 588

Query: 232  FGFT--------------------------------DTSDHIPVSQQESNN--------- 288
            + F                                 D   + P+SQ+ESNN         
Sbjct: 589  YAFDEDEFQPSKWDLLSGKQTKSRSQNCAVTSRALEDGCQYRPMSQEESNNSENSEQKAR 648

Query: 289  -VEYHHSQETSSSSVVDEDKSNLLADCLLTAVKVLMNLSNDNPEGCRQIGSCGGLEILSS 465
             VE H SQ+ S S+  +E+  +L+ADCLLTAVKVLMNL+NDNP GC+QI +CGGLE + S
Sbjct: 649  NVECHPSQKNSCSNASEEEHFSLMADCLLTAVKVLMNLTNDNPIGCKQIAACGGLEKMCS 708

Query: 466  LITGHFPSFSLPLPXXXXXXXXXXXXXXXXTIYHCSNTPLTDQELDFLVAILGLLVNLVE 645
            LI GHFPSFS  L                 ++   ++  LTDQELDFLVAILGLLVNLVE
Sbjct: 709  LIAGHFPSFSSSLSCFSETKGDTT------SMESQNDNHLTDQELDFLVAILGLLVNLVE 762

Query: 646  KDGGNRSQLAAASVSIPHIVGLESEKQSNMIPILCSIFLANQGTSEDAGEEKSLSWEDEE 825
            KDG NRS+LAA +VS+    GLE E   ++IP+LCSIFLANQG  + +GE   ++W DE 
Sbjct: 763  KDGHNRSRLAATTVSVSSSEGLEEESDRDVIPLLCSIFLANQGAGDASGEGNIVAWNDEA 822

Query: 826  SILQGEKEAEKMIV 867
            ++LQGEKEAEKMIV
Sbjct: 823  AVLQGEKEAEKMIV 836


>ref|XP_004232845.1| PREDICTED: uncharacterized protein LOC101266688 [Solanum
            lycopersicum]
          Length = 952

 Score =  221 bits (564), Expect = 2e-55
 Identities = 144/328 (43%), Positives = 185/328 (56%), Gaps = 57/328 (17%)
 Frame = +1

Query: 55   NSMEQSSYLNVEATQVCSQKWRVESSQARSCSGTS-----------CTPNYVTXKLRLTN 201
            +S+    + +   +     K R+ESS++ SCSGTS              N++    +  N
Sbjct: 565  SSISSLEFASTSTSDSWQLKLRIESSKSGSCSGTSEDFSFGVNKNSSKVNFLIGDNQRIN 624

Query: 202  SG----ITEDCPDPFGFTD-----------TSDHIPV---------------------SQ 273
                  + E+  DPF F D           T   +P                      SQ
Sbjct: 625  GDKRLELMEESQDPFAFDDDFGPSRWDLMSTKQKVPETQIRQTSLFERDDEYLSLIVPSQ 684

Query: 274  QESN---------NVEYHHSQETSSSSVVDEDKSNLLADCLLTAVKVLMNLSNDNPEGCR 426
            QES+         + E + S +TS SSV D++ S LLADCLLTAVKVLMNL+NDNP GC+
Sbjct: 685  QESSCQENKPQSSSKENNQSGQTSCSSVADDEMSTLLADCLLTAVKVLMNLTNDNPVGCQ 744

Query: 427  QIGSCGGLEILSSLITGHFPSFSLPLPXXXXXXXXXXXXXXXXTIYHCSNTPLTDQELDF 606
            QI + GGLE LS+LI  HFPSFSL L                 ++   S+  L DQELDF
Sbjct: 745  QIAAGGGLEALSALIASHFPSFSLHL---------DRNGLSKSSVGSDSDGHLNDQELDF 795

Query: 607  LVAILGLLVNLVEKDGGNRSQLAAASVSIPHIVGL-ESEKQSNMIPILCSIFLANQGTSE 783
            LVAILGLLVNLVEKDG NRS+LAAAS+S+P   GL + E Q+++IP+LC+IFL NQG  E
Sbjct: 796  LVAILGLLVNLVEKDGCNRSRLAAASISLPGSEGLFKGETQTDVIPLLCAIFLENQGAGE 855

Query: 784  DAGEEKSLSWEDEESILQGEKEAEKMIV 867
             AGE K L W+DE+++LQGEKEAEKMI+
Sbjct: 856  AAGEGKCLQWDDEDAVLQGEKEAEKMII 883


>ref|XP_002305687.2| hypothetical protein POPTR_0004s04000g [Populus trichocarpa]
            gi|550340276|gb|EEE86198.2| hypothetical protein
            POPTR_0004s04000g [Populus trichocarpa]
          Length = 890

 Score =  221 bits (563), Expect = 3e-55
 Identities = 117/198 (59%), Positives = 143/198 (72%)
 Frame = +1

Query: 274  QESNNVEYHHSQETSSSSVVDEDKSNLLADCLLTAVKVLMNLSNDNPEGCRQIGSCGGLE 453
            Q+S+N E +HSQ++S  SV DE+ S+LLADCLLTA+KVLMNL+NDNP GC+QI  CGGLE
Sbjct: 628  QKSSNGEQYHSQKSSHCSVPDEEHSSLLADCLLTAIKVLMNLTNDNPIGCQQIAVCGGLE 687

Query: 454  ILSSLITGHFPSFSLPLPXXXXXXXXXXXXXXXXTIYHCSNTPLTDQELDFLVAILGLLV 633
             +S+LI GHFPSFS  +                      ++  LTDQELDFLVAILGLLV
Sbjct: 688  TMSTLIAGHFPSFSSSISLVGEMQEDGSSIEPDNQ----NDVHLTDQELDFLVAILGLLV 743

Query: 634  NLVEKDGGNRSQLAAASVSIPHIVGLESEKQSNMIPILCSIFLANQGTSEDAGEEKSLSW 813
            NLVEKDG NRS+LAA SV +  + G E E + ++IP+LCSIFLANQG  + AGE   +SW
Sbjct: 744  NLVEKDGDNRSRLAATSVPLSILEGSEDESRKDVIPLLCSIFLANQGAGDAAGEGNVVSW 803

Query: 814  EDEESILQGEKEAEKMIV 867
             DE ++LQGEKEAEKMIV
Sbjct: 804  NDEAAVLQGEKEAEKMIV 821


>ref|XP_006377324.1| hypothetical protein POPTR_0011s04900g [Populus trichocarpa]
            gi|550327612|gb|ERP55121.1| hypothetical protein
            POPTR_0011s04900g [Populus trichocarpa]
          Length = 883

 Score =  217 bits (552), Expect = 5e-54
 Identities = 118/226 (52%), Positives = 151/226 (66%)
 Frame = +1

Query: 190  RLTNSGITEDCPDPFGFTDTSDHIPVSQQESNNVEYHHSQETSSSSVVDEDKSNLLADCL 369
            R+T   +   C       + S +      +S+N E+H SQ++S  +V DE+ S+LLADCL
Sbjct: 593  RVTPKEVENGCQYKLVSQEESSNGGNGLHKSSNREHHDSQKSSYCNVPDEEHSSLLADCL 652

Query: 370  LTAVKVLMNLSNDNPEGCRQIGSCGGLEILSSLITGHFPSFSLPLPXXXXXXXXXXXXXX 549
            LTA+KVLMNL+NDNP GC+QI +CGGLE +SSLI GHFP FS  +               
Sbjct: 653  LTAIKVLMNLTNDNPIGCQQIAACGGLETMSSLIAGHFPLFSSSISFFGEMQEDSSSIP- 711

Query: 550  XXTIYHCSNTPLTDQELDFLVAILGLLVNLVEKDGGNRSQLAAASVSIPHIVGLESEKQS 729
               + + ++  LTDQELD LVAILGLLVNLVEKDG NRS+LAA S+S+    G E E + 
Sbjct: 712  ---LENQNDIHLTDQELDLLVAILGLLVNLVEKDGDNRSRLAATSISLSSSEGSEDESRK 768

Query: 730  NMIPILCSIFLANQGTSEDAGEEKSLSWEDEESILQGEKEAEKMIV 867
            ++IP+LCSIFLANQG  + AGE   +SW DE ++LQGEKEAEKMIV
Sbjct: 769  DVIPLLCSIFLANQGAGDAAGEGNIVSWNDEAAVLQGEKEAEKMIV 814


>ref|XP_003528449.1| PREDICTED: uncharacterized protein LOC100806542 [Glycine max]
          Length = 862

 Score =  217 bits (552), Expect = 5e-54
 Identities = 145/316 (45%), Positives = 173/316 (54%), Gaps = 45/316 (14%)
 Frame = +1

Query: 55   NSMEQSSYLNVEATQVCSQKWRVESSQARSCSGTSCTPNYVTXKLRLTNSG--------- 207
            +S+  S   +   T   S K RV SS + SCSG S +    T  ++  +SG         
Sbjct: 488  SSLSISETPSTSTTDTYSLKTRVSSSMSGSCSGASKSSYCKTSTIQ-NSSGKNVRFMEGT 546

Query: 208  ---ITEDCPDPFGF---------------------------------TDTSDHIPVSQQE 279
               I +D  DPF F                                  +   H  VSQ+E
Sbjct: 547  PVVILDDSQDPFAFDEDDFAPSKWDLLSGKQKKSHSKKHLVANREFENECQSHTNVSQRE 606

Query: 280  SNNVEYHHSQETSSSSVVDEDKSNLLADCLLTAVKVLMNLSNDNPEGCRQIGSCGGLEIL 459
             +N + +     SSS V DE  S+LLADCLLTAVKVLMNL+NDNP GCRQI + GGLE +
Sbjct: 607  LSNGDIN----CSSSDVGDEKDSSLLADCLLTAVKVLMNLTNDNPVGCRQIANYGGLETM 662

Query: 460  SSLITGHFPSFSLPLPXXXXXXXXXXXXXXXXTIYHCSNTPLTDQELDFLVAILGLLVNL 639
            S LI GHFPSFS                    T  H S+  LTD ELDFLVAILGLLVNL
Sbjct: 663  SMLIAGHFPSFS-----SSSSFAQIKENGAGTTKDHQSDRHLTDHELDFLVAILGLLVNL 717

Query: 640  VEKDGGNRSQLAAASVSIPHIVGLESEKQSNMIPILCSIFLANQGTSEDAGEEKSLSWED 819
            VEKDG NRS+LAAASV +P  V L  E + ++I +LCSIFLAN G SE AGE+K L   D
Sbjct: 718  VEKDGHNRSRLAAASVLLPSSVSLHQEVRKDVIQLLCSIFLANLGESEGAGEDKHLQLND 777

Query: 820  EESILQGEKEAEKMIV 867
            E ++LQGEKEAEKMIV
Sbjct: 778  EAAVLQGEKEAEKMIV 793


>ref|XP_006347070.1| PREDICTED: uncharacterized protein LOC102601713 [Solanum tuberosum]
          Length = 961

 Score =  214 bits (546), Expect = 3e-53
 Identities = 140/337 (41%), Positives = 183/337 (54%), Gaps = 66/337 (19%)
 Frame = +1

Query: 55   NSMEQSSYLNVEATQVCSQKWRVESSQARSCSGTS-----------CTPNYVTXKLRLTN 201
            +S+    + +   +     K R+ESS++ SCSGTS              N++    +  N
Sbjct: 565  SSISSLEFASTSTSDSWQLKLRIESSKSGSCSGTSEDFSFGVNKNSSKVNFLIGDNQRIN 624

Query: 202  SG----ITEDCPDPFGFTD-----------TSDHIPVSQ--------------------- 273
                  + E+  DPF F D           T   +P +Q                     
Sbjct: 625  GDKRLELMEESQDPFAFDDDFGPSRWDLMSTKQKVPETQIRQTSLFERDDEYQSLIVRSQ 684

Query: 274  ------------------QESNNVEYHHSQETSSSSVVDEDKSNLLADCLLTAVKVLMNL 399
                               ES++ E + S +TS S+V D++ S LLADCLLTAVK LMNL
Sbjct: 685  QESSCQENKPESSSKENKPESSSKENNQSGQTSCSAVADDEMSTLLADCLLTAVKALMNL 744

Query: 400  SNDNPEGCRQIGSCGGLEILSSLITGHFPSFSLPLPXXXXXXXXXXXXXXXXTIYHCSNT 579
            +NDNP GC+QI + GGLE LS+LI  HFPSFSL L                 ++   S+ 
Sbjct: 745  TNDNPVGCQQIAAGGGLEALSALIASHFPSFSLHL---------DRNGSSKSSVGSDSDG 795

Query: 580  PLTDQELDFLVAILGLLVNLVEKDGGNRSQLAAASVSIPHIVGL-ESEKQSNMIPILCSI 756
             L DQELDFLVAILGLLVNLVEKDG NRS+LAAAS+S+P   GL + E Q+++IP+LC+I
Sbjct: 796  HLNDQELDFLVAILGLLVNLVEKDGCNRSRLAAASISLPGPEGLFKGETQTDVIPLLCAI 855

Query: 757  FLANQGTSEDAGEEKSLSWEDEESILQGEKEAEKMIV 867
            FLANQG  E A E K L W+DE+++LQGEKEAEKMI+
Sbjct: 856  FLANQGAGEAAEEGKCLQWDDEDAVLQGEKEAEKMII 892


>ref|XP_003542764.1| PREDICTED: uncharacterized protein LOC100789737 [Glycine max]
          Length = 865

 Score =  213 bits (541), Expect = 1e-52
 Identities = 144/308 (46%), Positives = 170/308 (55%), Gaps = 46/308 (14%)
 Frame = +1

Query: 82   NVEATQVCSQKWRVESSQARSCSGTSCTPNYVTXKLRLTNSG------------ITEDCP 225
            +   T   S K RV SS + SCSG S +    T +++  +SG            I +D  
Sbjct: 499  STSTTDSYSLKMRVNSSTSGSCSGASKSSYCKTSRIQ-NSSGKNVRFMEDTPVVILDDSQ 557

Query: 226  DPFGFTDTSDHIP----------------------------------VSQQESNNVEYHH 303
            DPF F D  D  P                                  VSQQE +N + + 
Sbjct: 558  DPFAF-DEDDFAPSKWDLLSGKPKKSHSKKHVVANREFENECQSLTNVSQQELSNGDIN- 615

Query: 304  SQETSSSSVVDEDKSNLLADCLLTAVKVLMNLSNDNPEGCRQIGSCGGLEILSSLITGHF 483
                SSS V DE  S+LLADCLL AVKVLMNL+NDNP GCRQI + GGLE +S LI GHF
Sbjct: 616  ---CSSSDVGDEKDSSLLADCLLAAVKVLMNLTNDNPVGCRQIANYGGLETMSMLIAGHF 672

Query: 484  PSFSLPLPXXXXXXXXXXXXXXXXTIYHCSNTPLTDQELDFLVAILGLLVNLVEKDGGNR 663
            PSFS                    T  + S+  LTD ELDFLVAILGLLVNLVEKDG NR
Sbjct: 673  PSFS----SSSSSFAQIKENGEGTTKDNQSDRHLTDHELDFLVAILGLLVNLVEKDGHNR 728

Query: 664  SQLAAASVSIPHIVGLESEKQSNMIPILCSIFLANQGTSEDAGEEKSLSWEDEESILQGE 843
            S+LAAASV +P  V L  E + ++I +LCSIFLAN G SE AGE+K L   DE ++LQGE
Sbjct: 729  SRLAAASVHLPSSVSLHQEVRKDVIQLLCSIFLANLGESEGAGEDKQLQLNDEAAVLQGE 788

Query: 844  KEAEKMIV 867
            KEAEKMIV
Sbjct: 789  KEAEKMIV 796


>gb|EXB82799.1| hypothetical protein L484_012112 [Morus notabilis]
          Length = 851

 Score =  211 bits (536), Expect = 4e-52
 Identities = 137/308 (44%), Positives = 172/308 (55%), Gaps = 42/308 (13%)
 Frame = +1

Query: 70   SSYLNVEATQVCSQKWRVESSQARSCSGTS--CTPNYVTXKLRLTNSGIT--EDCPDPFG 237
            S   +   T   S K R  SS + SCSG S   + +  T    + N  I   +D  DPF 
Sbjct: 488  SETTSTSMTDGYSLKTRRRSSASSSCSGMSRSLSGSNATKNSSMKNVDIVLLDDSQDPFA 547

Query: 238  FTDTS---------------------------------DHIPVSQQESNNVEYHHSQETS 318
            F +                                     I +SQ+E+++ E +HS E S
Sbjct: 548  FDEDDLEPSKWEVLSGKQNTSRTKRIGLKDREPDYGFQSRIKMSQEETSSGENNHSHEAS 607

Query: 319  SSSVVDEDKSNLLADCLLTAVKVLMNLSNDNPEGCRQIGSCGGLEILSSLITGHFPSFSL 498
             S+ VDE +S+LLADCLLTAVK LMN++NDNP GC+QI +CGGLE +SSLI  HFPSFS 
Sbjct: 608  CSTSVDEGRSSLLADCLLTAVKALMNVTNDNPVGCQQIAACGGLETMSSLIALHFPSFSS 667

Query: 499  PLPXXXXXXXXXXXXXXXXTIYHCSNTPLTDQELDFLVAILGLLVNLVEKDGGNRSQLAA 678
              P                 + + S+ PLTD ELDFLVAILGLLVNLVEKDG NRS+LA+
Sbjct: 668  SPP-------------SFLDVDNQSDRPLTDHELDFLVAILGLLVNLVEKDGENRSRLAS 714

Query: 679  ASVSIPHIVGLESE-----KQSNMIPILCSIFLANQGTSEDAGEEKSLSWEDEESILQGE 843
            ASV + H     SE      + ++IP+LCSIFLANQG  E   E K   W+DE ++LQGE
Sbjct: 715  ASVPL-HKSNFYSEFCGKASRKDVIPLLCSIFLANQGAGEAVHEGKVQPWDDEAAVLQGE 773

Query: 844  KEAEKMIV 867
            KEAEKMI+
Sbjct: 774  KEAEKMIL 781


>emb|CBI35691.3| unnamed protein product [Vitis vinifera]
          Length = 903

 Score =  211 bits (536), Expect = 4e-52
 Identities = 124/236 (52%), Positives = 148/236 (62%), Gaps = 6/236 (2%)
 Frame = +1

Query: 178  TXKLRLTNSGITEDCPDPFGFTDTSDHIPVSQQESNNVEYHHSQETSS------SSVVDE 339
            T K R+T  G+ + C            +  SQQES+N E +   E S       S  ++ 
Sbjct: 611  TKKCRVTYRGLEDGC---------LSQLMTSQQESSNRESNELHEISCPAEISCSDAINN 661

Query: 340  DKSNLLADCLLTAVKVLMNLSNDNPEGCRQIGSCGGLEILSSLITGHFPSFSLPLPXXXX 519
            + SNLLADCLL AVKVLMNL+NDNP GC+QI  CGGLE +S+LI  HFPSFS        
Sbjct: 662  ENSNLLADCLLNAVKVLMNLTNDNPVGCQQIADCGGLETMSALIADHFPSFSSSSSPSCE 721

Query: 520  XXXXXXXXXXXXTIYHCSNTPLTDQELDFLVAILGLLVNLVEKDGGNRSQLAAASVSIPH 699
                             ++T LTDQELDFLVAILGLLVNLVEKD  NRS+LAAASVS+P 
Sbjct: 722  MKDIAMFSNSSVEFDPQNDTHLTDQELDFLVAILGLLVNLVEKDDRNRSRLAAASVSLPS 781

Query: 700  IVGLESEKQSNMIPILCSIFLANQGTSEDAGEEKSLSWEDEESILQGEKEAEKMIV 867
              GLE   + ++IP+LCSIFLAN+G  E A E   LSW DE ++LQGEKEAEKMIV
Sbjct: 782  SEGLEEGTRRDVIPLLCSIFLANKGAGEAAEE---LSWNDEAALLQGEKEAEKMIV 834


>ref|XP_007159304.1| hypothetical protein PHAVU_002G226800g [Phaseolus vulgaris]
            gi|561032719|gb|ESW31298.1| hypothetical protein
            PHAVU_002G226800g [Phaseolus vulgaris]
          Length = 857

 Score =  207 bits (526), Expect = 5e-51
 Identities = 145/316 (45%), Positives = 167/316 (52%), Gaps = 45/316 (14%)
 Frame = +1

Query: 55   NSMEQSSYLNVEATQVCSQKWRVESSQARSCSGTS----CTPNYVTXKLRL-------TN 201
            +S+  S   +   T   S K RV SS + SCSG S    C  + +   LR        T 
Sbjct: 483  SSLSISETPSTSTTDTYSLKMRVSSSTSGSCSGASKSSYCKTSMIQNDLRKNVRFMESTP 542

Query: 202  SGITEDCPDPFGFTDTSDHIP----------------------------------VSQQE 279
              I +D  DPF F D  D  P                                  VSQQE
Sbjct: 543  VVILDDSQDPFAF-DEDDIAPSKWDLLSGKQKKPHSKKHVVASREFEIECQSNTSVSQQE 601

Query: 280  SNNVEYHHSQETSSSSVVDEDKSNLLADCLLTAVKVLMNLSNDNPEGCRQIGSCGGLEIL 459
             +N + +     SSS   DE  S+LL DCLL AVKVLMNL+NDNP GC QI S GGLE +
Sbjct: 602  LSNGDIN----CSSSDDGDEKDSSLLTDCLLAAVKVLMNLTNDNPVGCHQIASYGGLETM 657

Query: 460  SSLITGHFPSFSLPLPXXXXXXXXXXXXXXXXTIYHCSNTPLTDQELDFLVAILGLLVNL 639
            S LI  HFPSFS PL                 T  H S+  LTD ELDFLVAILGLLVNL
Sbjct: 658  SMLIACHFPSFSSPLSFAQIKENAAGT-----TKDHQSDRHLTDHELDFLVAILGLLVNL 712

Query: 640  VEKDGGNRSQLAAASVSIPHIVGLESEKQSNMIPILCSIFLANQGTSEDAGEEKSLSWED 819
            VEKDG NRS+LAAASV +P  VGL  E   ++I +LCSIFLAN G  E  GE+K L   D
Sbjct: 713  VEKDGHNRSRLAAASVLLPSSVGLCQEVWGDVIQLLCSIFLANLGEGEGDGEDKQLQLND 772

Query: 820  EESILQGEKEAEKMIV 867
            E ++LQ EKEAEKMIV
Sbjct: 773  EAAVLQSEKEAEKMIV 788


>ref|XP_002268025.2| PREDICTED: uncharacterized protein LOC100249879 [Vitis vinifera]
          Length = 897

 Score =  207 bits (526), Expect = 5e-51
 Identities = 121/236 (51%), Positives = 147/236 (62%), Gaps = 6/236 (2%)
 Frame = +1

Query: 178  TXKLRLTNSGITEDCPDPFGFTDTSDHIPVSQQESNNVEYHHSQETSS------SSVVDE 339
            T K R+T  G+ + C            +  SQQES+N E +   E S       S  ++ 
Sbjct: 602  TKKCRVTYRGLEDGC---------LSQLMTSQQESSNRESNELHEISCPAEISCSDAINN 652

Query: 340  DKSNLLADCLLTAVKVLMNLSNDNPEGCRQIGSCGGLEILSSLITGHFPSFSLPLPXXXX 519
            + SNLLADCLL AVKVLMNL+NDNP GC+QI  CGGLE +S+LI  HFPSFS        
Sbjct: 653  ENSNLLADCLLNAVKVLMNLTNDNPVGCQQIADCGGLETMSALIADHFPSFSSSSSPSCE 712

Query: 520  XXXXXXXXXXXXTIYHCSNTPLTDQELDFLVAILGLLVNLVEKDGGNRSQLAAASVSIPH 699
                             ++T LTDQELDFLVAILGLLVNLVEKD  NRS+LAAASVS+P 
Sbjct: 713  MKDIAMFSNSSVEFDPQNDTHLTDQELDFLVAILGLLVNLVEKDDRNRSRLAAASVSLPS 772

Query: 700  IVGLESEKQSNMIPILCSIFLANQGTSEDAGEEKSLSWEDEESILQGEKEAEKMIV 867
              GLE   + ++IP+LCSIFLAN+G  E A E   ++  DE ++LQGEKEAEKMIV
Sbjct: 773  SEGLEEGTRRDVIPLLCSIFLANKGAGEAAEELSWVTMNDEAALLQGEKEAEKMIV 828


>ref|XP_007025688.1| WAPL protein, putative isoform 6, partial [Theobroma cacao]
            gi|508781054|gb|EOY28310.1| WAPL protein, putative
            isoform 6, partial [Theobroma cacao]
          Length = 859

 Score =  204 bits (520), Expect = 3e-50
 Identities = 121/231 (52%), Positives = 148/231 (64%), Gaps = 3/231 (1%)
 Frame = +1

Query: 184  KLRLTNSGITEDCPDPFGFT---DTSDHIPVSQQESNNVEYHHSQETSSSSVVDEDKSNL 354
            KL L N  I ++    F FT     S +  + Q E  N EY HS  TS S   +E+ S+L
Sbjct: 612  KLGLRNGEIQDE--HQFQFTISQQESSNGEICQTEFTNEEYRHSNATSGSQSAEEEYSSL 669

Query: 355  LADCLLTAVKVLMNLSNDNPEGCRQIGSCGGLEILSSLITGHFPSFSLPLPXXXXXXXXX 534
            L+DCLL AVKVLMNL+NDNP GC+QI + G LE LS+LI  HFPSF   LP         
Sbjct: 670  LSDCLLAAVKVLMNLTNDNPLGCQQIAASGALETLSTLIASHFPSFCSYLP----RVSEM 725

Query: 535  XXXXXXXTIYHCSNTPLTDQELDFLVAILGLLVNLVEKDGGNRSQLAAASVSIPHIVGLE 714
                    ++  ++ PLTD ELDFLVAILGLLVNLVEKD  NRS+LAAASV +P+  GL 
Sbjct: 726  EENSLSLELHDRNDRPLTDPELDFLVAILGLLVNLVEKDEHNRSRLAAASVFVPNSEGLA 785

Query: 715  SEKQSNMIPILCSIFLANQGTSEDAGEEKSLSWEDEESILQGEKEAEKMIV 867
             + Q  +IP+LC+IFLANQG  + AGE   L W DE ++LQ EKEAEKMI+
Sbjct: 786  EKSQMAVIPLLCAIFLANQGEDDAAGE--VLPWNDEAAVLQEEKEAEKMIL 834


>ref|XP_007025687.1| WAPL protein, putative isoform 5, partial [Theobroma cacao]
            gi|508781053|gb|EOY28309.1| WAPL protein, putative
            isoform 5, partial [Theobroma cacao]
          Length = 857

 Score =  204 bits (520), Expect = 3e-50
 Identities = 121/231 (52%), Positives = 148/231 (64%), Gaps = 3/231 (1%)
 Frame = +1

Query: 184  KLRLTNSGITEDCPDPFGFT---DTSDHIPVSQQESNNVEYHHSQETSSSSVVDEDKSNL 354
            KL L N  I ++    F FT     S +  + Q E  N EY HS  TS S   +E+ S+L
Sbjct: 612  KLGLRNGEIQDE--HQFQFTISQQESSNGEICQTEFTNEEYRHSNATSGSQSAEEEYSSL 669

Query: 355  LADCLLTAVKVLMNLSNDNPEGCRQIGSCGGLEILSSLITGHFPSFSLPLPXXXXXXXXX 534
            L+DCLL AVKVLMNL+NDNP GC+QI + G LE LS+LI  HFPSF   LP         
Sbjct: 670  LSDCLLAAVKVLMNLTNDNPLGCQQIAASGALETLSTLIASHFPSFCSYLP----RVSEM 725

Query: 535  XXXXXXXTIYHCSNTPLTDQELDFLVAILGLLVNLVEKDGGNRSQLAAASVSIPHIVGLE 714
                    ++  ++ PLTD ELDFLVAILGLLVNLVEKD  NRS+LAAASV +P+  GL 
Sbjct: 726  EENSLSLELHDRNDRPLTDPELDFLVAILGLLVNLVEKDEHNRSRLAAASVFVPNSEGLA 785

Query: 715  SEKQSNMIPILCSIFLANQGTSEDAGEEKSLSWEDEESILQGEKEAEKMIV 867
             + Q  +IP+LC+IFLANQG  + AGE   L W DE ++LQ EKEAEKMI+
Sbjct: 786  EKSQMAVIPLLCAIFLANQGEDDAAGE--VLPWNDEAAVLQEEKEAEKMIL 834


>ref|XP_007025685.1| WAPL protein, putative isoform 3 [Theobroma cacao]
            gi|508781051|gb|EOY28307.1| WAPL protein, putative
            isoform 3 [Theobroma cacao]
          Length = 928

 Score =  204 bits (520), Expect = 3e-50
 Identities = 121/231 (52%), Positives = 148/231 (64%), Gaps = 3/231 (1%)
 Frame = +1

Query: 184  KLRLTNSGITEDCPDPFGFT---DTSDHIPVSQQESNNVEYHHSQETSSSSVVDEDKSNL 354
            KL L N  I ++    F FT     S +  + Q E  N EY HS  TS S   +E+ S+L
Sbjct: 612  KLGLRNGEIQDE--HQFQFTISQQESSNGEICQTEFTNEEYRHSNATSGSQSAEEEYSSL 669

Query: 355  LADCLLTAVKVLMNLSNDNPEGCRQIGSCGGLEILSSLITGHFPSFSLPLPXXXXXXXXX 534
            L+DCLL AVKVLMNL+NDNP GC+QI + G LE LS+LI  HFPSF   LP         
Sbjct: 670  LSDCLLAAVKVLMNLTNDNPLGCQQIAASGALETLSTLIASHFPSFCSYLP----RVSEM 725

Query: 535  XXXXXXXTIYHCSNTPLTDQELDFLVAILGLLVNLVEKDGGNRSQLAAASVSIPHIVGLE 714
                    ++  ++ PLTD ELDFLVAILGLLVNLVEKD  NRS+LAAASV +P+  GL 
Sbjct: 726  EENSLSLELHDRNDRPLTDPELDFLVAILGLLVNLVEKDEHNRSRLAAASVFVPNSEGLA 785

Query: 715  SEKQSNMIPILCSIFLANQGTSEDAGEEKSLSWEDEESILQGEKEAEKMIV 867
             + Q  +IP+LC+IFLANQG  + AGE   L W DE ++LQ EKEAEKMI+
Sbjct: 786  EKSQMAVIPLLCAIFLANQGEDDAAGE--VLPWNDEAAVLQEEKEAEKMIL 834


>ref|XP_007025683.1| WAPL protein, putative isoform 1 [Theobroma cacao]
            gi|590624723|ref|XP_007025684.1| WAPL protein, putative
            isoform 1 [Theobroma cacao] gi|508781049|gb|EOY28305.1|
            WAPL protein, putative isoform 1 [Theobroma cacao]
            gi|508781050|gb|EOY28306.1| WAPL protein, putative
            isoform 1 [Theobroma cacao]
          Length = 903

 Score =  204 bits (520), Expect = 3e-50
 Identities = 121/231 (52%), Positives = 148/231 (64%), Gaps = 3/231 (1%)
 Frame = +1

Query: 184  KLRLTNSGITEDCPDPFGFT---DTSDHIPVSQQESNNVEYHHSQETSSSSVVDEDKSNL 354
            KL L N  I ++    F FT     S +  + Q E  N EY HS  TS S   +E+ S+L
Sbjct: 612  KLGLRNGEIQDE--HQFQFTISQQESSNGEICQTEFTNEEYRHSNATSGSQSAEEEYSSL 669

Query: 355  LADCLLTAVKVLMNLSNDNPEGCRQIGSCGGLEILSSLITGHFPSFSLPLPXXXXXXXXX 534
            L+DCLL AVKVLMNL+NDNP GC+QI + G LE LS+LI  HFPSF   LP         
Sbjct: 670  LSDCLLAAVKVLMNLTNDNPLGCQQIAASGALETLSTLIASHFPSFCSYLP----RVSEM 725

Query: 535  XXXXXXXTIYHCSNTPLTDQELDFLVAILGLLVNLVEKDGGNRSQLAAASVSIPHIVGLE 714
                    ++  ++ PLTD ELDFLVAILGLLVNLVEKD  NRS+LAAASV +P+  GL 
Sbjct: 726  EENSLSLELHDRNDRPLTDPELDFLVAILGLLVNLVEKDEHNRSRLAAASVFVPNSEGLA 785

Query: 715  SEKQSNMIPILCSIFLANQGTSEDAGEEKSLSWEDEESILQGEKEAEKMIV 867
             + Q  +IP+LC+IFLANQG  + AGE   L W DE ++LQ EKEAEKMI+
Sbjct: 786  EKSQMAVIPLLCAIFLANQGEDDAAGE--VLPWNDEAAVLQEEKEAEKMIL 834


>ref|XP_007214611.1| hypothetical protein PRUPE_ppa001140mg [Prunus persica]
            gi|462410476|gb|EMJ15810.1| hypothetical protein
            PRUPE_ppa001140mg [Prunus persica]
          Length = 897

 Score =  204 bits (520), Expect = 3e-50
 Identities = 117/208 (56%), Positives = 139/208 (66%)
 Frame = +1

Query: 244  DTSDHIPVSQQESNNVEYHHSQETSSSSVVDEDKSNLLADCLLTAVKVLMNLSNDNPEGC 423
            D +  + +SQ+ S+N E H + ETS S  V  + S LLADCLLTAVKVLMNL+NDNP GC
Sbjct: 626  DNTLQLIMSQEASSNGENHLAHETSYSGAVGREGSGLLADCLLTAVKVLMNLANDNPVGC 685

Query: 424  RQIGSCGGLEILSSLITGHFPSFSLPLPXXXXXXXXXXXXXXXXTIYHCSNTPLTDQELD 603
            +QI + GGLE LSSLI  HFP FS                     + H +N  LTDQELD
Sbjct: 686  QQIAANGGLETLSSLIANHFPLFS----SLSSPFSERSENTSSVELGHQNNRHLTDQELD 741

Query: 604  FLVAILGLLVNLVEKDGGNRSQLAAASVSIPHIVGLESEKQSNMIPILCSIFLANQGTSE 783
            FLVAILGLLVNLVEKDG NRS+LAAASV +P   G E E + ++I ++CSIFLANQG  E
Sbjct: 742  FLVAILGLLVNLVEKDGQNRSRLAAASVHVPSSEGFEEESRKDLILLICSIFLANQGAGE 801

Query: 784  DAGEEKSLSWEDEESILQGEKEAEKMIV 867
               EE  L   DE ++LQGE+EAEKMIV
Sbjct: 802  GGAEEMILP-NDEAAVLQGEQEAEKMIV 828


>ref|XP_006449301.1| hypothetical protein CICLE_v10014178mg [Citrus clementina]
            gi|557551912|gb|ESR62541.1| hypothetical protein
            CICLE_v10014178mg [Citrus clementina]
          Length = 940

 Score =  196 bits (497), Expect = 1e-47
 Identities = 108/200 (54%), Positives = 132/200 (66%)
 Frame = +1

Query: 268  SQQESNNVEYHHSQETSSSSVVDEDKSNLLADCLLTAVKVLMNLSNDNPEGCRQIGSCGG 447
            + Q S++ EYH S E+S +   D + S L ADCLLTAVKVLMNL+NDNP GC+QI + GG
Sbjct: 682  NHQVSSSGEYHFSHESSCAHADDSENSTLFADCLLTAVKVLMNLTNDNPIGCQQIAAYGG 741

Query: 448  LEILSSLITGHFPSFSLPLPXXXXXXXXXXXXXXXXTIYHCSNTPLTDQELDFLVAILGL 627
            LE +S LI  HF SFS  +                    H  + PLTDQELDFLVAILGL
Sbjct: 742  LETMSLLIASHFRSFSSSVSPSRDGFESD----------HKDDKPLTDQELDFLVAILGL 791

Query: 628  LVNLVEKDGGNRSQLAAASVSIPHIVGLESEKQSNMIPILCSIFLANQGTSEDAGEEKSL 807
            LVNLVEKD  NRS+LAAA +S+P+  G E E   ++I +LCSIFLANQG  + AGE  + 
Sbjct: 792  LVNLVEKDEDNRSRLAAARISLPNSEGFEEESHRDVIQLLCSIFLANQGAGDPAGEGTAE 851

Query: 808  SWEDEESILQGEKEAEKMIV 867
               DE ++L+GEKEAE MIV
Sbjct: 852  PLNDEAALLEGEKEAEMMIV 871


>ref|XP_004505031.1| PREDICTED: uncharacterized protein LOC101498764 [Cicer arietinum]
          Length = 965

 Score =  195 bits (495), Expect = 2e-47
 Identities = 114/202 (56%), Positives = 138/202 (68%), Gaps = 1/202 (0%)
 Frame = +1

Query: 265  VSQQESNNVEYHHSQETSSSSVVDEDKSNLLADCLLTAVKVLMNLSNDNPEGCRQIGSCG 444
            +SQQES++ + +     SSS +  E+ S+LL DCLLTAVKVLMNL+NDNP GC+QI + G
Sbjct: 704  MSQQESSDGDIN----CSSSDISYEEDSSLLTDCLLTAVKVLMNLTNDNPIGCQQIAANG 759

Query: 445  GLEILSSLITGHFPSFSLPLPXXXXXXXXXXXXXXXXTIYHCSNTPLTDQELDFLVAILG 624
            GLE +S LI GHFPSFS                       H  +  LTD ELDFLVAILG
Sbjct: 760  GLEAMSMLIAGHFPSFSSSSSFAQIKEDSLRIEKD-----HLCDRHLTDHELDFLVAILG 814

Query: 625  LLVNLVEKDGGNRSQLAAASVSIPHIVGLESEKQSNMIPILCSIFLANQGTSE-DAGEEK 801
            LLVNLVEKDG NRS+LAAASV +P   GL+ E + ++I +LCSIFLANQG SE  AGE+K
Sbjct: 815  LLVNLVEKDGRNRSRLAAASVLLPSSEGLDKEVRRDVIQLLCSIFLANQGESEGGAGEDK 874

Query: 802  SLSWEDEESILQGEKEAEKMIV 867
            +    D  ++LQGEKEAEKMIV
Sbjct: 875  NFQLNDPAAVLQGEKEAEKMIV 896


>ref|XP_006467835.1| PREDICTED: uncharacterized protein LOC102612111 [Citrus sinensis]
          Length = 940

 Score =  193 bits (491), Expect = 6e-47
 Identities = 107/200 (53%), Positives = 131/200 (65%)
 Frame = +1

Query: 268  SQQESNNVEYHHSQETSSSSVVDEDKSNLLADCLLTAVKVLMNLSNDNPEGCRQIGSCGG 447
            + Q S++ EYH S E+S +   D + S L ADCLLTAVKVLMNL+NDNP GC+QI + GG
Sbjct: 682  NHQVSSSGEYHFSHESSCAHADDSENSTLFADCLLTAVKVLMNLTNDNPIGCQQIAAYGG 741

Query: 448  LEILSSLITGHFPSFSLPLPXXXXXXXXXXXXXXXXTIYHCSNTPLTDQELDFLVAILGL 627
            LE +S LI  HF SFS  +                    H  + PLTDQELDFLVAILGL
Sbjct: 742  LETMSLLIASHFRSFSSSVSPSRDGFESD----------HKDDRPLTDQELDFLVAILGL 791

Query: 628  LVNLVEKDGGNRSQLAAASVSIPHIVGLESEKQSNMIPILCSIFLANQGTSEDAGEEKSL 807
            LVNLVEKD  NRS+LAAA +S+P+  G E E   ++I +LCSIFLANQG  + AGE  + 
Sbjct: 792  LVNLVEKDEDNRSRLAAARISLPNSEGFEEESHRDVIQLLCSIFLANQGAGDPAGEGTAE 851

Query: 808  SWEDEESILQGEKEAEKMIV 867
               DE ++L+GEKEAE  IV
Sbjct: 852  PLNDEAALLEGEKEAEMTIV 871

