BLASTX nr result

ID: Mentha23_contig00024793 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha23_contig00024793
         (1127 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU37261.1| hypothetical protein MIMGU_mgv1a001119mg [Mimulus...   253   1e-64
ref|XP_006344743.1| PREDICTED: uncharacterized protein LOC102600...   149   2e-33
ref|XP_002278233.1| PREDICTED: uncharacterized protein LOC100264...   147   1e-32
emb|CBI39573.3| unnamed protein product [Vitis vinifera]              146   1e-32
emb|CAN73069.1| hypothetical protein VITISV_005845 [Vitis vinifera]   145   2e-32
ref|XP_004230289.1| PREDICTED: uncharacterized protein LOC101268...   142   3e-31
ref|XP_006433512.1| hypothetical protein CICLE_v10000207mg [Citr...   116   2e-23
ref|XP_006433511.1| hypothetical protein CICLE_v10000207mg [Citr...   116   2e-23
ref|XP_006433510.1| hypothetical protein CICLE_v10000207mg [Citr...   116   2e-23
ref|XP_006472178.1| PREDICTED: dentin sialophosphoprotein-like [...   114   8e-23
ref|XP_007031123.1| Uncharacterized protein isoform 6, partial [...   114   8e-23
ref|XP_007031122.1| Uncharacterized protein isoform 5 [Theobroma...   114   8e-23
ref|XP_007031121.1| Uncharacterized protein isoform 4 [Theobroma...   114   8e-23
ref|XP_007031120.1| Uncharacterized protein isoform 3, partial [...   114   8e-23
ref|XP_007031119.1| Uncharacterized protein isoform 2, partial [...   114   8e-23
ref|XP_007031118.1| Uncharacterized protein isoform 1 [Theobroma...   114   8e-23
ref|XP_002512492.1| conserved hypothetical protein [Ricinus comm...   108   3e-21
ref|XP_006651302.1| PREDICTED: dentin sialophosphoprotein-like [...   103   1e-19
ref|XP_007207147.1| hypothetical protein PRUPE_ppa001266mg [Prun...   103   2e-19
ref|XP_006382417.1| hypothetical protein POPTR_0005s01960g [Popu...   102   3e-19

>gb|EYU37261.1| hypothetical protein MIMGU_mgv1a001119mg [Mimulus guttatus]
          Length = 883

 Score =  253 bits (645), Expect = 1e-64
 Identities = 175/377 (46%), Positives = 221/377 (58%), Gaps = 8/377 (2%)
 Frame = -1

Query: 1127 EKPSSSRRPS-EVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXX 951
            EK SSSRR S EVLR+NNQKQN  S   D N  PSCS+ KE++E NLS NY+NGR     
Sbjct: 352  EKKSSSRRASSEVLRLNNQKQNCVSEGYDENPGPSCSKLKERKEPNLSNNYVNGRTNRTV 411

Query: 950  XXXXXXXXXTSRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDE 771
                     TSR+TNFVAADPGKEV   R KT +KK+L  +G+I        KAM VKDE
Sbjct: 412  NKIVIGDVATSRRTNFVAADPGKEVPLSRPKTNAKKKLSIDGSI------THKAMMVKDE 465

Query: 770  KSIKCNVSFEDD--SKWDGIDKKSSCDVVSFTFTSPIKKHVGSNSSA-AMLEATXXXXXX 600
            KS+K NV+F  D  S+WDG DKKSS DVVSFTFTSPIKK   S+SS   +LEA       
Sbjct: 466  KSVKRNVAFAGDAESEWDGNDKKSSLDVVSFTFTSPIKKSGASSSSCNTILEANSSSFTN 525

Query: 599  XXXSARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXX 420
                   S+LR+S + SS FNVIGGD LS+LLEQKLKEL+S++E SQKD           
Sbjct: 526  SDPCVHGSELRDSGSYSSRFNVIGGDTLSLLLEQKLKELSSKIELSQKDVSE-------- 577

Query: 419  XXXXNMGSAVNLVESMDNGVCESEREAQDGSDCASTEKLWLRAEKSSKVPEYF-DGVVDG 243
                   +A     ++ N + +++ E ++    AS +K+ L+AEK +K  EY  +G  D 
Sbjct: 578  ------SAASCSSSAISNSILKTKEEIKN----ASIDKILLKAEKENKEVEYIEEGDGDD 627

Query: 242  NN---IQRYXXXXXXXXXXXXXXXXXXSCDSFDVDRSASNEGPMGCLSLDSCEGTNWRAT 72
            N+    QRY                  S DSFD+DR++ NEG +  LS++S EG NW +T
Sbjct: 628  NSNIEYQRY-LHLLGSLSASNQPLSRTSSDSFDLDRNSCNEGRLPYLSVESYEGINW-ST 685

Query: 71   RKPHITEGDQEISDTAS 21
            R       + E+SDTAS
Sbjct: 686  R-------NVEVSDTAS 695


>ref|XP_006344743.1| PREDICTED: uncharacterized protein LOC102600562 isoform X1 [Solanum
            tuberosum] gi|565355747|ref|XP_006344744.1| PREDICTED:
            uncharacterized protein LOC102600562 isoform X2 [Solanum
            tuberosum]
          Length = 907

 Score =  149 bits (376), Expect = 2e-33
 Identities = 124/379 (32%), Positives = 171/379 (45%), Gaps = 7/379 (1%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXX 948
            E+ +S  RPS+VLR NNQKQN AS +D  + + S    KEK+   LS+     R      
Sbjct: 355  ERKNSLNRPSDVLRQNNQKQNSASNKDGESSKTSAPYQKEKK---LSSTGNMSRSTKTVS 411

Query: 947  XXXXXXXXTSRKTNFVAADPGKEVSSLRAKTTSK---KRLLANGNIQSSGGVAQKAMAVK 777
                     +   + V  D GK++SS R    S    K+   N +I S G  A   M  K
Sbjct: 412  RIVVNTTTATGIASIVETDVGKDLSSSRDSRVSSFTGKKQSVNVDIGSDGCGADNMMKSK 471

Query: 776  DEKSIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHV-GSNSSAAMLEATXXXXXX 600
            DE+SIKCN++ E  S W+  D+K+  DVVSFTFTSPIKK + G  SS+ +LE        
Sbjct: 472  DERSIKCNLAIEGCSNWETADRKNGSDVVSFTFTSPIKKSMTGPTSSSHVLEKNNALCLF 531

Query: 599  XXXSARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXX 420
                  +SD R S   S     IGGD L +LLEQK+KELTS+V  S +D           
Sbjct: 532  PGSYDDQSDSRTSTMPSF---PIGGDDLGILLEQKIKELTSKVRPSCED---FIKTGTAS 585

Query: 419  XXXXNMGSAVNLVESMDNGVCESEREAQDGSDCASTEKLWLRAEKSSKVPEYFDGVVDGN 240
                    +V++V        +   E       +S + L L A +  + P   +     +
Sbjct: 586  ISASTFEDSVSIVAHGRRPQVDLLNEKAGDHGHSSVDDLRLTATQMWQGPNRVENPKTAS 645

Query: 239  NIQ---RYXXXXXXXXXXXXXXXXXXSCDSFDVDRSASNEGPMGCLSLDSCEGTNWRATR 69
                   +                  SC+S D  RS + +G    LS  S E  NW+   
Sbjct: 646  RFTCEGEFSLPCTSLASSMEPSISGGSCNSLDSYRSLATDGSKYHLSDGSHEMMNWKTYM 705

Query: 68   KPHITEGDQEISDTASSLS 12
            + H  EGD E+ D+ASS+S
Sbjct: 706  RTHFVEGDAELLDSASSVS 724


>ref|XP_002278233.1| PREDICTED: uncharacterized protein LOC100264914 [Vitis vinifera]
          Length = 919

 Score =  147 bits (370), Expect = 1e-32
 Identities = 119/384 (30%), Positives = 170/384 (44%), Gaps = 13/384 (3%)
 Frame = -1

Query: 1124 KPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXXX 945
            K +S+ R S  L+ NNQKQN  S RD    + + S  K K+  +++ ++   +       
Sbjct: 354  KRTSTNRTSNALKQNNQKQNGGSTRDVLTSKTAVSNQKSKKAPSVNGSFGPSKTVNKVVI 413

Query: 944  XXXXXXXTSRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKS 765
                    S+K   VA D  KE S  + K  S+K+L  +GNI   G +A   +  KD KS
Sbjct: 414  NTEAG---SKKMGSVANDIRKESSLSKTKNASRKKLSVDGNICFEGSIADGVLTNKDVKS 470

Query: 764  IKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHV-GSNSSAAMLEATXXXXXXXXXS 588
            IKCNV+ E  + W G + K   DVVSFTFTSP+KK + GS SS  ++EA           
Sbjct: 471  IKCNVAVEGGTDWGGDNIKKGMDVVSFTFTSPMKKPIPGSMSSDQVMEAKYQFNIDSNDE 530

Query: 587  ARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXXXXXX 408
                  +NS+ SS G NVIG D+L VLLEQKL+ELT RV  S  D               
Sbjct: 531  NDAHGSKNSSISSLGPNVIGADSLGVLLEQKLRELTFRVGSSHSDLFAPGTAASSTSRLQ 590

Query: 407  NMGSAVNLVESMDN--------GVCESEREAQDGSDCASTEKLWLRAEKSSKVPEYFDGV 252
            +    VN+V              + E + +     D +S   L    +    V E  + +
Sbjct: 591  DSDLRVNVVAPTSTKHTSRLLPDLHEDKSDGPHYFDFSSVGGLQANQKWQVHVSEGMEEL 650

Query: 251  VDGNNIQR----YXXXXXXXXXXXXXXXXXXSCDSFDVDRSASNEGPMGCLSLDSCEGTN 84
               +N                          +C+S D   S S  G   C   ++ E  +
Sbjct: 651  SGNSNNNEMGNGLSGQHPSPVLSLESSFSNITCNSPDSRNSYSVNGSEQCSLAETDEVDS 710

Query: 83   WRATRKPHITEGDQEISDTASSLS 12
            W +  K  + EG+ E+SD+ASS+S
Sbjct: 711  WTSRSKSQLAEGEAELSDSASSVS 734


>emb|CBI39573.3| unnamed protein product [Vitis vinifera]
          Length = 901

 Score =  146 bits (369), Expect = 1e-32
 Identities = 119/385 (30%), Positives = 170/385 (44%), Gaps = 13/385 (3%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXX 948
            E  +S+ R S  L+ NNQKQN  S RD    + + S  K K+  +++ ++   +      
Sbjct: 335  EVKTSTNRTSNALKQNNQKQNGGSTRDVLTSKTAVSNQKSKKAPSVNGSFGPSKTVNKVV 394

Query: 947  XXXXXXXXTSRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEK 768
                     S+K   VA D  KE S  + K  S+K+L  +GNI   G +A   +  KD K
Sbjct: 395  INTEAG---SKKMGSVANDIRKESSLSKTKNASRKKLSVDGNICFEGSIADGVLTNKDVK 451

Query: 767  SIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHV-GSNSSAAMLEATXXXXXXXXX 591
            SIKCNV+ E  + W G + K   DVVSFTFTSP+KK + GS SS  ++EA          
Sbjct: 452  SIKCNVAVEGGTDWGGDNIKKGMDVVSFTFTSPMKKPIPGSMSSDQVMEAKYQFNIDSND 511

Query: 590  SARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXXXXX 411
                   +NS+ SS G NVIG D+L VLLEQKL+ELT RV  S  D              
Sbjct: 512  ENDAHGSKNSSISSLGPNVIGADSLGVLLEQKLRELTFRVGSSHSDLFAPGTAASSTSRL 571

Query: 410  XNMGSAVNLVESMDN--------GVCESEREAQDGSDCASTEKLWLRAEKSSKVPEYFDG 255
             +    VN+V              + E + +     D +S   L    +    V E  + 
Sbjct: 572  QDSDLRVNVVAPTSTKHTSRLLPDLHEDKSDGPHYFDFSSVGGLQANQKWQVHVSEGMEE 631

Query: 254  VVDGNNIQR----YXXXXXXXXXXXXXXXXXXSCDSFDVDRSASNEGPMGCLSLDSCEGT 87
            +   +N                          +C+S D   S S  G   C   ++ E  
Sbjct: 632  LSGNSNNNEMGNGLSGQHPSPVLSLESSFSNITCNSPDSRNSYSVNGSEQCSLAETDEVD 691

Query: 86   NWRATRKPHITEGDQEISDTASSLS 12
            +W +  K  + EG+ E+SD+ASS+S
Sbjct: 692  SWTSRSKSQLAEGEAELSDSASSVS 716


>emb|CAN73069.1| hypothetical protein VITISV_005845 [Vitis vinifera]
          Length = 1640

 Score =  145 bits (367), Expect = 2e-32
 Identities = 120/376 (31%), Positives = 170/376 (45%), Gaps = 5/376 (1%)
 Frame = -1

Query: 1124 KPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXXX 945
            K +S+ R S  L+ NNQKQN  S RD    + + S  K K+  ++S ++   +       
Sbjct: 1106 KRTSTNRTSNALKQNNQKQNGGSTRDVLTSKTAVSNQKSKKAPSVSGSFGPSKTVNKVVI 1165

Query: 944  XXXXXXXTSRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKS 765
                    S+K   VA D  KE S  + K  S+K+L  +GNI   G +A   +  KD KS
Sbjct: 1166 NTEAG---SKKMGSVANDIRKESSLSKTKNASQKKLSVDGNICFEGSIADGVLTNKDVKS 1222

Query: 764  IKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHV-GSNSSAAMLEATXXXXXXXXXS 588
            IKCNV+ E  + W G + K   DVVSFTFTSP+KK + GS SS  ++EA           
Sbjct: 1223 IKCNVAVEGGTDWGGDNIKKGMDVVSFTFTSPMKKPIPGSMSSDQVMEAKYQFNIDSNDE 1282

Query: 587  ARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXXXXXX 408
                  +NS+ SS G NVIG D+L VLLEQKL+ELT RV  S  D               
Sbjct: 1283 NDAHGSKNSSISSLGLNVIGADSLGVLLEQKLRELTFRVGLSHSDLFAP----------- 1331

Query: 407  NMGSAVNLVESMDNGVCESEREAQDGSDCASTEKLWLRAEKSSKVPEYFD----GVVDGN 240
              G+A +    + +        A   +   S     L  +KS   P YFD    G +  N
Sbjct: 1332 --GTAASSTSRLQDSDLRVNVVAPTSTKHTSRLLPDLHEDKSDG-PHYFDFSSVGGLQAN 1388

Query: 239  NIQRYXXXXXXXXXXXXXXXXXXSCDSFDVDRSASNEGPMGCLSLDSCEGTNWRATRKPH 60
              Q++                    +S + +      G   C   ++ E  +W +  K  
Sbjct: 1389 --QKWQVHVSEGMEELSG-------NSNNNEMGNGLSGSEQCSLAETDEVDSWTSRSKSQ 1439

Query: 59   ITEGDQEISDTASSLS 12
            + EG+ E+SD+ASS+S
Sbjct: 1440 LAEGEAELSDSASSVS 1455


>ref|XP_004230289.1| PREDICTED: uncharacterized protein LOC101268805 [Solanum
            lycopersicum]
          Length = 902

 Score =  142 bits (358), Expect = 3e-31
 Identities = 121/376 (32%), Positives = 169/376 (44%), Gaps = 4/376 (1%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXX 948
            E+ +S  RPS+VLR NNQKQN AS +D  +   S    KEK+ S+        R      
Sbjct: 355  ERKNSLNRPSDVLRQNNQKQNSASNKDGESSNTSAPYHKEKKSSSTGNM---SRSTKTVS 411

Query: 947  XXXXXXXXTSRKTNFVAADPGKEVSSLR---AKTTSKKRLLANGNIQSSGGVAQKAMAVK 777
                     +   + V  D GK++SS R    ++ + K+   N +I S    A   M  K
Sbjct: 412  RIVVNTTAATGIASIVETDVGKDLSSSRDSRVRSFTGKKQPVNVDIGSDECGADNMMKNK 471

Query: 776  DEKSIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHV-GSNSSAAMLEATXXXXXX 600
            DE+SIKCN++ E  S W+  D+K+  DVVSFTFTSPIKK + G  SS+ +LE        
Sbjct: 472  DERSIKCNLTIEGCSNWETADRKNGSDVVSFTFTSPIKKSMPGPTSSSHVLEKNSALCLF 531

Query: 599  XXXSARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFSQKDXXXXXXXXXXX 420
                  +SD R S   S     IGGD L +LLEQK+KELTS+V  S +D           
Sbjct: 532  PGSYDDQSDSRTSTMPSFR---IGGDDLGILLEQKIKELTSKVGPSCED---FIKTGTAS 585

Query: 419  XXXXNMGSAVNLVESMDNGVCESEREAQDGSDCASTEKLWLRAEKSSKVPEYFDGVVDGN 240
                    +V++V        +   E       +S + L L A +  + P   +     +
Sbjct: 586  TSTNAFEDSVSIVAHGRRPQVDLLNEKAGDPGHSSVDDLQLTATQMWQGPNRVENPKTAS 645

Query: 239  NIQRYXXXXXXXXXXXXXXXXXXSCDSFDVDRSASNEGPMGCLSLDSCEGTNWRATRKPH 60
            +I                     SC S D  RS + +G    LS  S    NW+   + H
Sbjct: 646  SIT--CEGEFSLASSMEPSISGGSCSSLDSFRSLATDGSKYHLSDGSHYMMNWKTYMRTH 703

Query: 59   ITEGDQEISDTASSLS 12
            + EGD E+ D+ASS S
Sbjct: 704  LVEGDAELLDSASSAS 719


>ref|XP_006433512.1| hypothetical protein CICLE_v10000207mg [Citrus clementina]
            gi|557535634|gb|ESR46752.1| hypothetical protein
            CICLE_v10000207mg [Citrus clementina]
          Length = 916

 Score =  116 bits (290), Expect = 2e-23
 Identities = 76/222 (34%), Positives = 114/222 (51%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXX 948
            +K + + R + VLR NNQKQN    +D  N +      + ++  + S +    R      
Sbjct: 359  QKGTPTNRTNNVLRQNNQKQNHILNKDGSNLKACVINQQVRKLKSTSGSIGPNRTVSKAV 418

Query: 947  XXXXXXXXTSRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEK 768
                     SR+T     D  KE+SS +AK +S+K+  AN +  S      +    KDE+
Sbjct: 419  ANSETG---SRRTGLTTNDTRKELSSSKAKNSSQKKQSANADSMSVESTDDEMK--KDER 473

Query: 767  SIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHVGSNSSAAMLEATXXXXXXXXXS 588
            SIKCN++ E        ++K+  DVVSFTF+SPI+    + SS  ++             
Sbjct: 474  SIKCNIAIEGGMTRATDNRKTGMDVVSFTFSSPIRSRPDTESSGRVMRTNNCFNIDHFGD 533

Query: 587  ARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFS 462
              +  LRN+++SS   N+IGG+ALSVLLEQKL ELT +V+ S
Sbjct: 534  NNQLYLRNTSSSSPWLNIIGGNALSVLLEQKLMELTCKVDSS 575


>ref|XP_006433511.1| hypothetical protein CICLE_v10000207mg [Citrus clementina]
            gi|557535633|gb|ESR46751.1| hypothetical protein
            CICLE_v10000207mg [Citrus clementina]
          Length = 707

 Score =  116 bits (290), Expect = 2e-23
 Identities = 76/222 (34%), Positives = 114/222 (51%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXX 948
            +K + + R + VLR NNQKQN    +D  N +      + ++  + S +    R      
Sbjct: 359  QKGTPTNRTNNVLRQNNQKQNHILNKDGSNLKACVINQQVRKLKSTSGSIGPNRTVSKAV 418

Query: 947  XXXXXXXXTSRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEK 768
                     SR+T     D  KE+SS +AK +S+K+  AN +  S      +    KDE+
Sbjct: 419  ANSETG---SRRTGLTTNDTRKELSSSKAKNSSQKKQSANADSMSVESTDDEMK--KDER 473

Query: 767  SIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHVGSNSSAAMLEATXXXXXXXXXS 588
            SIKCN++ E        ++K+  DVVSFTF+SPI+    + SS  ++             
Sbjct: 474  SIKCNIAIEGGMTRATDNRKTGMDVVSFTFSSPIRSRPDTESSGRVMRTNNCFNIDHFGD 533

Query: 587  ARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFS 462
              +  LRN+++SS   N+IGG+ALSVLLEQKL ELT +V+ S
Sbjct: 534  NNQLYLRNTSSSSPWLNIIGGNALSVLLEQKLMELTCKVDSS 575


>ref|XP_006433510.1| hypothetical protein CICLE_v10000207mg [Citrus clementina]
            gi|557535632|gb|ESR46750.1| hypothetical protein
            CICLE_v10000207mg [Citrus clementina]
          Length = 702

 Score =  116 bits (290), Expect = 2e-23
 Identities = 76/222 (34%), Positives = 114/222 (51%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXX 948
            +K + + R + VLR NNQKQN    +D  N +      + ++  + S +    R      
Sbjct: 359  QKGTPTNRTNNVLRQNNQKQNHILNKDGSNLKACVINQQVRKLKSTSGSIGPNRTVSKAV 418

Query: 947  XXXXXXXXTSRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEK 768
                     SR+T     D  KE+SS +AK +S+K+  AN +  S      +    KDE+
Sbjct: 419  ANSETG---SRRTGLTTNDTRKELSSSKAKNSSQKKQSANADSMSVESTDDEMK--KDER 473

Query: 767  SIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHVGSNSSAAMLEATXXXXXXXXXS 588
            SIKCN++ E        ++K+  DVVSFTF+SPI+    + SS  ++             
Sbjct: 474  SIKCNIAIEGGMTRATDNRKTGMDVVSFTFSSPIRSRPDTESSGRVMRTNNCFNIDHFGD 533

Query: 587  ARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFS 462
              +  LRN+++SS   N+IGG+ALSVLLEQKL ELT +V+ S
Sbjct: 534  NNQLYLRNTSSSSPWLNIIGGNALSVLLEQKLMELTCKVDSS 575


>ref|XP_006472178.1| PREDICTED: dentin sialophosphoprotein-like [Citrus sinensis]
          Length = 916

 Score =  114 bits (285), Expect = 8e-23
 Identities = 75/222 (33%), Positives = 113/222 (50%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXX 948
            +K + + R + VLR NNQKQN    +D  N +      + ++  + S +    R      
Sbjct: 359  QKGTPTNRTNNVLRQNNQKQNHILNKDGSNLKACVINQQVRKLKSTSGSIGPNRTVSKAV 418

Query: 947  XXXXXXXXTSRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEK 768
                     SR+T     D  KE+SS +AK +S+K+  AN +  S      +    KD++
Sbjct: 419  ANSETG---SRRTGLTTNDTRKELSSSKAKNSSQKKQSANADSMSVESTDNEMK--KDKR 473

Query: 767  SIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHVGSNSSAAMLEATXXXXXXXXXS 588
            SIKCN++ E        ++K+  DVVSFTF+SPI+    + SS  ++             
Sbjct: 474  SIKCNIAIEGGMTRAADNRKTGMDVVSFTFSSPIRSRPDTESSGRVMRTNNCFNIDHFGD 533

Query: 587  ARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFS 462
              +  LRN ++SS   N+IGG+ALSVLLEQKL ELT +V+ S
Sbjct: 534  NNQLYLRNISSSSPWLNIIGGNALSVLLEQKLMELTCKVDSS 575


>ref|XP_007031123.1| Uncharacterized protein isoform 6, partial [Theobroma cacao]
            gi|508719728|gb|EOY11625.1| Uncharacterized protein
            isoform 6, partial [Theobroma cacao]
          Length = 697

 Score =  114 bits (285), Expect = 8e-23
 Identities = 87/224 (38%), Positives = 111/224 (49%), Gaps = 2/224 (0%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXX 948
            EK +S+ R + VLR NNQKQN  S RD      S S+T   ++       +NG       
Sbjct: 359  EKGTSANRTNNVLRPNNQKQNCISTRDY-----STSKTSTLDQHARKARSMNGTIGRNRT 413

Query: 947  XXXXXXXXT--SRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKD 774
                       SRKT  VA D  KE+   R K   KK+   N ++ S    +  +     
Sbjct: 414  LNKVTINSEPQSRKTGSVANDAAKELPMSRRKNLPKKKRPVNEDLASGETSSDTSSINYS 473

Query: 773  EKSIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHVGSNSSAAMLEATXXXXXXXX 594
            EKSIKCNV+       D    K S DVVSFTFTSPI + V   SS+   + +        
Sbjct: 474  EKSIKCNVATNGHLNRDAEKMKKSMDVVSFTFTSPISR-VAEKSSSFDSDPSGDNYLLY- 531

Query: 593  XSARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFS 462
                   L++SA SS GFN+IGGD+LSVLLE+KL+ELT  VE S
Sbjct: 532  -------LKSSAFSSPGFNIIGGDSLSVLLEKKLQELTCGVESS 568


>ref|XP_007031122.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508719727|gb|EOY11624.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 699

 Score =  114 bits (285), Expect = 8e-23
 Identities = 87/224 (38%), Positives = 111/224 (49%), Gaps = 2/224 (0%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXX 948
            EK +S+ R + VLR NNQKQN  S RD      S S+T   ++       +NG       
Sbjct: 358  EKGTSANRTNNVLRPNNQKQNCISTRDY-----STSKTSTLDQHARKARSMNGTIGRNRT 412

Query: 947  XXXXXXXXT--SRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKD 774
                       SRKT  VA D  KE+   R K   KK+   N ++ S    +  +     
Sbjct: 413  LNKVTINSEPQSRKTGSVANDAAKELPMSRRKNLPKKKRPVNEDLASGETSSDTSSINYS 472

Query: 773  EKSIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHVGSNSSAAMLEATXXXXXXXX 594
            EKSIKCNV+       D    K S DVVSFTFTSPI + V   SS+   + +        
Sbjct: 473  EKSIKCNVATNGHLNRDAEKMKKSMDVVSFTFTSPISR-VAEKSSSFDSDPSGDNYLLY- 530

Query: 593  XSARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFS 462
                   L++SA SS GFN+IGGD+LSVLLE+KL+ELT  VE S
Sbjct: 531  -------LKSSAFSSPGFNIIGGDSLSVLLEKKLQELTCGVESS 567


>ref|XP_007031121.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508719726|gb|EOY11623.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 909

 Score =  114 bits (285), Expect = 8e-23
 Identities = 87/224 (38%), Positives = 111/224 (49%), Gaps = 2/224 (0%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXX 948
            EK +S+ R + VLR NNQKQN  S RD      S S+T   ++       +NG       
Sbjct: 358  EKGTSANRTNNVLRPNNQKQNCISTRDY-----STSKTSTLDQHARKARSMNGTIGRNRT 412

Query: 947  XXXXXXXXT--SRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKD 774
                       SRKT  VA D  KE+   R K   KK+   N ++ S    +  +     
Sbjct: 413  LNKVTINSEPQSRKTGSVANDAAKELPMSRRKNLPKKKRPVNEDLASGETSSDTSSINYS 472

Query: 773  EKSIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHVGSNSSAAMLEATXXXXXXXX 594
            EKSIKCNV+       D    K S DVVSFTFTSPI + V   SS+   + +        
Sbjct: 473  EKSIKCNVATNGHLNRDAEKMKKSMDVVSFTFTSPISR-VAEKSSSFDSDPSGDNYLLY- 530

Query: 593  XSARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFS 462
                   L++SA SS GFN+IGGD+LSVLLE+KL+ELT  VE S
Sbjct: 531  -------LKSSAFSSPGFNIIGGDSLSVLLEKKLQELTCGVESS 567


>ref|XP_007031120.1| Uncharacterized protein isoform 3, partial [Theobroma cacao]
            gi|590644591|ref|XP_007031124.1| Uncharacterized protein
            isoform 3, partial [Theobroma cacao]
            gi|508719725|gb|EOY11622.1| Uncharacterized protein
            isoform 3, partial [Theobroma cacao]
            gi|508719729|gb|EOY11626.1| Uncharacterized protein
            isoform 3, partial [Theobroma cacao]
          Length = 787

 Score =  114 bits (285), Expect = 8e-23
 Identities = 87/224 (38%), Positives = 111/224 (49%), Gaps = 2/224 (0%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXX 948
            EK +S+ R + VLR NNQKQN  S RD      S S+T   ++       +NG       
Sbjct: 358  EKGTSANRTNNVLRPNNQKQNCISTRDY-----STSKTSTLDQHARKARSMNGTIGRNRT 412

Query: 947  XXXXXXXXT--SRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKD 774
                       SRKT  VA D  KE+   R K   KK+   N ++ S    +  +     
Sbjct: 413  LNKVTINSEPQSRKTGSVANDAAKELPMSRRKNLPKKKRPVNEDLASGETSSDTSSINYS 472

Query: 773  EKSIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHVGSNSSAAMLEATXXXXXXXX 594
            EKSIKCNV+       D    K S DVVSFTFTSPI + V   SS+   + +        
Sbjct: 473  EKSIKCNVATNGHLNRDAEKMKKSMDVVSFTFTSPISR-VAEKSSSFDSDPSGDNYLLY- 530

Query: 593  XSARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFS 462
                   L++SA SS GFN+IGGD+LSVLLE+KL+ELT  VE S
Sbjct: 531  -------LKSSAFSSPGFNIIGGDSLSVLLEKKLQELTCGVESS 567


>ref|XP_007031119.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508719724|gb|EOY11621.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 840

 Score =  114 bits (285), Expect = 8e-23
 Identities = 87/224 (38%), Positives = 111/224 (49%), Gaps = 2/224 (0%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXX 948
            EK +S+ R + VLR NNQKQN  S RD      S S+T   ++       +NG       
Sbjct: 359  EKGTSANRTNNVLRPNNQKQNCISTRDY-----STSKTSTLDQHARKARSMNGTIGRNRT 413

Query: 947  XXXXXXXXT--SRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKD 774
                       SRKT  VA D  KE+   R K   KK+   N ++ S    +  +     
Sbjct: 414  LNKVTINSEPQSRKTGSVANDAAKELPMSRRKNLPKKKRPVNEDLASGETSSDTSSINYS 473

Query: 773  EKSIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHVGSNSSAAMLEATXXXXXXXX 594
            EKSIKCNV+       D    K S DVVSFTFTSPI + V   SS+   + +        
Sbjct: 474  EKSIKCNVATNGHLNRDAEKMKKSMDVVSFTFTSPISR-VAEKSSSFDSDPSGDNYLLY- 531

Query: 593  XSARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFS 462
                   L++SA SS GFN+IGGD+LSVLLE+KL+ELT  VE S
Sbjct: 532  -------LKSSAFSSPGFNIIGGDSLSVLLEKKLQELTCGVESS 568


>ref|XP_007031118.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508719723|gb|EOY11620.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 910

 Score =  114 bits (285), Expect = 8e-23
 Identities = 87/224 (38%), Positives = 111/224 (49%), Gaps = 2/224 (0%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXX 948
            EK +S+ R + VLR NNQKQN  S RD      S S+T   ++       +NG       
Sbjct: 359  EKGTSANRTNNVLRPNNQKQNCISTRDY-----STSKTSTLDQHARKARSMNGTIGRNRT 413

Query: 947  XXXXXXXXT--SRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKD 774
                       SRKT  VA D  KE+   R K   KK+   N ++ S    +  +     
Sbjct: 414  LNKVTINSEPQSRKTGSVANDAAKELPMSRRKNLPKKKRPVNEDLASGETSSDTSSINYS 473

Query: 773  EKSIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHVGSNSSAAMLEATXXXXXXXX 594
            EKSIKCNV+       D    K S DVVSFTFTSPI + V   SS+   + +        
Sbjct: 474  EKSIKCNVATNGHLNRDAEKMKKSMDVVSFTFTSPISR-VAEKSSSFDSDPSGDNYLLY- 531

Query: 593  XSARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFS 462
                   L++SA SS GFN+IGGD+LSVLLE+KL+ELT  VE S
Sbjct: 532  -------LKSSAFSSPGFNIIGGDSLSVLLEKKLQELTCGVESS 568


>ref|XP_002512492.1| conserved hypothetical protein [Ricinus communis]
            gi|223548453|gb|EEF49944.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 851

 Score =  108 bits (271), Expect = 3e-21
 Identities = 76/224 (33%), Positives = 108/224 (48%)
 Frame = -1

Query: 1124 KPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXXX 945
            K +S  R + VLR NNQKQN +S ++  N + S S    K    +S++    R       
Sbjct: 356  KKTSENRTTNVLRQNNQKQNSSSGKESTNLKNSFSNQAGKRVQTMSSSVGQSRTTNKVVL 415

Query: 944  XXXXXXXTSRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEKS 765
                    SRK + V  D  KE    +    S K+   NG  Q   GV+      + E+S
Sbjct: 416  KPET----SRKMHLVVTDTEKE----KPNNISLKKRPVNGEPQIGRGVSDNESLNRVERS 467

Query: 764  IKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHVGSNSSAAMLEATXXXXXXXXXSA 585
            IKCN++ +        ++K+  DVVSFTFTSP+KK       + M ++          + 
Sbjct: 468  IKCNLAVDGCMNTAVDNRKNGMDVVSFTFTSPVKKATPDPQPSVMEKSKSSVIDLFGSNG 527

Query: 584  RESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFSQKD 453
                  N + S  G N+IGGDAL VLLEQKL+EL ++VE SQ +
Sbjct: 528  HP--YFNKSTSFPGLNIIGGDALGVLLEQKLRELANKVESSQSN 569


>ref|XP_006651302.1| PREDICTED: dentin sialophosphoprotein-like [Oryza brachyantha]
          Length = 949

 Score =  103 bits (257), Expect = 1e-19
 Identities = 81/225 (36%), Positives = 107/225 (47%), Gaps = 5/225 (2%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPS---CSQTKEKEESNLSTNYI-NGRXX 960
            +KPSSS   S VLR NNQKQN    R  G   P+    SQ   K   + ST  + NG   
Sbjct: 380  KKPSSSGTSSPVLRQNNQKQNSMVTR--GKSAPNKSVSSQQGRKMAGDCSTGKLKNGNKI 437

Query: 959  XXXXXXXXXXXXTSRKTNFVAADPGKEVSSLRAKT-TSKKRLLANGNIQSSGGVAQKAMA 783
                         SRK    +    KE SS   K    KKRL+   +    G    +  A
Sbjct: 438  SKGG---------SRKDIIESISGDKEGSSSNNKDFPQKKRLIERNSTNEKGTFVPEKSA 488

Query: 782  VKDEKSIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHVGSNSSAAMLEATXXXXX 603
             + +K ++ NV  ++  KW+  D K S DVVSFTFTSP+ K     S  +    T     
Sbjct: 489  ARIQKQVQPNVVMDEHIKWNN-DSKDSTDVVSFTFTSPLVKPSAGPSRLSGKWDTRGNFS 547

Query: 602  XXXXSARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVE 468
                +  +SD ++   SS G N + GDALS+LLE+KLKEL S++E
Sbjct: 548  LDAINEDDSDKKSEGLSSGGLNFVNGDALSLLLEKKLKELASKIE 592


>ref|XP_007207147.1| hypothetical protein PRUPE_ppa001266mg [Prunus persica]
            gi|462402789|gb|EMJ08346.1| hypothetical protein
            PRUPE_ppa001266mg [Prunus persica]
          Length = 867

 Score =  103 bits (256), Expect = 2e-19
 Identities = 78/227 (34%), Positives = 106/227 (46%), Gaps = 5/227 (2%)
 Frame = -1

Query: 1124 KPSSSRRPSEVLRMNNQKQNRASARD---DGNFEPSCSQTKEKEESNLSTNYINGRXXXX 954
            K +S      VL+ NNQKQN  S +D     N  P+   T+    +N S+     R    
Sbjct: 359  KKTSPDSTKSVLKQNNQKQNCVSNKDKTTSKNIVPN-PPTRRMRSTNGSS-----RPGKT 412

Query: 953  XXXXXXXXXXTSRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKD 774
                       S K   +    GKE S    K  S K      ++     V+  A   +D
Sbjct: 413  VSKVLVNSETGSGKMGSMGNFTGKEFSLSTMKKVSGKLRSVGQDVHLEEAVSDNAFISED 472

Query: 773  EKSIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHVG--SNSSAAMLEATXXXXXX 600
            E+S+KCNVS +  +     ++K + DVVSFTFTSP+K+ +     S   M          
Sbjct: 473  ERSVKCNVSMDGCTSLGADNRKQAMDVVSFTFTSPLKRSISELQCSGQVMSRNNSFYIDS 532

Query: 599  XXXSARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFSQ 459
               + ++    N   SS GFNVIGGDALSVLLEQKL+EL+ +VE SQ
Sbjct: 533  FGNNDQQRYPENFTLSSPGFNVIGGDALSVLLEQKLQELSCKVELSQ 579


>ref|XP_006382417.1| hypothetical protein POPTR_0005s01960g [Populus trichocarpa]
            gi|550337777|gb|ERP60214.1| hypothetical protein
            POPTR_0005s01960g [Populus trichocarpa]
          Length = 915

 Score =  102 bits (254), Expect = 3e-19
 Identities = 76/223 (34%), Positives = 104/223 (46%), Gaps = 1/223 (0%)
 Frame = -1

Query: 1127 EKPSSSRRPSEVLRMNNQKQNRASARDDGNFEPSCSQTKEKEESNLSTNYINGRXXXXXX 948
            +K  S  R S VL+ NN KQN A  +D    + S S  + ++  + S +    R      
Sbjct: 364  QKRISESRTSNVLQQNNLKQNSAPNKDSSGLKNSLSNQQGRKTKSTSGSVGQSRTVKKVV 423

Query: 947  XXXXXXXXTSRKTNFVAADPGKEVSSLRAKTTSKKRLLANGNIQSSGGVAQKAMAVKDEK 768
                      RK   V  D  KE    + K  ++K+   +G++Q            KDE 
Sbjct: 424  VKPETVP---RKMGLVMTDSEKE----KTKNIARKKRSVSGDLQIDRNATPNVSFNKDEM 476

Query: 767  SIKCNVSFEDDSKWDGIDKKSSCDVVSFTFTSPIKKHV-GSNSSAAMLEATXXXXXXXXX 591
            S K NV  + +      ++KS  DVVSFTF+SPIK+    S SS  MLE           
Sbjct: 477  STKSNVVMDGNMNMAMDNRKSGMDVVSFTFSSPIKRATPSSQSSGQMLEKCSSSAIDSFG 536

Query: 590  SARESDLRNSAASSSGFNVIGGDALSVLLEQKLKELTSRVEFS 462
            S     L++S +   G NV+GGD L VLLEQKL+ELT +VE S
Sbjct: 537  SKDHPSLKSSMSYFPGLNVMGGDVLGVLLEQKLRELTYKVESS 579

