BLASTX nr result

ID: Sinomenium21_contig00018580 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00018580
         (1311 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002274314.2| PREDICTED: uncharacterized protein LOC100248...   381   e-103
ref|XP_007024589.1| Uncharacterized protein isoform 6 [Theobroma...   366   1e-98
ref|XP_007024587.1| Uncharacterized protein isoform 4 [Theobroma...   366   1e-98
ref|XP_007024588.1| Uncharacterized protein isoform 5 [Theobroma...   361   4e-97
ref|XP_007024585.1| Uncharacterized protein isoform 2 [Theobroma...   361   4e-97
emb|CAN69468.1| hypothetical protein VITISV_042555 [Vitis vinifera]   353   1e-94
ref|XP_007024584.1| Uncharacterized protein isoform 1 [Theobroma...   351   4e-94
ref|XP_002521347.1| conserved hypothetical protein [Ricinus comm...   350   1e-93
emb|CBI35892.3| unnamed protein product [Vitis vinifera]              342   2e-91
ref|XP_002299597.2| hydroxyproline-rich glycoprotein [Populus tr...   341   4e-91
gb|EXB29673.1| hypothetical protein L484_013447 [Morus notabilis]     337   9e-90
ref|XP_007135474.1| hypothetical protein PHAVU_010G132600g [Phas...   331   5e-88
ref|XP_003528451.1| PREDICTED: dentin sialophosphoprotein-like i...   331   5e-88
ref|XP_006583149.1| PREDICTED: dentin sialophosphoprotein-like i...   325   3e-86
ref|XP_006583148.1| PREDICTED: dentin sialophosphoprotein-like i...   325   3e-86
ref|XP_002304144.2| hypothetical protein POPTR_0003s06200g [Popu...   323   1e-85
ref|XP_004163891.1| PREDICTED: uncharacterized protein LOC101226...   319   1e-84
ref|XP_004141213.1| PREDICTED: uncharacterized protein LOC101203...   319   1e-84
ref|XP_004510436.1| PREDICTED: flocculation protein FLO11-like [...   314   5e-83
ref|XP_006598817.1| PREDICTED: putative uncharacterized protein ...   303   1e-79

>ref|XP_002274314.2| PREDICTED: uncharacterized protein LOC100248075 [Vitis vinifera]
          Length = 860

 Score =  381 bits (978), Expect = e-103
 Identities = 219/442 (49%), Positives = 280/442 (63%), Gaps = 9/442 (2%)
 Frame = -3

Query: 1303 SRLDAGTPNISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEV 1124
            SR++ GT  + A VRKTIQSIKEIV NHS+ADIYVTL+E+NMDPNET QKLL QDPFHEV
Sbjct: 5    SRMEGGTQILPARVRKTIQSIKEIVGNHSDADIYVTLRETNMDPNETTQKLLYQDPFHEV 64

Query: 1123 RRRRDKKKENMSYMASVEQRRQTEHTQVVKSQTFPDRNVRRGGFVRNSL-------PGVS 965
            +R+RDKKKE+  Y    E R   E+    K ++FPDRNVRRGG+ R++L        G+ 
Sbjct: 65   KRKRDKKKESTGYKRPTEPRIYIENVGQGKFRSFPDRNVRRGGYSRSTLMVRILLDAGIG 124

Query: 964  REFRVVRDNRVNQNTSRDIKPSSLQSSSSANVEVFQNVPAK-SPTGNLTDQEHLAARNSE 788
            REFRVVRDNRVNQNT+RD+KP S Q ++S N +V  N+  K + TG   +Q+  + R   
Sbjct: 125  REFRVVRDNRVNQNTNRDMKPVSPQLATSVNEQVISNISEKGNSTGTSNNQKPSSGR--- 181

Query: 787  EHKSSQATNRPSHSTSAQVQEVYSSGAHRKGLFEDTWSKVPSSVSDLTQGLKPRNSHPSS 608
              +SSQ+ N P+ +     Q+  SSG++RK L E+  + +P++VS + Q +KP +S P S
Sbjct: 182  --QSSQSLNGPTDARPGIPQDANSSGSNRKELLEERQATIPNAVSRV-QAVKPNDSQPYS 238

Query: 607  ATLASSNSEVGVYSSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXX 428
            A+LAS++S VGVYSS SDPVHVPSPDSRSS  VGAIKREVGVVGVRRQ +ENS KH    
Sbjct: 239  ASLASNSSVVGVYSSSSDPVHVPSPDSRSSAIVGAIKREVGVVGVRRQSTENSVKHSSAP 298

Query: 427  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKA 248
                                             Q TV + ++ ++  +RSF+GNQY S+ 
Sbjct: 299  SSSLPSSLLGRENSPSTEPFRPFNAIPKSDQPRQTTVPDHVIPSMPVNRSFLGNQYGSRP 358

Query: 247  HKL-MGHQKAMQPNMEWXXXXXXXXXXXXSGVIGTAVTLISPPTNNPTDSIMEADHLQGK 71
            H+  +GHQKA QPN EW             GVIGT    +SP  +N  D   E   LQ K
Sbjct: 359  HQQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQDK 418

Query: 70   FSQLNITENQHVIIPQHLRVPE 5
             SQ +I+ENQ+VII QH+RVPE
Sbjct: 419  LSQASISENQNVIIAQHIRVPE 440


>ref|XP_007024589.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508779955|gb|EOY27211.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 839

 Score =  366 bits (939), Expect = 1e-98
 Identities = 210/427 (49%), Positives = 272/427 (63%), Gaps = 2/427 (0%)
 Frame = -3

Query: 1279 NISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEVRRRRDKKK 1100
            +ISA VRKTIQSIKEIV NHS+ADIYV LKE+NMDPNET QKLL+QD FHEVRR+RD+KK
Sbjct: 10   DISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDTFHEVRRKRDRKK 69

Query: 1099 ENMSYMASVEQRRQTEHT-QVVKSQTFPDRNVRRGGFVRNSLPGVSREFRVVRDNRVNQN 923
            E++ Y  S++ R+++E+  Q +K + +P+R  RRG + RN+LPGV+REFRVVRDNRVNQN
Sbjct: 70   ESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPGVNREFRVVRDNRVNQN 129

Query: 922  TSRDIKPSSLQSSSSANVEVFQNVPAKSPTGNLTDQEHLAARNSEEHKSSQATNRPSHST 743
             ++D+K    Q S+SAN +V  NV  K  TG  ++Q   ++R+      SQ +N PS S 
Sbjct: 130  ANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRS-----LSQTSNGPSSSQ 184

Query: 742  SAQVQEVYSSGAHRKGLFEDTWSKVPSSVSDLTQGLKPRNSHPSSATLASSNSEVGVYSS 563
            +   ++  SSG  RK + E+  + +P++V   +Q +KP NS   +AT +SS+S VGVYSS
Sbjct: 185  TRHARDANSSGIDRKEISEEKRNFIPNAVL-RSQAVKPNNSQAHAATQSSSSSVVGVYSS 243

Query: 562  FSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXXXXXXXXXXX 383
             +DPVHVPSPDSRSSG VGAIKREVGVVGVRRQPSEN+ K                    
Sbjct: 244  STDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNSLVGRDNSS 303

Query: 382  XXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAH-KLMGHQKAMQPNM 206
                                T  E++M  +S SRSF+ NQY S+ + + +GHQKA Q N 
Sbjct: 304  EAFRSFPSISRADQLSHTSAT--ESIMPGISGSRSFLSNQYGSRQNQQALGHQKANQHNK 361

Query: 205  EWXXXXXXXXXXXXSGVIGTAVTLISPPTNNPTDSIMEADHLQGKFSQLNITENQHVIIP 26
            EW             GVIGT     SPP ++      E   LQ KFSQ+NI EN++VII 
Sbjct: 362  EWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYENENVIIA 421

Query: 25   QHLRVPE 5
            QH+RVPE
Sbjct: 422  QHIRVPE 428


>ref|XP_007024587.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508779953|gb|EOY27209.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 849

 Score =  366 bits (939), Expect = 1e-98
 Identities = 210/427 (49%), Positives = 272/427 (63%), Gaps = 2/427 (0%)
 Frame = -3

Query: 1279 NISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEVRRRRDKKK 1100
            +ISA VRKTIQSIKEIV NHS+ADIYV LKE+NMDPNET QKLL+QD FHEVRR+RD+KK
Sbjct: 10   DISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDTFHEVRRKRDRKK 69

Query: 1099 ENMSYMASVEQRRQTEHT-QVVKSQTFPDRNVRRGGFVRNSLPGVSREFRVVRDNRVNQN 923
            E++ Y  S++ R+++E+  Q +K + +P+R  RRG + RN+LPGV+REFRVVRDNRVNQN
Sbjct: 70   ESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPGVNREFRVVRDNRVNQN 129

Query: 922  TSRDIKPSSLQSSSSANVEVFQNVPAKSPTGNLTDQEHLAARNSEEHKSSQATNRPSHST 743
             ++D+K    Q S+SAN +V  NV  K  TG  ++Q   ++R+      SQ +N PS S 
Sbjct: 130  ANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRS-----LSQTSNGPSSSQ 184

Query: 742  SAQVQEVYSSGAHRKGLFEDTWSKVPSSVSDLTQGLKPRNSHPSSATLASSNSEVGVYSS 563
            +   ++  SSG  RK + E+  + +P++V   +Q +KP NS   +AT +SS+S VGVYSS
Sbjct: 185  TRHARDANSSGIDRKEISEEKRNFIPNAVL-RSQAVKPNNSQAHAATQSSSSSVVGVYSS 243

Query: 562  FSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXXXXXXXXXXX 383
             +DPVHVPSPDSRSSG VGAIKREVGVVGVRRQPSEN+ K                    
Sbjct: 244  STDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNSLVGRDNSS 303

Query: 382  XXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAH-KLMGHQKAMQPNM 206
                                T  E++M  +S SRSF+ NQY S+ + + +GHQKA Q N 
Sbjct: 304  EAFRSFPSISRADQLSHTSAT--ESIMPGISGSRSFLSNQYGSRQNQQALGHQKANQHNK 361

Query: 205  EWXXXXXXXXXXXXSGVIGTAVTLISPPTNNPTDSIMEADHLQGKFSQLNITENQHVIIP 26
            EW             GVIGT     SPP ++      E   LQ KFSQ+NI EN++VII 
Sbjct: 362  EWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYENENVIIA 421

Query: 25   QHLRVPE 5
            QH+RVPE
Sbjct: 422  QHIRVPE 428


>ref|XP_007024588.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508779954|gb|EOY27210.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 842

 Score =  361 bits (926), Expect = 4e-97
 Identities = 210/429 (48%), Positives = 272/429 (63%), Gaps = 4/429 (0%)
 Frame = -3

Query: 1279 NISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEVRRRRDKKK 1100
            +ISA VRKTIQSIKEIV NHS+ADIYV LKE+NMDPNET QKLL+QD FHEVRR+RD+KK
Sbjct: 10   DISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDTFHEVRRKRDRKK 69

Query: 1099 ENMSYMASVEQRRQTEHT-QVVKSQTFPDRNVRRGGFVRNSLP--GVSREFRVVRDNRVN 929
            E++ Y  S++ R+++E+  Q +K + +P+R  RRG + RN+LP  GV+REFRVVRDNRVN
Sbjct: 70   ESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPDAGVNREFRVVRDNRVN 129

Query: 928  QNTSRDIKPSSLQSSSSANVEVFQNVPAKSPTGNLTDQEHLAARNSEEHKSSQATNRPSH 749
            QN ++D+K    Q S+SAN +V  NV  K  TG  ++Q   ++R+      SQ +N PS 
Sbjct: 130  QNANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRS-----LSQTSNGPSS 184

Query: 748  STSAQVQEVYSSGAHRKGLFEDTWSKVPSSVSDLTQGLKPRNSHPSSATLASSNSEVGVY 569
            S +   ++  SSG  RK + E+  + +P++V   +Q +KP NS   +AT +SS+S VGVY
Sbjct: 185  SQTRHARDANSSGIDRKEISEEKRNFIPNAVL-RSQAVKPNNSQAHAATQSSSSSVVGVY 243

Query: 568  SSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXXXXXXXXX 389
            SS +DPVHVPSPDSRSSG VGAIKREVGVVGVRRQPSEN+ K                  
Sbjct: 244  SSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNSLVGRDN 303

Query: 388  XXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAH-KLMGHQKAMQP 212
                                  T  E++M  +S SRSF+ NQY S+ + + +GHQKA Q 
Sbjct: 304  SSEAFRSFPSISRADQLSHTSAT--ESIMPGISGSRSFLSNQYGSRQNQQALGHQKANQH 361

Query: 211  NMEWXXXXXXXXXXXXSGVIGTAVTLISPPTNNPTDSIMEADHLQGKFSQLNITENQHVI 32
            N EW             GVIGT     SPP ++      E   LQ KFSQ+NI EN++VI
Sbjct: 362  NKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYENENVI 421

Query: 31   IPQHLRVPE 5
            I QH+RVPE
Sbjct: 422  IAQHIRVPE 430


>ref|XP_007024585.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508779951|gb|EOY27207.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 852

 Score =  361 bits (926), Expect = 4e-97
 Identities = 210/429 (48%), Positives = 272/429 (63%), Gaps = 4/429 (0%)
 Frame = -3

Query: 1279 NISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEVRRRRDKKK 1100
            +ISA VRKTIQSIKEIV NHS+ADIYV LKE+NMDPNET QKLL+QD FHEVRR+RD+KK
Sbjct: 10   DISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDTFHEVRRKRDRKK 69

Query: 1099 ENMSYMASVEQRRQTEHT-QVVKSQTFPDRNVRRGGFVRNSLP--GVSREFRVVRDNRVN 929
            E++ Y  S++ R+++E+  Q +K + +P+R  RRG + RN+LP  GV+REFRVVRDNRVN
Sbjct: 70   ESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPDAGVNREFRVVRDNRVN 129

Query: 928  QNTSRDIKPSSLQSSSSANVEVFQNVPAKSPTGNLTDQEHLAARNSEEHKSSQATNRPSH 749
            QN ++D+K    Q S+SAN +V  NV  K  TG  ++Q   ++R+      SQ +N PS 
Sbjct: 130  QNANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRS-----LSQTSNGPSS 184

Query: 748  STSAQVQEVYSSGAHRKGLFEDTWSKVPSSVSDLTQGLKPRNSHPSSATLASSNSEVGVY 569
            S +   ++  SSG  RK + E+  + +P++V   +Q +KP NS   +AT +SS+S VGVY
Sbjct: 185  SQTRHARDANSSGIDRKEISEEKRNFIPNAVL-RSQAVKPNNSQAHAATQSSSSSVVGVY 243

Query: 568  SSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXXXXXXXXX 389
            SS +DPVHVPSPDSRSSG VGAIKREVGVVGVRRQPSEN+ K                  
Sbjct: 244  SSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNSLVGRDN 303

Query: 388  XXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAH-KLMGHQKAMQP 212
                                  T  E++M  +S SRSF+ NQY S+ + + +GHQKA Q 
Sbjct: 304  SSEAFRSFPSISRADQLSHTSAT--ESIMPGISGSRSFLSNQYGSRQNQQALGHQKANQH 361

Query: 211  NMEWXXXXXXXXXXXXSGVIGTAVTLISPPTNNPTDSIMEADHLQGKFSQLNITENQHVI 32
            N EW             GVIGT     SPP ++      E   LQ KFSQ+NI EN++VI
Sbjct: 362  NKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYENENVI 421

Query: 31   IPQHLRVPE 5
            I QH+RVPE
Sbjct: 422  IAQHIRVPE 430


>emb|CAN69468.1| hypothetical protein VITISV_042555 [Vitis vinifera]
          Length = 914

 Score =  353 bits (905), Expect = 1e-94
 Identities = 217/496 (43%), Positives = 278/496 (56%), Gaps = 63/496 (12%)
 Frame = -3

Query: 1303 SRLDAGTPNISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNET------------- 1163
            SR++ G   +   V KTIQ IKEIV NHS+ADIYV L+E NMDPNET             
Sbjct: 5    SRMEGGMQILPPQVHKTIQLIKEIVGNHSDADIYVALREMNMDPNETVQKLLNQDLDIHV 64

Query: 1162 ------------AQKLLNQDPFHEVRRRRDKKKENMSYMASVEQRRQTEHTQVVKSQTFP 1019
                        AQKLLNQDPFHEV+R+RDKKKE+  Y    E R   E+    K ++FP
Sbjct: 65   MLREMNMDPNEVAQKLLNQDPFHEVKRKRDKKKESTGYKRPTEPRIYIENVGQGKFRSFP 124

Query: 1018 DRNVRRGGFVRNSLPG------------------------------------VSREFRVV 947
            DRNVRRGG+ R+++PG                                    + REFRVV
Sbjct: 125  DRNVRRGGYSRSTVPGNAKTYQFYHSFVLELLYLTVCFLLSELMVRILLDAGIGREFRVV 184

Query: 946  RDNRVNQNTSRDIKPSSLQSSSSANVEVFQNVPAK-SPTGNLTDQEHLAARNSEEHKSSQ 770
            RDNRVNQNT+RD+KP S Q ++SAN +V  N+  K + TG   +Q+  + R     +SSQ
Sbjct: 185  RDNRVNQNTNRDMKPVSPQLATSANEQVISNISEKGNSTGTSNNQKPSSGR-----QSSQ 239

Query: 769  ATNRPSHSTSAQVQEVYSSGAHRKGLFEDTWSKVPSSVSDLTQGLKPRNSHPSSATLASS 590
            + N P+ +     Q+  SSG++RK L E+  + +P++VS + Q +KP +S P SA+LAS+
Sbjct: 240  SLNGPTDARPGIPQDANSSGSNRKELLEERQATIPNAVSRV-QAVKPNDSQPYSASLASN 298

Query: 589  NSEVGVYSSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXX 410
            +S VGVYSS SDPVHVPSPDSRSS  VGAIKREVGVVGVRRQ +ENS KH          
Sbjct: 299  SSVVGVYSSSSDPVHVPSPDSRSSAIVGAIKREVGVVGVRRQSTENSVKHSSAPSSSLPS 358

Query: 409  XXXXXXXXXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAHKL-MG 233
                                       Q TV + ++ ++  +RSF+GNQY S+ H+  +G
Sbjct: 359  SLLGRENSPSTEPFRPFNAIPKSDQPRQTTVPDHVIPSMPVNRSFLGNQYGSRPHQQPVG 418

Query: 232  HQKAMQPNMEWXXXXXXXXXXXXSGVIGTAVTLISPPTNNPTDSIMEADHLQGKFSQLNI 53
            HQKA QPN EW             GVIGT    +SP  +N  D   E   LQ K SQ +I
Sbjct: 419  HQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQDKLSQASI 478

Query: 52   TENQHVIIPQHLRVPE 5
            +ENQ+VII QH+RVPE
Sbjct: 479  SENQNVIIAQHIRVPE 494


>ref|XP_007024584.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508779950|gb|EOY27206.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 883

 Score =  351 bits (901), Expect = 4e-94
 Identities = 210/454 (46%), Positives = 272/454 (59%), Gaps = 29/454 (6%)
 Frame = -3

Query: 1279 NISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEVRRRRDKKK 1100
            +ISA VRKTIQSIKEIV NHS+ADIYV LKE+NMDPNET QKLL+QD FHEVRR+RD+KK
Sbjct: 10   DISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDTFHEVRRKRDRKK 69

Query: 1099 ENMSYMASVEQRRQTEHT-QVVKSQTFPDRNVRRGGFVRNSLPGVSREFRVVRDNRVNQN 923
            E++ Y  S++ R+++E+  Q +K + +P+R  RRG + RN+LPGV+REFRVVRDNRVNQN
Sbjct: 70   ESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPGVNREFRVVRDNRVNQN 129

Query: 922  TSRDIKPSSLQSSSSANVEVFQNVPAKSPTGNLTDQEHLAARNSEEHKSSQATNRPSHST 743
             ++D+K    Q S+SAN +V  NV  K  TG  ++Q   ++R+      SQ +N PS S 
Sbjct: 130  ANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRS-----LSQTSNGPSSSQ 184

Query: 742  SAQVQEVYSSGAHRKGLFEDTWSKVPSSVSDLTQGLKPRNSHPSSATLASSNSEVGVYSS 563
            +   ++  SSG  RK + E+  + +P++V   +Q +KP NS   +AT +SS+S VGVYSS
Sbjct: 185  TRHARDANSSGIDRKEISEEKRNFIPNAVL-RSQAVKPNNSQAHAATQSSSSSVVGVYSS 243

Query: 562  FSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXXXXXXXXXXX 383
             +DPVHVPSPDSRSSG VGAIKREVGVVGVRRQPSEN+ K                    
Sbjct: 244  STDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNSLVGRDNSS 303

Query: 382  XXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAH-KLMGHQK------ 224
                                T  E++M  +S SRSF+ NQY S+ + + +GHQK      
Sbjct: 304  EAFRSFPSISRADQLSHTSAT--ESIMPGISGSRSFLSNQYGSRQNQQALGHQKEASYCS 361

Query: 223  ---------------------AMQPNMEWXXXXXXXXXXXXSGVIGTAVTLISPPTNNPT 107
                                 A Q N EW             GVIGT     SPP ++  
Sbjct: 362  AFHPFIDQISLWESLSCIFDAANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAK 421

Query: 106  DSIMEADHLQGKFSQLNITENQHVIIPQHLRVPE 5
                E   LQ KFSQ+NI EN++VII QH+RVPE
Sbjct: 422  GLDSETAKLQDKFSQVNIYENENVIIAQHIRVPE 455


>ref|XP_002521347.1| conserved hypothetical protein [Ricinus communis]
            gi|223539425|gb|EEF41015.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 864

 Score =  350 bits (897), Expect = 1e-93
 Identities = 207/431 (48%), Positives = 264/431 (61%), Gaps = 4/431 (0%)
 Frame = -3

Query: 1285 TPNISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEVRRRRDK 1106
            T  +SA VRKTIQSIKEIV N S+ADIY+ LKE+NMDPNETAQKLLNQDPFHEV+R+RDK
Sbjct: 18   THTLSATVRKTIQSIKEIVGNFSDADIYMALKETNMDPNETAQKLLNQDPFHEVKRKRDK 77

Query: 1105 KKENMSYMASVEQRRQTEHT-QVVKSQTFPDRNVRRGGFVRNSLP---GVSREFRVVRDN 938
            KKE+M+Y  S++ R+  E+  Q  K +TF DRN R+GG++R ++P   G++REFRVVRDN
Sbjct: 78   KKESMAYRGSLDSRKNPENMGQGTKFRTFSDRNTRQGGYIRAAVPGNAGINREFRVVRDN 137

Query: 937  RVNQNTSRDIKPSSLQSSSSANVEVFQNVPAKSPTGNLTDQEHLAARNSEEHKSSQATNR 758
            RVN NT+R+ KP+  Q S S++      V  K  +G+  + +H   R+     SSQA+N 
Sbjct: 138  RVNLNTTREPKPAMQQGSISSDELGISTVTEKGSSGSSGNVKHSGVRS-----SSQASNG 192

Query: 757  PSHSTSAQVQEVYSSGAHRKGLFEDTWSKVPSSVSDLTQGLKPRNSHPSSATLASSNSEV 578
            P  S S   ++  S+   RK + E+  + VPS+ S + Q +KP + H  SATLASSNS V
Sbjct: 193  PPDSQSRHTRDATSNFTDRKAMTEEKRAVVPSAASRI-QVMKPSSQH-HSATLASSNSVV 250

Query: 577  GVYSSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXXXXXX 398
            GVYSS  DPVHVPSP+SRSS  VGAIKREVGVVG RRQ SEN+ K+              
Sbjct: 251  GVYSSSMDPVHVPSPESRSSAAVGAIKREVGVVGGRRQSSENAVKNSSASSSSFSNSVLG 310

Query: 397  XXXXXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAHKLMGHQKAM 218
                                    V  +E+ M ++S  RSF+GNQY+      +GHQKA 
Sbjct: 311  RDGSLPESFQPFPTISKNDQVNEPV-ATESAMPSISVGRSFLGNQYSRTHQTAVGHQKAT 369

Query: 217  QPNMEWXXXXXXXXXXXXSGVIGTAVTLISPPTNNPTDSIMEADHLQGKFSQLNITENQH 38
            Q N EW             GVIGT     SPP  N  D   +A  +Q K  ++NI ENQ+
Sbjct: 370  QHNKEWKPKSSQKASVGSPGVIGTPTKSSSPPAGNSKDLESDATDMQEKLLRVNIYENQN 429

Query: 37   VIIPQHLRVPE 5
            VII QH+RVPE
Sbjct: 430  VIIAQHIRVPE 440


>emb|CBI35892.3| unnamed protein product [Vitis vinifera]
          Length = 809

 Score =  342 bits (877), Expect = 2e-91
 Identities = 208/451 (46%), Positives = 264/451 (58%), Gaps = 18/451 (3%)
 Frame = -3

Query: 1303 SRLDAGTPNISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEV 1124
            SR++ GT  + A VRKTIQSIKEIV NHS+ADIYVTL+E+NMDPNET QKLL QDPFHEV
Sbjct: 5    SRMEGGTQILPARVRKTIQSIKEIVGNHSDADIYVTLRETNMDPNETTQKLLYQDPFHEV 64

Query: 1123 RRRRDKKKENMSYMASVEQRRQTEHTQVVKSQTFPDRNVRRGGFVRNSLP---------- 974
            +R+RDKKKE+  Y    E R   E+    K ++FPDRNVRRGG+ R+++P          
Sbjct: 65   KRKRDKKKESTGYKRPTEPRIYIENVGQGKFRSFPDRNVRRGGYSRSTVPGNAKTYQFYH 124

Query: 973  ------GVSREFRVVRDNRVNQNTSRDIKPSSLQSSSSANVEVFQNVPAK-SPTGNLTDQ 815
                  G+ REFRVVRDNRVNQNT+RD+KP S Q ++S N +V  N+  K + TG   +Q
Sbjct: 125  SILLDAGIGREFRVVRDNRVNQNTNRDMKPVSPQLATSVNEQVISNISEKGNSTGTSNNQ 184

Query: 814  EHLAARNSEEHKSSQATNRPSHSTSAQVQEVYSSGAHRKGLFEDTWSKVPSSVSDLTQGL 635
            +  + R     +SSQ+ N P+ +              R G+ +D               +
Sbjct: 185  KPSSGR-----QSSQSLNGPTDA--------------RPGIPQD------------ANSM 213

Query: 634  KPRNSHPSSATLASSNSEVGVYSSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSE 455
            KP +S P SA+LAS++S VGVYSS SDPVHVPSPDSRSS  VGAIKREVGVVGVRRQ +E
Sbjct: 214  KPNDSQPYSASLASNSSVVGVYSSSSDPVHVPSPDSRSSAIVGAIKREVGVVGVRRQSTE 273

Query: 454  NSAKHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSF 275
            NS+                                       Q TV + ++ ++  +RSF
Sbjct: 274  NSSDQ-----------------------------------PRQTTVPDHVIPSMPVNRSF 298

Query: 274  VGNQYNSKAHKL-MGHQKAMQPNMEWXXXXXXXXXXXXSGVIGTAVTLISPPTNNPTDSI 98
            +GNQY S+ H+  +GHQKA QPN EW             GVIGT    +SP  +N  D  
Sbjct: 299  LGNQYGSRPHQQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLE 358

Query: 97   MEADHLQGKFSQLNITENQHVIIPQHLRVPE 5
             E   LQ K SQ +I+ENQ+VII QH+RVPE
Sbjct: 359  SETAKLQDKLSQASISENQNVIIAQHIRVPE 389


>ref|XP_002299597.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550347518|gb|EEE84402.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 854

 Score =  341 bits (875), Expect = 4e-91
 Identities = 205/432 (47%), Positives = 259/432 (59%), Gaps = 5/432 (1%)
 Frame = -3

Query: 1285 TPNISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEVRRRRDK 1106
            T  +SA VRKTIQSIKEIV N S+ADIY+ LKE+NMDPNETAQKLLNQDPFHEV+R+R+K
Sbjct: 24   THTLSAKVRKTIQSIKEIVGNFSDADIYMVLKETNMDPNETAQKLLNQDPFHEVKRKREK 83

Query: 1105 KKENMSYMASVEQRRQTEH-TQVVKSQTFPDRNVRRGGFVRNSLP---GVSREFRVVRDN 938
            KKEN SY  SV+ R+ +E+  Q ++  TF DRN +RGG+ R + P   G++REFRVVRDN
Sbjct: 84   KKENTSYRGSVDSRKHSENFGQGMRPHTFSDRNAQRGGYTRTASPGNRGINREFRVVRDN 143

Query: 937  RVNQNTSRDIKPSSLQSSSSANVEVFQNVPAKSPTGNLTDQEHLAARNSEEHKSSQATNR 758
            RVNQNTSR+ KP+ L  S+SA  +    V  K  TG  ++      + S+   S QA+N 
Sbjct: 144  RVNQNTSREPKPALLHGSTSAKEQGSGVVTEKGSTGISSN-----LKPSDARSSHQASNG 198

Query: 757  PSHSTSAQVQEVYSSGAHRKGLFEDTWSKVPSSVSDLTQGLKPRNSHPSSATLASSNSEV 578
            P  S     ++  SS   RK + E+  S   ++ +   Q  K  NS   +A  ASSN  V
Sbjct: 199  PIDSEPRHNRDANSSVGDRKVVSEEKRSVASNATTSRVQVAKSNNSQQHNALQASSNPVV 258

Query: 577  GVYSSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXXXXXX 398
            GVYSS +DPVHVPSPDSRSSG VGAIKREVGVVG RRQ  EN+ K               
Sbjct: 259  GVYSSSTDPVHVPSPDSRSSGVVGAIKREVGVVGGRRQSFENAVKDLSSSNSFSESFRPF 318

Query: 397  XXXXXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAH-KLMGHQKA 221
                                     T +   M +V  +RSF+ NQYN++ H + +GH KA
Sbjct: 319  TAISKTDQVSQ--------------TAAIEPMPSVPVNRSFLNNQYNNRPHQQAVGHPKA 364

Query: 220  MQPNMEWXXXXXXXXXXXXSGVIGTAVTLISPPTNNPTDSIMEADHLQGKFSQLNITENQ 41
             Q N EW             GVIGT     SPPT+N  +  ++A +LQ KFS++NI ENQ
Sbjct: 365  SQHNKEWKPKSSQKSSVTSPGVIGTPTKSSSPPTDNSKNMELDAANLQDKFSRINIHENQ 424

Query: 40   HVIIPQHLRVPE 5
            +VII QH+RVPE
Sbjct: 425  NVIIAQHIRVPE 436


>gb|EXB29673.1| hypothetical protein L484_013447 [Morus notabilis]
          Length = 854

 Score =  337 bits (863), Expect = 9e-90
 Identities = 207/444 (46%), Positives = 263/444 (59%), Gaps = 10/444 (2%)
 Frame = -3

Query: 1306 SSRLDAGTPNISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHE 1127
            +SR+D G   +SA VRKTIQSIKEIV NHS+ DIY+ LKE+NMDPNETAQKLLNQDPFHE
Sbjct: 4    ASRIDGGPQILSAGVRKTIQSIKEIVGNHSDIDIYLALKETNMDPNETAQKLLNQDPFHE 63

Query: 1126 VRRRRDKKKENMSYMASVEQRRQTE-HTQVVKSQTFPDRNVRRGGFVRNSLP-------G 971
            VRR+RDKKKE+    +S + R  +E   Q  K  TF DRN RRGG+ RNSLP       G
Sbjct: 64   VRRKRDKKKESAGNDSSTDPRGHSEVKGQGSKVNTFSDRNARRGGYARNSLPDRIMLHAG 123

Query: 970  VSREFRVVRDNRVNQNTSRDIKPSSLQSSSSANVEVFQNVPAKSPTGNLTDQEHLAARNS 791
            VSREFRVVRDNRVN++ +R+ KP+   S+S      F+N+  K  TG+   ++  A++N 
Sbjct: 124  VSREFRVVRDNRVNRSLNREAKPA---SASPTPPSTFENISGKGSTGSSNSEKPTASKN- 179

Query: 790  EEHKSSQATNRPSHSTSAQVQEVYSSGAHRKGLFEDTWSKVPSSVSDLTQGLKPRNSHPS 611
                SSQ    PS S      ++ S+G  RK + E+      SSV+   Q  K  N+   
Sbjct: 180  ----SSQGLYGPSDSHLRIAHDIESTGLVRKEVSEEK-RVTFSSVASRVQAGKANNARSQ 234

Query: 610  SATLASSNSEVGVYSSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXX 431
            SA +ASS+S +GVYSS +DPVHVPSPDSRSSG+VGAIKREVGVVGVRRQ S+NS      
Sbjct: 235  SAMVASSSSAIGVYSSSTDPVHVPSPDSRSSGSVGAIKREVGVVGVRRQSSDNSKSSVPS 294

Query: 430  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSK 251
                                                  SE+++ +VS SRS + + Y+++
Sbjct: 295  SSFSNSLLGGEGSAETLQSFSTISKNDEVG------QASESILPSVSVSRSLLSSHYSNR 348

Query: 250  A--HKLMGHQKAMQPNMEWXXXXXXXXXXXXSGVIGTAVTLISPPTNNPTDSIMEADHLQ 77
                + +GHQKA QPN EW             GVIGT    +SPP +N   S  E   + 
Sbjct: 349  QQHQQPVGHQKASQPNKEWKPKSSQKPSLNNPGVIGTPTKSVSPPAHNSEVSESEPAKVL 408

Query: 76   GKFSQLNITENQHVIIPQHLRVPE 5
             K S++NI ENQ+VII QH+RVPE
Sbjct: 409  EKLSRVNIHENQNVIIAQHIRVPE 432


>ref|XP_007135474.1| hypothetical protein PHAVU_010G132600g [Phaseolus vulgaris]
            gi|561008519|gb|ESW07468.1| hypothetical protein
            PHAVU_010G132600g [Phaseolus vulgaris]
          Length = 864

 Score =  331 bits (848), Expect = 5e-88
 Identities = 209/437 (47%), Positives = 256/437 (58%), Gaps = 9/437 (2%)
 Frame = -3

Query: 1288 GTPNISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEVRRRRD 1109
            GT  +SA VRKTIQSIKEIV NHS+ADIYV LKE+NMDPNET QKLLNQDPFHEV+RRRD
Sbjct: 12   GTHLLSARVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQDPFHEVKRRRD 71

Query: 1108 KKKE--NMSYMASVEQRRQTEHT--QVVKSQTFPDRNVRRGGFVRNSLPGVSREFRVVRD 941
            +KKE  N+    S + RR +E+   Q VK  T  +RNVRR  + RN+LPG+SREFRVVRD
Sbjct: 72   RKKEPQNVGNNGSADSRRPSENNSGQGVKFHTPSERNVRRANYSRNTLPGISREFRVVRD 131

Query: 940  NRVNQNTSRDIKPSSLQSSSSANVEVFQNVPAKSPTGNLTDQEHLAARNSEEHKSSQATN 761
            NRVN    +++KP S Q  +SA+ E+  N+  K   G+     H   R+S    SSQA N
Sbjct: 132  NRVNY-IYKEVKPLSQQHLASASEELNVNLSEK---GSSASTSH---RSSGSRNSSQALN 184

Query: 760  RPSHSTSAQVQEVYSSGAHRKGLFEDTWSKVPSSVS---DLTQGLKPRNSHPSSATLASS 590
             PS S +   ++   +   RK   ED      S +S   +  Q +KP + H + A++ASS
Sbjct: 185  GPSDSFARYPKDAVPNIVDRKIASEDKDKDKQSMISNAAERVQPIKPNHIHQNPASVASS 244

Query: 589  NSEVGVYSSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXX 410
            +S VGVYSS +DPVHVPSPDSRSS  VGAI+REVGVVGVRRQPS+N  K           
Sbjct: 245  SSAVGVYSSSTDPVHVPSPDSRSSSVVGAIRREVGVVGVRRQPSDNKVKQ----SFAPSS 300

Query: 409  XXXXXXXXXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAH-KLMG 233
                                       Q  V+E  +  V  SR  V NQYN + H +L+G
Sbjct: 301  SYVAGKDGTSADSFQPVGAVLKTEQFSQTKVTEPSLSGVPVSRPSVNNQYNGRPHQQLVG 360

Query: 232  HQKAMQPNMEWXXXXXXXXXXXXSGVIGT-AVTLISPPTNNPTDSIMEADHLQGKFSQLN 56
            HQ+  Q N EW             GVIGT      SPP  N  D   +A  LQ K SQLN
Sbjct: 361  HQRVSQQNKEWKPKSSQKPNSNNPGVIGTPKKAAASPPAENSVDIESDAVELQDKLSQLN 420

Query: 55   ITENQHVIIPQHLRVPE 5
            I ENQ+VII QH++VPE
Sbjct: 421  IYENQNVIIAQHIQVPE 437


>ref|XP_003528451.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max]
          Length = 863

 Score =  331 bits (848), Expect = 5e-88
 Identities = 205/440 (46%), Positives = 262/440 (59%), Gaps = 12/440 (2%)
 Frame = -3

Query: 1288 GTPNISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEVRRRRD 1109
            GT  +SA VRKTIQSIKEIV NHS+ADIYV LKE+NMDPNET QKLLNQDPFHEV+RRRD
Sbjct: 12   GTHLLSARVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQDPFHEVKRRRD 71

Query: 1108 KKKENMSY----MASVEQRRQTEHT--QVVKSQTFPDRNVRRGGFVRNSLPGVSREFRVV 947
            +KKE  +       S + RR +E+   Q +K     +RNVRR  + RN+LPG+S+EFRVV
Sbjct: 72   RKKETQNVGNKGQPSADSRRSSENNSGQGMKFNAPSERNVRRTNYSRNTLPGISKEFRVV 131

Query: 946  RDNRVNQNTSRDIKPSSLQSSSSANVEVFQNVPAKSPTGNLTDQEHLAARNSEEHKSSQA 767
            RDNRVN +  +++KP + Q S+SA  ++  N P K   G+ T   H   R+S    SS A
Sbjct: 132  RDNRVN-HIYKEVKPLTQQHSTSATEQLNVNTPDK---GSSTSTNH---RSSGSRNSSLA 184

Query: 766  TNRPSHSTSAQVQEVYSSGAHRKGLFEDTWSK-VPSSVSDLTQGLKPRNSHPSSATLASS 590
            +N PS S +  +++   +   RK   ED   + + S+ +   Q +KP N+H +SA++AS+
Sbjct: 185  SNGPSDSHARYLKDAVPNIIDRKIASEDKDKQGMISNAAGRVQPIKPNNAHQNSASVAST 244

Query: 589  NSEVGVYSSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXX 410
            +S VGVYSS +DPVHVPSPDSRSSG VGAI+REVGVVGVRRQ S+N AK           
Sbjct: 245  SSAVGVYSSSTDPVHVPSPDSRSSGVVGAIRREVGVVGVRRQSSDNKAKQ----SFAPSI 300

Query: 409  XXXXXXXXXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAH-KLMG 233
                                       Q  V+E  +  +  SR  + NQYN++ H +L+G
Sbjct: 301  SYVVGKDGTSADSFQSVGAVSKTEQFSQTNVTEPSLSGMPVSRPSLNNQYNNRPHQQLVG 360

Query: 232  HQKAMQPNMEWXXXXXXXXXXXXSGVIGT----AVTLISPPTNNPTDSIMEADHLQGKFS 65
            HQ+  Q N EW             GVIGT    AV   SPP  N  D       LQ K S
Sbjct: 361  HQRVSQQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENSGDIESNTTELQDKLS 420

Query: 64   QLNITENQHVIIPQHLRVPE 5
            Q+NI ENQ+VII QH+RVPE
Sbjct: 421  QVNIYENQNVIIAQHIRVPE 440


>ref|XP_006583149.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Glycine max]
          Length = 830

 Score =  325 bits (833), Expect = 3e-86
 Identities = 203/440 (46%), Positives = 259/440 (58%), Gaps = 12/440 (2%)
 Frame = -3

Query: 1288 GTPNISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEVRRRRD 1109
            GT  +SA VRKTIQSIKEIV NHS+ADIYV LKE+NMDPNET QKLLNQDPFHEV+RRRD
Sbjct: 12   GTHLLSARVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQDPFHEVKRRRD 71

Query: 1108 KKKENMSY----MASVEQRRQTEHT--QVVKSQTFPDRNVRRGGFVRNSLPGVSREFRVV 947
            +KKE  +       S + RR +E+   Q +K     +RNVRR  + RN+LPG+S+EFRVV
Sbjct: 72   RKKETQNVGNKGQPSADSRRSSENNSGQGMKFNAPSERNVRRTNYSRNTLPGISKEFRVV 131

Query: 946  RDNRVNQNTSRDIKPSSLQSSSSANVEVFQNVPAKSPTGNLTDQEHLAARNSEEHKSSQA 767
            RDNRVN +  +++KP + Q S+SA  ++  N P K   G+ T   H   R+S    SS A
Sbjct: 132  RDNRVN-HIYKEVKPLTQQHSTSATEQLNVNTPDK---GSSTSTNH---RSSGSRNSSLA 184

Query: 766  TNRPSHSTSAQVQEVYSSGAHRKGLFEDTWSK-VPSSVSDLTQGLKPRNSHPSSATLASS 590
            +N PS S +  +++   +   RK   ED   + + S+ +   Q +KP N+H +SA++AS+
Sbjct: 185  SNGPSDSHARYLKDAVPNIIDRKIASEDKDKQGMISNAAGRVQPIKPNNAHQNSASVAST 244

Query: 589  NSEVGVYSSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXX 410
            +S VGVYSS +DPVHVPSPDSRSSG VGAI+REVGVVGVRRQ S+N AK           
Sbjct: 245  SSAVGVYSSSTDPVHVPSPDSRSSGVVGAIRREVGVVGVRRQSSDNKAKQ---------- 294

Query: 409  XXXXXXXXXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAH-KLMG 233
                                           S + +     SR  + NQYN++ H +L+G
Sbjct: 295  ---------------------------SFAPSISYVVGKDVSRPSLNNQYNNRPHQQLVG 327

Query: 232  HQKAMQPNMEWXXXXXXXXXXXXSGVIGT----AVTLISPPTNNPTDSIMEADHLQGKFS 65
            HQ+  Q N EW             GVIGT    AV   SPP  N  D       LQ K S
Sbjct: 328  HQRVSQQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENSGDIESNTTELQDKLS 387

Query: 64   QLNITENQHVIIPQHLRVPE 5
            Q+NI ENQ+VII QH+RVPE
Sbjct: 388  QVNIYENQNVIIAQHIRVPE 407


>ref|XP_006583148.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Glycine max]
          Length = 855

 Score =  325 bits (833), Expect = 3e-86
 Identities = 201/440 (45%), Positives = 258/440 (58%), Gaps = 12/440 (2%)
 Frame = -3

Query: 1288 GTPNISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEVRRRRD 1109
            GT  +SA VRKTIQSIKEIV NHS+ADIYV LKE+NMDPNET QKLLNQDPFHEV+RRRD
Sbjct: 12   GTHLLSARVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQDPFHEVKRRRD 71

Query: 1108 KKKENMSY----MASVEQRRQTEHT--QVVKSQTFPDRNVRRGGFVRNSLPGVSREFRVV 947
            +KKE  +       S + RR +E+   Q +K     +RNVRR  + RN+LPG+S+EFRVV
Sbjct: 72   RKKETQNVGNKGQPSADSRRSSENNSGQGMKFNAPSERNVRRTNYSRNTLPGISKEFRVV 131

Query: 946  RDNRVNQNTSRDIKPSSLQSSSSANVEVFQNVPAKSPTGNLTDQEHLAARNSEEHKSSQA 767
            RDNRVN +  +++KP + Q S+SA  ++  N P K  +G+                SS A
Sbjct: 132  RDNRVN-HIYKEVKPLTQQHSTSATEQLNVNTPDKGSSGS--------------RNSSLA 176

Query: 766  TNRPSHSTSAQVQEVYSSGAHRKGLFEDTWSK-VPSSVSDLTQGLKPRNSHPSSATLASS 590
            +N PS S +  +++   +   RK   ED   + + S+ +   Q +KP N+H +SA++AS+
Sbjct: 177  SNGPSDSHARYLKDAVPNIIDRKIASEDKDKQGMISNAAGRVQPIKPNNAHQNSASVAST 236

Query: 589  NSEVGVYSSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXX 410
            +S VGVYSS +DPVHVPSPDSRSSG VGAI+REVGVVGVRRQ S+N AK           
Sbjct: 237  SSAVGVYSSSTDPVHVPSPDSRSSGVVGAIRREVGVVGVRRQSSDNKAKQ----SFAPSI 292

Query: 409  XXXXXXXXXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAH-KLMG 233
                                       Q  V+E  +  +  SR  + NQYN++ H +L+G
Sbjct: 293  SYVVGKDGTSADSFQSVGAVSKTEQFSQTNVTEPSLSGMPVSRPSLNNQYNNRPHQQLVG 352

Query: 232  HQKAMQPNMEWXXXXXXXXXXXXSGVIGT----AVTLISPPTNNPTDSIMEADHLQGKFS 65
            HQ+  Q N EW             GVIGT    AV   SPP  N  D       LQ K S
Sbjct: 353  HQRVSQQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENSGDIESNTTELQDKLS 412

Query: 64   QLNITENQHVIIPQHLRVPE 5
            Q+NI ENQ+VII QH+RVPE
Sbjct: 413  QVNIYENQNVIIAQHIRVPE 432


>ref|XP_002304144.2| hypothetical protein POPTR_0003s06200g [Populus trichocarpa]
            gi|550342535|gb|EEE79123.2| hypothetical protein
            POPTR_0003s06200g [Populus trichocarpa]
          Length = 858

 Score =  323 bits (828), Expect = 1e-85
 Identities = 205/441 (46%), Positives = 257/441 (58%), Gaps = 5/441 (1%)
 Frame = -3

Query: 1309 GSSRLDAGTPNISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFH 1130
            GS R    T  +SA VRK IQSIKEIV N S+ADIY+ LKE+NMDPNET QKLLNQDPFH
Sbjct: 21   GSGRQQQHT--LSARVRKIIQSIKEIVGNFSDADIYMVLKETNMDPNETVQKLLNQDPFH 78

Query: 1129 EVRRRRDKKKENMSYMASVEQRRQTEH-TQVVKSQTFPDRNVRRGGFVRNSL---PGVSR 962
            EV+R+RDKKKE+MSY  SV+ R+Q E+  Q ++ +TF DR  +RGG  R       GV+R
Sbjct: 79   EVKRKRDKKKESMSYRGSVDSRKQPENFDQGMRPRTFLDRYAQRGGHTRTDSIGNRGVNR 138

Query: 961  EFRVVRDNRVNQNTSRDIKPSSLQSSSSANVEVFQNVPAKSPTGNLTDQEHLAARNSEEH 782
            EFRVVRDNR+NQN +R+ KP+  Q S+SA  E    V  K   G      +L   N++  
Sbjct: 139  EFRVVRDNRINQNANREPKPALPQGSTSAK-EKGSGVTEKGSAG--ISNNNLKPSNAQ-- 193

Query: 781  KSSQATNRPSHSTSAQVQEVYSSGAHRKGLFEDTWSKVPSSVSDLTQGLKPRNSHPSSAT 602
             SSQ +N P++      ++  S    RK + E+  S   ++ +   Q +KP NS    A+
Sbjct: 194  SSSQTSNGPTYPEPRYNRDAKSRAGDRKVVSEEKRSTASNATTSRAQVVKPNNSQQHDAS 253

Query: 601  LASSNSEVGVYSSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXX 422
            LASSNS VGVYSS +DPVHVPSPDSRSSG VGAIKREVGVVG RRQ SEN+ K       
Sbjct: 254  LASSNSVVGVYSSSTDPVHVPSPDSRSSGVVGAIKREVGVVGGRRQ-SENAVKDLSSSNS 312

Query: 421  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAH- 245
                                             T     M +V  +RS + NQYNS+ H 
Sbjct: 313  FSESFHPLTAISNTDQVRQ--------------TAVIESMPSVPVNRSLLHNQYNSRPHQ 358

Query: 244  KLMGHQKAMQPNMEWXXXXXXXXXXXXSGVIGTAVTLISPPTNNPTDSIMEADHLQGKFS 65
            + +G+ KA Q N EW             GVIGT      PPT+N     + A +LQ KFS
Sbjct: 359  QTVGYPKASQHNKEWKPKSSQKSSITSPGVIGTPTKSSLPPTDNSKSMELNAANLQDKFS 418

Query: 64   QLNITENQHVIIPQHLRVPEA 2
            ++NI ENQ+VII QH+RVPE+
Sbjct: 419  RVNIHENQNVIIAQHIRVPES 439


>ref|XP_004163891.1| PREDICTED: uncharacterized protein LOC101226902 [Cucumis sativus]
          Length = 846

 Score =  319 bits (818), Expect = 1e-84
 Identities = 193/436 (44%), Positives = 261/436 (59%), Gaps = 4/436 (0%)
 Frame = -3

Query: 1300 RLDAGTPNISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEVR 1121
            R+D GT  + A VRKTIQSIKEIV NHS+ADIY TLKE+NMDPNETAQKLLNQDPF EV+
Sbjct: 6    RVDGGTHVLPARVRKTIQSIKEIVGNHSDADIYTTLKETNMDPNETAQKLLNQDPFREVK 65

Query: 1120 RRRDKKKENMSYMASVEQRRQTEHT-QVVKSQTFPDRNVRRGGFVRNSLPGVSREFRVVR 944
            RRRDKKKEN+ Y  S++ +R +E   Q  K  T  DRNVRRG + ++S PG+S+EFRVVR
Sbjct: 66   RRRDKKKENVGYKGSLDAQRNSEDVRQGTKVYTLSDRNVRRGAYAKSSWPGISKEFRVVR 125

Query: 943  DNRVNQNTSRDIKPSSLQSSSSANVEVFQNVPAK--SPTGNLTDQEHLAARNSEEHKSSQ 770
            DNRVN+N++R++KP+S   + S N EV  NV     +P G        A   S   + SQ
Sbjct: 126  DNRVNRNSNREVKPASSHLALSTN-EVSTNVSKSVITPRG--------AHGGSFGGRISQ 176

Query: 769  ATNRPSHSTSAQVQEVYSSGAHRKGLFEDTWSKVPSSVSDLTQGLKPRNSHPSSATLASS 590
             + R + S  +  ++ +S+G  +K L +D    + SS+ D+  G  P +S P S  LAS+
Sbjct: 177  VSFRKTDSHPSNPRDGHSTGMAQKELRDDVGVSMLSSIPDMHIG-NPNDSEPHSPVLASN 235

Query: 589  NSEVGVYSSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXX 410
             + VG+YSS +DPVHVPSPDSRSS  VGAIKREVG VGVRRQ  ++S             
Sbjct: 236  GAAVGLYSSSTDPVHVPSPDSRSSAPVGAIKREVGAVGVRRQLKDSSINQSSGPSVSLAN 295

Query: 409  XXXXXXXXXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAHK-LMG 233
                                          ++E+++  +  SR+ + NQ++S+ H+  MG
Sbjct: 296  SVSERDGSSDSFQPMSSTSKGEQLS----QITESVIPGLVGSRTSLNNQHSSRQHQPTMG 351

Query: 232  HQKAMQPNMEWXXXXXXXXXXXXSGVIGTAVTLISPPTNNPTDSIMEADHLQGKFSQLNI 53
            HQKA QPN EW             GVIGT  +    P +   +   EA ++Q K +++++
Sbjct: 352  HQKASQPNKEWKPKSSQKLSTGNPGVIGTP-SKSKAPADESKELHSEAANVQEKLARVDL 410

Query: 52   TENQHVIIPQHLRVPE 5
             ENQHVII +H+RVP+
Sbjct: 411  HENQHVIIAEHIRVPD 426


>ref|XP_004141213.1| PREDICTED: uncharacterized protein LOC101203238 [Cucumis sativus]
          Length = 740

 Score =  319 bits (818), Expect = 1e-84
 Identities = 193/436 (44%), Positives = 261/436 (59%), Gaps = 4/436 (0%)
 Frame = -3

Query: 1300 RLDAGTPNISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEVR 1121
            R+D GT  + A VRKTIQSIKEIV NHS+ADIY TLKE+NMDPNETAQKLLNQDPF EV+
Sbjct: 6    RVDGGTHVLPARVRKTIQSIKEIVGNHSDADIYTTLKETNMDPNETAQKLLNQDPFREVK 65

Query: 1120 RRRDKKKENMSYMASVEQRRQTEHT-QVVKSQTFPDRNVRRGGFVRNSLPGVSREFRVVR 944
            RRRDKKKEN+ Y  S++ +R +E   Q  K  T  DRNVRRG + ++S PG+S+EFRVVR
Sbjct: 66   RRRDKKKENVGYKGSLDAQRNSEDVRQGTKVYTLSDRNVRRGAYAKSSWPGISKEFRVVR 125

Query: 943  DNRVNQNTSRDIKPSSLQSSSSANVEVFQNVPAK--SPTGNLTDQEHLAARNSEEHKSSQ 770
            DNRVN+N++R++KP+S   + S N EV  NV     +P G        A   S   + SQ
Sbjct: 126  DNRVNRNSNREVKPASSHLALSTN-EVSTNVSKSVITPRG--------AHGGSFGGRISQ 176

Query: 769  ATNRPSHSTSAQVQEVYSSGAHRKGLFEDTWSKVPSSVSDLTQGLKPRNSHPSSATLASS 590
             + R + S  +  ++ +S+G  +K L +D    + SS+ D+  G  P +S P S  LAS+
Sbjct: 177  VSFRKTDSHPSNPRDGHSTGMAQKELRDDVGVSMLSSIPDMHIG-NPNDSEPHSPVLASN 235

Query: 589  NSEVGVYSSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXX 410
             + VG+YSS +DPVHVPSPDSRSS  VGAIKREVG VGVRRQ  ++S             
Sbjct: 236  GAAVGLYSSSTDPVHVPSPDSRSSAPVGAIKREVGAVGVRRQLKDSSINQSSGPSVSLAN 295

Query: 409  XXXXXXXXXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAHK-LMG 233
                                          ++E+++  +  SR+ + NQ++S+ H+  MG
Sbjct: 296  SVSERDGSSDSFQPMSSTSKGEQLS----QITESVIPGLVGSRTSLNNQHSSRQHQPTMG 351

Query: 232  HQKAMQPNMEWXXXXXXXXXXXXSGVIGTAVTLISPPTNNPTDSIMEADHLQGKFSQLNI 53
            HQKA QPN EW             GVIGT  +    P +   +   EA ++Q K +++++
Sbjct: 352  HQKASQPNKEWKPKSSQKLSTGNPGVIGTP-SKSKAPADESKELHSEAANVQEKLARVDL 410

Query: 52   TENQHVIIPQHLRVPE 5
             ENQHVII +H+RVP+
Sbjct: 411  HENQHVIIAEHIRVPD 426


>ref|XP_004510436.1| PREDICTED: flocculation protein FLO11-like [Cicer arietinum]
          Length = 889

 Score =  314 bits (805), Expect = 5e-83
 Identities = 205/469 (43%), Positives = 254/469 (54%), Gaps = 35/469 (7%)
 Frame = -3

Query: 1306 SSRLDAGTPN--ISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPF 1133
            SSR + GT    +SA VRKTIQSIKEIV NHS+ADIYV LKE+NMDPNET QKLLNQDPF
Sbjct: 4    SSRTEGGTGTHLLSAKVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQDPF 63

Query: 1132 HEVRRRRDKKKENMSY-------------------------------MASVEQRRQTEH- 1049
            HEV+RRRD+KKEN +                                  S E RR TE+ 
Sbjct: 64   HEVKRRRDRKKENQNVGNRGSGEPRRHSENGGQGMQFNNPSEHNVGNKGSGEPRRHTENG 123

Query: 1048 TQVVKSQTFPDRNVRRGGFVRNSLPGVSREFRVVRDNRVNQNTSRDIKPSSLQSSSSANV 869
             Q +   T  + NVRR  + RNS P  SREFRVVRDNRVN +  +++KP  LQ S+S   
Sbjct: 124  GQGMHFHTPAEHNVRRTNYSRNSTPSFSREFRVVRDNRVN-HIYKEVKPPLLQHSTSTTE 182

Query: 868  EVFQNVPAKSPTGNLTDQEHLAARNSEEHKSSQATNRPSHSTSAQVQEVYSSGAHRKGLF 689
            ++  N   KS +    +Q+   ARN + H      N PS S + Q ++  ++   +K   
Sbjct: 183  KLPINTSDKSSSAASNNQKSSGARNHQAH------NGPSVSHARQSKDAATNVGGKKTTS 236

Query: 688  EDTWSKVPSSVSDLTQGLKPRNSHPSSATLASSNSEVGVYSSFSDPVHVPSPDSRSSGTV 509
            ED      +S S   Q  KP NSH SS+T AS++S VGVYSS +DPVHVPSPDSRSSG V
Sbjct: 237  EDKQGTTSNS-SARVQPTKPNNSHHSSSTAASTSSVVGVYSSSTDPVHVPSPDSRSSGVV 295

Query: 508  GAIKREVGVVGVRRQPSENSAKHXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 329
            GAI+REVGVVGVRRQ S +                                         
Sbjct: 296  GAIRREVGVVGVRRQSSSDHKPKQLFASSSSHANSVTGKDGTSADSLQSVGAVSKTEQLS 355

Query: 328  QVTVSEAMMHNVSASRSFVGNQYNSKAH-KLMGHQKAMQPNMEWXXXXXXXXXXXXSGVI 152
            Q  V+E    ++S SR  + NQYN++ H +L+GHQ+  Q N EW             GVI
Sbjct: 356  QTAVTEPSFPSMSVSRPSLNNQYNNRPHQQLVGHQRVSQHNKEWKPKSSQKTNSNGPGVI 415

Query: 151  GTAVTLISPPTNNPTDSIMEADHLQGKFSQLNITENQHVIIPQHLRVPE 5
            GT    +S P  N  D   +   LQ K SQLN+ ENQ+VII QH+RVPE
Sbjct: 416  GTPKKSVSSPAENSEDIESDTAQLQDKRSQLNVYENQNVIIAQHIRVPE 464


>ref|XP_006598817.1| PREDICTED: putative uncharacterized protein DDB_G0277255-like
            [Glycine max]
          Length = 852

 Score =  303 bits (775), Expect = 1e-79
 Identities = 193/436 (44%), Positives = 249/436 (57%), Gaps = 12/436 (2%)
 Frame = -3

Query: 1276 ISAHVRKTIQSIKEIVENHSEADIYVTLKESNMDPNETAQKLLNQDPFHEVRRRRDKKKE 1097
            +SA VRKTIQSIKEIV NHS+ADIYV LKE+NMDPNET QKLLNQDPFHEV+RRRD+KKE
Sbjct: 18   LSARVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLNQDPFHEVKRRRDRKKE 77

Query: 1096 NMSY----MASVEQRRQTEHT--QVVKSQTFPDRNVRRGGFVRNSLPGVSREFRVVRDNR 935
              +       S + RR +E+   Q +K  T  +RNVRR  + R++ PG+SREFRVVRDNR
Sbjct: 78   TQNVGNRGQPSADSRRPSENNSGQGMKFHTHSERNVRRTNYSRSTFPGISREFRVVRDNR 137

Query: 934  VNQNTSRDIKPSSLQSSSSANVEVFQNVPAKSPTGNLTDQEHLAARNSEEHKSSQATNRP 755
            VN +  +++ P S Q S+S   ++  N+  K  +G+                SSQA+N P
Sbjct: 138  VN-HIYKEVTPLSQQHSTSVTEQLNVNISDKGSSGS--------------RNSSQASNGP 182

Query: 754  SHSTSAQVQEVYSSGAHRKGLFEDTWSK-VPSSVSDLTQGLKPRNSHPSSATLASSNSEV 578
            S S +    +       RK ++ED   + + S+ +   Q +KP + H +SA +AS++S V
Sbjct: 183  SDSHARYAPKTID----RKIVYEDKDKQGMISNAAGRVQPIKPNSVHQNSALVASTSSAV 238

Query: 577  GVYSSFSDPVHVPSPDSRSSGTVGAIKREVGVVGVRRQPSENSAKHXXXXXXXXXXXXXX 398
            GVYSS +DPVHVPSPDSRS G VGAI+REVG VGVRRQ S+N AK               
Sbjct: 239  GVYSSSTDPVHVPSPDSRSPGVVGAIRREVGFVGVRRQSSDNKAKQ----SFAPSSPHVV 294

Query: 397  XXXXXXXXXXXXXXXXXXXXXXXQVTVSEAMMHNVSASRSFVGNQYNSKAH-KLMGHQKA 221
                                   Q  V+E  +  +  SR  + NQ+N++ H +L+GHQ+ 
Sbjct: 295  GKDGTSADSFQSVGAVSKTEQFSQTNVTEPSLSGMPVSRPSLNNQHNNRPHQQLVGHQRV 354

Query: 220  MQPNMEW-XXXXXXXXXXXXSGVIGT---AVTLISPPTNNPTDSIMEADHLQGKFSQLNI 53
             Q N EW              GVIGT   A    SPP  N  D       LQ K SQ+NI
Sbjct: 355  SQQNKEWKPKSSQKPNCNNSPGVIGTPKKAAAAASPPAENSGDIESNTVELQDKLSQVNI 414

Query: 52   TENQHVIIPQHLRVPE 5
             ENQ+VII QH+RVPE
Sbjct: 415  YENQNVIIAQHIRVPE 430

