BLASTX nr result

ID: Cocculus23_contig00011973 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00011973
         (1955 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007024587.1| Uncharacterized protein isoform 4 [Theobroma...   400   e-108
ref|XP_007024585.1| Uncharacterized protein isoform 2 [Theobroma...   395   e-107
ref|XP_002274314.2| PREDICTED: uncharacterized protein LOC100248...   394   e-107
ref|XP_007024589.1| Uncharacterized protein isoform 6 [Theobroma...   388   e-105
ref|XP_007024584.1| Uncharacterized protein isoform 1 [Theobroma...   383   e-103
ref|XP_007024588.1| Uncharacterized protein isoform 5 [Theobroma...   383   e-103
ref|XP_002521347.1| conserved hypothetical protein [Ricinus comm...   377   e-101
emb|CAN69468.1| hypothetical protein VITISV_042555 [Vitis vinifera]   359   3e-96
emb|CBI35892.3| unnamed protein product [Vitis vinifera]              350   1e-93
ref|XP_002299597.2| hydroxyproline-rich glycoprotein [Populus tr...   347   1e-92
ref|XP_007135474.1| hypothetical protein PHAVU_010G132600g [Phas...   343   2e-91
ref|XP_003528451.1| PREDICTED: dentin sialophosphoprotein-like i...   341   8e-91
gb|EXB29673.1| hypothetical protein L484_013447 [Morus notabilis]     337   9e-90
ref|XP_006583148.1| PREDICTED: dentin sialophosphoprotein-like i...   336   2e-89
ref|XP_002304144.2| hypothetical protein POPTR_0003s06200g [Popu...   335   4e-89
ref|XP_006583149.1| PREDICTED: dentin sialophosphoprotein-like i...   333   2e-88
ref|XP_006426627.1| hypothetical protein CICLE_v10024871mg [Citr...   328   5e-87
ref|XP_006426626.1| hypothetical protein CICLE_v10024871mg [Citr...   328   5e-87
ref|XP_004163891.1| PREDICTED: uncharacterized protein LOC101226...   325   4e-86
ref|XP_004141213.1| PREDICTED: uncharacterized protein LOC101203...   325   4e-86

>ref|XP_007024587.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508779953|gb|EOY27209.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 849

 Score =  400 bits (1027), Expect = e-108
 Identities = 255/570 (44%), Positives = 331/570 (58%), Gaps = 13/570 (2%)
 Frame = -3

Query: 1674 VVSGSRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQ 1495
            +V+G+R++G     ++SA VRKTIQSIKEIVGNHS+ADIYV LK++NMDPNET QKLL+Q
Sbjct: 1    MVNGARIEG-----DISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQ 55

Query: 1494 DPFHEVRRKRDKKKENISYKASAEPRKQTEHA-QAAKSQTFPDRNVRRGSFVRNALPGIS 1318
            D FHEVRRKRD+KKE+I YK S + RK++E+  Q  K + +P+R  RRGS+ RN LPG++
Sbjct: 56   DTFHEVRRKRDRKKESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPGVN 115

Query: 1317 REFRVVRDNRVNHNASRDINAASFQ-SSSANVEVIPHVPAKSPKGILIDQDHLAARNSGD 1141
            REFRVVRDNRVN NA++D+     Q S+SAN +V  +V  K   G   +Q   ++R+   
Sbjct: 116  REFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRSL-- 173

Query: 1140 HKPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSASLTQGLKPRNFQPSSAT 961
               S   N PS S     +DA S G  RK + E+    +P++   +Q +KP N Q  +AT
Sbjct: 174  ---SQTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAAT 230

Query: 960  LAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVKHXXXXX 781
             +                           GAVGAIKREVGVVGVRRQPSENAVK      
Sbjct: 231  QSSSSSVVGVYSSSTDPVHVPSPDSRSS-GAVGAIKREVGVVGVRRQPSENAVKDSSGSS 289

Query: 780  XXXXXXXXXXXSNQQS----PSAAKSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSKAHQ- 616
                       ++ ++    PS +++ QL               SR FL NQY S+ +Q 
Sbjct: 290  GSLSNSLVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQ 349

Query: 615  VVGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXANLISPPSNHTD-TKMESDHLQRKFSG 439
             +GHQKA Q N EWKPK +QK                SPP++       E+  LQ KFS 
Sbjct: 350  ALGHQKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQ 409

Query: 438  LGITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSNGESSA- 262
            + I EN++VII QH++VPE DR  LTFGSF VEFDS +NF    Q  G+AE SNGES+A 
Sbjct: 410  VNIYENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAAS 469

Query: 261  -SVSAPLTSNDDA---NEVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXXXRNLENYV 94
             SVSAP TS+DDA     +++L  Q          S  ASEH           +NL++Y 
Sbjct: 470  LSVSAPDTSSDDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYA 529

Query: 93   DIGLVRNDSPSYTSVEPQQQQQDVSGLPSF 4
            DIGLV+++SPSY   E  Q+QQD   LPSF
Sbjct: 530  DIGLVQDNSPSYAPSE-SQKQQDPPELPSF 558


>ref|XP_007024585.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508779951|gb|EOY27207.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 852

 Score =  395 bits (1014), Expect = e-107
 Identities = 255/572 (44%), Positives = 331/572 (57%), Gaps = 15/572 (2%)
 Frame = -3

Query: 1674 VVSGSRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQ 1495
            +V+G+R++G     ++SA VRKTIQSIKEIVGNHS+ADIYV LK++NMDPNET QKLL+Q
Sbjct: 1    MVNGARIEG-----DISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQ 55

Query: 1494 DPFHEVRRKRDKKKENISYKASAEPRKQTEHA-QAAKSQTFPDRNVRRGSFVRNALP--G 1324
            D FHEVRRKRD+KKE+I YK S + RK++E+  Q  K + +P+R  RRGS+ RN LP  G
Sbjct: 56   DTFHEVRRKRDRKKESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPDAG 115

Query: 1323 ISREFRVVRDNRVNHNASRDINAASFQ-SSSANVEVIPHVPAKSPKGILIDQDHLAARNS 1147
            ++REFRVVRDNRVN NA++D+     Q S+SAN +V  +V  K   G   +Q   ++R+ 
Sbjct: 116  VNREFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRSL 175

Query: 1146 GDHKPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSASLTQGLKPRNFQPSS 967
                 S   N PS S     +DA S G  RK + E+    +P++   +Q +KP N Q  +
Sbjct: 176  -----SQTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHA 230

Query: 966  ATLAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVKHXXX 787
            AT +                           GAVGAIKREVGVVGVRRQPSENAVK    
Sbjct: 231  ATQSSSSSVVGVYSSSTDPVHVPSPDSRSS-GAVGAIKREVGVVGVRRQPSENAVKDSSG 289

Query: 786  XXXXXXXXXXXXXSNQQS----PSAAKSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSKAH 619
                         ++ ++    PS +++ QL               SR FL NQY S+ +
Sbjct: 290  SSGSLSNSLVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQN 349

Query: 618  Q-VVGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXANLISPPSNHTD-TKMESDHLQRKF 445
            Q  +GHQKA Q N EWKPK +QK                SPP++       E+  LQ KF
Sbjct: 350  QQALGHQKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKF 409

Query: 444  SGLGITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSNGESS 265
            S + I EN++VII QH++VPE DR  LTFGSF VEFDS +NF    Q  G+AE SNGES+
Sbjct: 410  SQVNIYENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESA 469

Query: 264  A--SVSAPLTSNDDA---NEVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXXXRNLEN 100
            A  SVSAP TS+DDA     +++L  Q          S  ASEH           +NL++
Sbjct: 470  ASLSVSAPDTSSDDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDS 529

Query: 99   YVDIGLVRNDSPSYTSVEPQQQQQDVSGLPSF 4
            Y DIGLV+++SPSY   E  Q+QQD   LPSF
Sbjct: 530  YADIGLVQDNSPSYAPSE-SQKQQDPPELPSF 560


>ref|XP_002274314.2| PREDICTED: uncharacterized protein LOC100248075 [Vitis vinifera]
          Length = 860

 Score =  394 bits (1013), Expect = e-107
 Identities = 263/578 (45%), Positives = 325/578 (56%), Gaps = 21/578 (3%)
 Frame = -3

Query: 1674 VVSGSRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQ 1495
            +VSGSR++GGT    + ARVRKTIQSIKEIVGNHS+ADIYVTL+++NMDPNET QKLL Q
Sbjct: 1    MVSGSRMEGGTQI--LPARVRKTIQSIKEIVGNHSDADIYVTLRETNMDPNETTQKLLYQ 58

Query: 1494 DPFHEVRRKRDKKKENISYKASAEPRKQTEHAQAAKSQTFPDRNVRRGSFVRNAL----- 1330
            DPFHEV+RKRDKKKE+  YK   EPR   E+    K ++FPDRNVRRG + R+ L     
Sbjct: 59   DPFHEVKRKRDKKKESTGYKRPTEPRIYIENVGQGKFRSFPDRNVRRGGYSRSTLMVRIL 118

Query: 1329 --PGISREFRVVRDNRVNHNASRDINAASFQ-SSSANVEVIPHVPAKSPKGILIDQDHLA 1159
               GI REFRVVRDNRVN N +RD+   S Q ++S N +VI ++   S KG      +  
Sbjct: 119  LDAGIGREFRVVRDNRVNQNTNRDMKPVSPQLATSVNEQVISNI---SEKGNSTGTSNNQ 175

Query: 1158 ARNSGDHKPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSASLTQGLKPRNF 979
              +SG  + S  +N P+ + PG+ QDA S G++RK L E+  A +P++ S  Q +KP + 
Sbjct: 176  KPSSG-RQSSQSLNGPTDARPGIPQDANSSGSNRKELLEERQATIPNAVSRVQAVKPNDS 234

Query: 978  QPSSATLAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVK 799
            QP SA+LA                          S  VGAIKREVGVVGVRRQ +EN+VK
Sbjct: 235  QPYSASLA-SNSSVVGVYSSSSDPVHVPSPDSRSSAIVGAIKREVGVVGVRRQSTENSVK 293

Query: 798  HXXXXXXXXXXXXXXXXSNQQSPSA---------AKSTQLGQXXXXXXXXXXXXXSRPFL 646
            H                  + SPS           KS Q  Q             +R FL
Sbjct: 294  H---SSAPSSSLPSSLLGRENSPSTEPFRPFNAIPKSDQPRQTTVPDHVIPSMPVNRSFL 350

Query: 645  GNQYSSKAH-QVVGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXANLISP-PSNHTDTKM 472
            GNQY S+ H Q VGHQKA Q N EWKPKS+QK            A  +SP   N  D + 
Sbjct: 351  GNQYGSRPHQQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLES 410

Query: 471  ESDHLQRKFSGLGITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGI 292
            E+  LQ K S   I+ENQ+VII QH++VPE DR  LTFGSF  +      FAS  Q  G 
Sbjct: 411  ETAKLQDKLSQASISENQNVIIAQHIRVPETDRCRLTFGSFGAD------FASGFQAVGN 464

Query: 291  AEQSNGESSA--SVSAPLTSNDDANEVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXX 118
            A++ + E SA  SVS P +S+DD ++   L  Q          S  ASEH          
Sbjct: 465  ADEPSAEPSASLSVSPPESSSDDGSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSS 524

Query: 117  XRNLENYVDIGLVRNDSPSYTSVEPQQQQQDVSGLPSF 4
             +NLENY DIGLVR  SPSYT    QQQ++ V  LPSF
Sbjct: 525  PQNLENYADIGLVRESSPSYTPESQQQQERHV--LPSF 560


>ref|XP_007024589.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508779955|gb|EOY27211.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 839

 Score =  388 bits (996), Expect = e-105
 Identities = 247/565 (43%), Positives = 322/565 (56%), Gaps = 8/565 (1%)
 Frame = -3

Query: 1674 VVSGSRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQ 1495
            +V+G+R++G     ++SA VRKTIQSIKEIVGNHS+ADIYV LK++NMDPNET QKLL+Q
Sbjct: 1    MVNGARIEG-----DISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQ 55

Query: 1494 DPFHEVRRKRDKKKENISYKASAEPRKQTEHA-QAAKSQTFPDRNVRRGSFVRNALPGIS 1318
            D FHEVRRKRD+KKE+I YK S + RK++E+  Q  K + +P+R  RRGS+ RN LPG++
Sbjct: 56   DTFHEVRRKRDRKKESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPGVN 115

Query: 1317 REFRVVRDNRVNHNASRDINAASFQ-SSSANVEVIPHVPAKSPKGILIDQDHLAARNSGD 1141
            REFRVVRDNRVN NA++D+     Q S+SAN +V  +V  K   G   +Q   ++R+   
Sbjct: 116  REFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRSL-- 173

Query: 1140 HKPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSASLTQGLKPRNFQPSSAT 961
               S   N PS S     +DA S G  RK + E+    +P++   +Q +KP N Q  +AT
Sbjct: 174  ---SQTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAAT 230

Query: 960  LAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVKHXXXXX 781
             +                           GAVGAIKREVGVVGVRRQPSENAVK      
Sbjct: 231  QSSSSSVVGVYSSSTDPVHVPSPDSRSS-GAVGAIKREVGVVGVRRQPSENAVKDSSGSS 289

Query: 780  XXXXXXXXXXXSNQQS----PSAAKSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSKAHQ- 616
                       ++ ++    PS +++ QL               SR FL NQY S+ +Q 
Sbjct: 290  GSLSNSLVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQ 349

Query: 615  VVGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXANLISPPSNHTD-TKMESDHLQRKFSG 439
             +GHQKA Q N EWKPK +QK                SPP++       E+  LQ KFS 
Sbjct: 350  ALGHQKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQ 409

Query: 438  LGITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSNGESSAS 259
            + I EN++VII QH++VPE DR  LTFGSF VEFDS +NF    Q  G+AE SNGES+AS
Sbjct: 410  VNIYENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAAS 469

Query: 258  VSAPLTSNDDANEVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXXXRNLENYVDIGLV 79
              A          +++L  Q          S  ASEH           +NL++Y DIGLV
Sbjct: 470  DDAA-----GGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLV 524

Query: 78   RNDSPSYTSVEPQQQQQDVSGLPSF 4
            +++SPSY   E  Q+QQD   LPSF
Sbjct: 525  QDNSPSYAPSE-SQKQQDPPELPSF 548


>ref|XP_007024584.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508779950|gb|EOY27206.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 883

 Score =  383 bits (984), Expect = e-103
 Identities = 255/603 (42%), Positives = 331/603 (54%), Gaps = 46/603 (7%)
 Frame = -3

Query: 1674 VVSGSRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQ 1495
            +V+G+R++G     ++SA VRKTIQSIKEIVGNHS+ADIYV LK++NMDPNET QKLL+Q
Sbjct: 1    MVNGARIEG-----DISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQ 55

Query: 1494 DPFHEVRRKRDKKKENISYKASAEPRKQTEHA-QAAKSQTFPDRNVRRGSFVRNALPGIS 1318
            D FHEVRRKRD+KKE+I YK S + RK++E+  Q  K + +P+R  RRGS+ RN LPG++
Sbjct: 56   DTFHEVRRKRDRKKESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPGVN 115

Query: 1317 REFRVVRDNRVNHNASRDINAASFQ-SSSANVEVIPHVPAKSPKGILIDQDHLAARNSGD 1141
            REFRVVRDNRVN NA++D+     Q S+SAN +V  +V  K   G   +Q   ++R+   
Sbjct: 116  REFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRSL-- 173

Query: 1140 HKPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSASLTQGLKPRNFQPSSAT 961
               S   N PS S     +DA S G  RK + E+    +P++   +Q +KP N Q  +AT
Sbjct: 174  ---SQTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAAT 230

Query: 960  LAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVKHXXXXX 781
             +                           GAVGAIKREVGVVGVRRQPSENAVK      
Sbjct: 231  QSSSSSVVGVYSSSTDPVHVPSPDSRSS-GAVGAIKREVGVVGVRRQPSENAVKDSSGSS 289

Query: 780  XXXXXXXXXXXSNQQS----PSAAKSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSKAHQ- 616
                       ++ ++    PS +++ QL               SR FL NQY S+ +Q 
Sbjct: 290  GSLSNSLVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQ 349

Query: 615  VVGHQKAA---------------------------QSNMEWKPKSTQKXXXXXXXXXXXX 517
             +GHQK A                           Q N EWKPK +QK            
Sbjct: 350  ALGHQKEASYCSAFHPFIDQISLWESLSCIFDAANQHNKEWKPKLSQKSSVNNPGVIGTP 409

Query: 516  ANLISPPSNHTD-TKMESDHLQRKFSGLGITENQHVIIPQHLQVPEADRTWLTFGSFDVE 340
                SPP++       E+  LQ KFS + I EN++VII QH++VPE DR  LTFGSF VE
Sbjct: 410  KKSASPPADDAKGLDSETAKLQDKFSQVNIYENENVIIAQHIRVPENDRCRLTFGSFGVE 469

Query: 339  FDSNKNFASRIQGHGIAEQSNGESSA--------SVSAPLTSNDDA---NEVDLLHGQXX 193
            FDS +NF    Q  G+AE SNGES+A        SVSAP TS+DDA     +++L  Q  
Sbjct: 470  FDSLRNFVPGFQATGVAEDSNGESAARLVFSPNLSVSAPDTSSDDAAGGKPIEILDDQIG 529

Query: 192  XXXXXXXXSDAASEHXXXXXXXXXXXRNLENYVDIGLVRNDSPSYTSVEPQQQQQDVSGL 13
                    S  ASEH           +NL++Y DIGLV+++SPSY   E  Q+QQD   L
Sbjct: 530  NSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQDNSPSYAPSE-SQKQQDPPEL 588

Query: 12   PSF 4
            PSF
Sbjct: 589  PSF 591


>ref|XP_007024588.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508779954|gb|EOY27210.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 842

 Score =  383 bits (983), Expect = e-103
 Identities = 247/567 (43%), Positives = 322/567 (56%), Gaps = 10/567 (1%)
 Frame = -3

Query: 1674 VVSGSRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQ 1495
            +V+G+R++G     ++SA VRKTIQSIKEIVGNHS+ADIYV LK++NMDPNET QKLL+Q
Sbjct: 1    MVNGARIEG-----DISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQ 55

Query: 1494 DPFHEVRRKRDKKKENISYKASAEPRKQTEHA-QAAKSQTFPDRNVRRGSFVRNALP--G 1324
            D FHEVRRKRD+KKE+I YK S + RK++E+  Q  K + +P+R  RRGS+ RN LP  G
Sbjct: 56   DTFHEVRRKRDRKKESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPDAG 115

Query: 1323 ISREFRVVRDNRVNHNASRDINAASFQ-SSSANVEVIPHVPAKSPKGILIDQDHLAARNS 1147
            ++REFRVVRDNRVN NA++D+     Q S+SAN +V  +V  K   G   +Q   ++R+ 
Sbjct: 116  VNREFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRSL 175

Query: 1146 GDHKPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSASLTQGLKPRNFQPSS 967
                 S   N PS S     +DA S G  RK + E+    +P++   +Q +KP N Q  +
Sbjct: 176  -----SQTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHA 230

Query: 966  ATLAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVKHXXX 787
            AT +                           GAVGAIKREVGVVGVRRQPSENAVK    
Sbjct: 231  ATQSSSSSVVGVYSSSTDPVHVPSPDSRSS-GAVGAIKREVGVVGVRRQPSENAVKDSSG 289

Query: 786  XXXXXXXXXXXXXSNQQS----PSAAKSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSKAH 619
                         ++ ++    PS +++ QL               SR FL NQY S+ +
Sbjct: 290  SSGSLSNSLVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQN 349

Query: 618  Q-VVGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXANLISPPSNHTD-TKMESDHLQRKF 445
            Q  +GHQKA Q N EWKPK +QK                SPP++       E+  LQ KF
Sbjct: 350  QQALGHQKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKF 409

Query: 444  SGLGITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSNGESS 265
            S + I EN++VII QH++VPE DR  LTFGSF VEFDS +NF    Q  G+AE SNGES+
Sbjct: 410  SQVNIYENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESA 469

Query: 264  ASVSAPLTSNDDANEVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXXXRNLENYVDIG 85
            AS  A          +++L  Q          S  ASEH           +NL++Y DIG
Sbjct: 470  ASDDAA-----GGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIG 524

Query: 84   LVRNDSPSYTSVEPQQQQQDVSGLPSF 4
            LV+++SPSY   E  Q+QQD   LPSF
Sbjct: 525  LVQDNSPSYAPSE-SQKQQDPPELPSF 550


>ref|XP_002521347.1| conserved hypothetical protein [Ricinus communis]
            gi|223539425|gb|EEF41015.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 864

 Score =  377 bits (967), Expect = e-101
 Identities = 254/563 (45%), Positives = 316/563 (56%), Gaps = 16/563 (2%)
 Frame = -3

Query: 1644 TTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQDPFHEVRRKR 1465
            TTT  +SA VRKTIQSIKEIVGN S+ADIY+ LK++NMDPNETAQKLLNQDPFHEV+RKR
Sbjct: 16   TTTHTLSATVRKTIQSIKEIVGNFSDADIYMALKETNMDPNETAQKLLNQDPFHEVKRKR 75

Query: 1464 DKKKENISYKASAEPRKQTEH-AQAAKSQTFPDRNVRRGSFVRNALP---GISREFRVVR 1297
            DKKKE+++Y+ S + RK  E+  Q  K +TF DRN R+G ++R A+P   GI+REFRVVR
Sbjct: 76   DKKKESMAYRGSLDSRKNPENMGQGTKFRTFSDRNTRQGGYIRAAVPGNAGINREFRVVR 135

Query: 1296 DNRVNHNASRDINAASFQSSSANVEV-IPHVPAKSPKGILIDQDHLAARNSGDHKPSHVV 1120
            DNRVN N +R+   A  Q S ++ E+ I  V  K   G   +  H   R+S     S   
Sbjct: 136  DNRVNLNTTREPKPAMQQGSISSDELGISTVTEKGSSGSSGNVKHSGVRSS-----SQAS 190

Query: 1119 NRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSASLTQGLKPRNFQPSSATLAXXXXX 940
            N P  S     +DA S    RK + E+  A VPS+AS  Q +KP + Q  SATLA     
Sbjct: 191  NGPPDSQSRHTRDATSNFTDRKAMTEEKRAVVPSAASRIQVMKPSS-QHHSATLA-SSNS 248

Query: 939  XXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVKHXXXXXXXXXXXX 760
                                 S AVGAIKREVGVVG RRQ SENAVK+            
Sbjct: 249  VVGVYSSSMDPVHVPSPESRSSAAVGAIKREVGVVGGRRQSSENAVKNSSASSSSFSNSV 308

Query: 759  XXXXSN-----QQSPSAAKSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSKAHQVVGHQKA 595
                 +     Q  P+ +K+ Q+ +              R FLGNQYS      VGHQKA
Sbjct: 309  LGRDGSLPESFQPFPTISKNDQVNEPVATESAMPSISVGRSFLGNQYSRTHQTAVGHQKA 368

Query: 594  AQSNMEWKPKSTQKXXXXXXXXXXXXANLISPPS-NHTDTKMESDHLQRKFSGLGITENQ 418
             Q N EWKPKS+QK                SPP+ N  D + ++  +Q K   + I ENQ
Sbjct: 369  TQHNKEWKPKSSQKASVGSPGVIGTPTKSSSPPAGNSKDLESDATDMQEKLLRVNIYENQ 428

Query: 417  HVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSNGESSASV--SAPL 244
            +VII QH++VPE DR  LTFGSF VEFDS++N  S  Q  G+ + S  ES+AS+  SAP 
Sbjct: 429  NVIIAQHIRVPETDRCRLTFGSFGVEFDSSRNMPSGFQAAGVTKDSKAESAASLSASAPE 488

Query: 243  TSNDDAN---EVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXXXRNLENYVDIGLVRN 73
            +S+DDA+   +V+LL  Q          S A SEH            NL+NY DIGLVR+
Sbjct: 489  SSSDDASGNKQVELLDEQVRNSGSDSPASGAVSEH--QSPDKSSSPPNLDNYADIGLVRD 546

Query: 72   DSPSYTSVEPQQQQQDVSGLPSF 4
             SP +TS E  Q QQD   LPSF
Sbjct: 547  SSP-FTSSE-SQHQQDPPELPSF 567


>emb|CAN69468.1| hypothetical protein VITISV_042555 [Vitis vinifera]
          Length = 914

 Score =  359 bits (921), Expect = 3e-96
 Identities = 256/629 (40%), Positives = 319/629 (50%), Gaps = 76/629 (12%)
 Frame = -3

Query: 1662 SRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNET----------- 1516
            SR++GG     +  +V KTIQ IKEIVGNHS+ADIYV L++ NMDPNET           
Sbjct: 5    SRMEGGMQI--LPPQVHKTIQLIKEIVGNHSDADIYVALREMNMDPNETVQKLLNQDLDI 62

Query: 1515 --------------AQKLLNQDPFHEVRRKRDKKKENISYKASAEPRKQTEHAQAAKSQT 1378
                          AQKLLNQDPFHEV+RKRDKKKE+  YK   EPR   E+    K ++
Sbjct: 63   HVMLREMNMDPNEVAQKLLNQDPFHEVKRKRDKKKESTGYKRPTEPRIYIENVGQGKFRS 122

Query: 1377 FPDRNVRRGSFVRNALPG------------------------------------ISREFR 1306
            FPDRNVRRG + R+ +PG                                    I REFR
Sbjct: 123  FPDRNVRRGGYSRSTVPGNAKTYQFYHSFVLELLYLTVCFLLSELMVRILLDAGIGREFR 182

Query: 1305 VVRDNRVNHNASRDINAASFQ-SSSANVEVIPHVPAK-SPKGILIDQDHLAARNSGDHKP 1132
            VVRDNRVN N +RD+   S Q ++SAN +VI ++  K +  G   +Q   + R S     
Sbjct: 183  VVRDNRVNQNTNRDMKPVSPQLATSANEQVISNISEKGNSTGTSNNQKPSSGRQS----- 237

Query: 1131 SHVVNRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSASLTQGLKPRNFQPSSATLAX 952
            S  +N P+ + PG+ QDA S G++RK L E+  A +P++ S  Q +KP + QP SA+LA 
Sbjct: 238  SQSLNGPTDARPGIPQDANSSGSNRKELLEERQATIPNAVSRVQAVKPNDSQPYSASLAS 297

Query: 951  XXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVKHXXXXXXXX 772
                                        VGAIKREVGVVGVRRQ +EN+VKH        
Sbjct: 298  NSSVVGVYSSSSDPVHVPSPDSRSS-AIVGAIKREVGVVGVRRQSTENSVKHSSAPSSSL 356

Query: 771  XXXXXXXXSNQQSPSAA---------KSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSKAH 619
                      + SPS           KS Q  Q             +R FLGNQY S+ H
Sbjct: 357  PSSLLG---RENSPSTEPFRPFNAIPKSDQPRQTTVPDHVIPSMPVNRSFLGNQYGSRPH 413

Query: 618  QV-VGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXANLISPPS-NHTDTKMESDHLQRKF 445
            Q  VGHQKA Q N EWKPKS+QK            A  +SP + N  D + E+  LQ K 
Sbjct: 414  QQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQDKL 473

Query: 444  SGLGITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSNGESS 265
            S   I+ENQ+VII QH++VPE DR  LTFGSF  +      FAS  Q  G A++ + E S
Sbjct: 474  SQASISENQNVIIAQHIRVPETDRCRLTFGSFGAD------FASGFQAVGNADEPSAEPS 527

Query: 264  A--SVSAPLTSNDDANEVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXXXRNLENYVD 91
            A  SVS P +S+DD ++   L  Q          S  ASEH           +NLENY D
Sbjct: 528  ASLSVSPPESSSDDGSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSSPQNLENYAD 587

Query: 90   IGLVRNDSPSYTSVEPQQQQQDVSGLPSF 4
            IGLVR  SPSYT    QQQ++ V  LPSF
Sbjct: 588  IGLVRESSPSYTPESQQQQERHV--LPSF 614


>emb|CBI35892.3| unnamed protein product [Vitis vinifera]
          Length = 809

 Score =  350 bits (899), Expect = 1e-93
 Identities = 247/578 (42%), Positives = 303/578 (52%), Gaps = 21/578 (3%)
 Frame = -3

Query: 1674 VVSGSRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQ 1495
            +VSGSR++GGT    + ARVRKTIQSIKEIVGNHS+ADIYVTL+++NMDPNET QKLL Q
Sbjct: 1    MVSGSRMEGGTQI--LPARVRKTIQSIKEIVGNHSDADIYVTLRETNMDPNETTQKLLYQ 58

Query: 1494 DPFHEVRRKRDKKKENISYKASAEPRKQTEHAQAAKSQTFPDRNVRRGSFVRNALP---- 1327
            DPFHEV+RKRDKKKE+  YK   EPR   E+    K ++FPDRNVRRG + R+ +P    
Sbjct: 59   DPFHEVKRKRDKKKESTGYKRPTEPRIYIENVGQGKFRSFPDRNVRRGGYSRSTVPGNAK 118

Query: 1326 ------------GISREFRVVRDNRVNHNASRDINAASFQ-SSSANVEVIPHVPAKSPKG 1186
                        GI REFRVVRDNRVN N +RD+   S Q ++S N +VI ++   S KG
Sbjct: 119  TYQFYHSILLDAGIGREFRVVRDNRVNQNTNRDMKPVSPQLATSVNEQVISNI---SEKG 175

Query: 1185 ILIDQDHLAARNSGDHKPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSASL 1006
                  +    +SG  + S  +N P+ + PG+ QDA S                      
Sbjct: 176  NSTGTSNNQKPSSG-RQSSQSLNGPTDARPGIPQDANS---------------------- 212

Query: 1005 TQGLKPRNFQPSSATLAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVR 826
               +KP + QP SA+LA                          S  VGAIKREVGVVGVR
Sbjct: 213  ---MKPNDSQPYSASLA-SNSSVVGVYSSSSDPVHVPSPDSRSSAIVGAIKREVGVVGVR 268

Query: 825  RQPSENAVKHXXXXXXXXXXXXXXXXSNQQSPSAAKSTQLGQXXXXXXXXXXXXXSRPFL 646
            RQ +EN+                             S Q  Q             +R FL
Sbjct: 269  RQSTENS-----------------------------SDQPRQTTVPDHVIPSMPVNRSFL 299

Query: 645  GNQYSSKAH-QVVGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXANLISP-PSNHTDTKM 472
            GNQY S+ H Q VGHQKA Q N EWKPKS+QK            A  +SP   N  D + 
Sbjct: 300  GNQYGSRPHQQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLES 359

Query: 471  ESDHLQRKFSGLGITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGI 292
            E+  LQ K S   I+ENQ+VII QH++VPE DR  LTFGSF  +      FAS  Q  G 
Sbjct: 360  ETAKLQDKLSQASISENQNVIIAQHIRVPETDRCRLTFGSFGAD------FASGFQAVGN 413

Query: 291  AEQSNGESSA--SVSAPLTSNDDANEVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXX 118
            A++ + E SA  SVS P +S+DD ++   L  Q          S  ASEH          
Sbjct: 414  ADEPSAEPSASLSVSPPESSSDDGSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSS 473

Query: 117  XRNLENYVDIGLVRNDSPSYTSVEPQQQQQDVSGLPSF 4
             +NLENY DIGLVR  SPSYT    QQQ++ V  LPSF
Sbjct: 474  PQNLENYADIGLVRESSPSYTPESQQQQERHV--LPSF 509


>ref|XP_002299597.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550347518|gb|EEE84402.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 854

 Score =  347 bits (890), Expect = 1e-92
 Identities = 249/567 (43%), Positives = 312/567 (55%), Gaps = 14/567 (2%)
 Frame = -3

Query: 1662 SRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQDPFH 1483
            S   G   T  +SA+VRKTIQSIKEIVGN S+ADIY+ LK++NMDPNETAQKLLNQDPFH
Sbjct: 16   STSSGQQQTHTLSAKVRKTIQSIKEIVGNFSDADIYMVLKETNMDPNETAQKLLNQDPFH 75

Query: 1482 EVRRKRDKKKENISYKASAEPRKQTEH-AQAAKSQTFPDRNVRRGSFVRNALP---GISR 1315
            EV+RKR+KKKEN SY+ S + RK +E+  Q  +  TF DRN +RG + R A P   GI+R
Sbjct: 76   EVKRKREKKKENTSYRGSVDSRKHSENFGQGMRPHTFSDRNAQRGGYTRTASPGNRGINR 135

Query: 1314 EFRVVRDNRVNHNASRDINAASFQ-SSSANVEVIPHVPAKSPKGILIDQDHLAARNSGDH 1138
            EFRVVRDNRVN N SR+   A    S+SA  +    V  K   GI  +     AR+S  H
Sbjct: 136  EFRVVRDNRVNQNTSREPKPALLHGSTSAKEQGSGVVTEKGSTGISSNLKPSDARSS--H 193

Query: 1137 KPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSA--SLTQGLKPRNFQPSSA 964
            + S   N P  S P   +DA S    RK + E+    V S+A  S  Q  K  N Q  +A
Sbjct: 194  QAS---NGPIDSEPRHNRDANSSVGDRKVVSEEK-RSVASNATTSRVQVAKSNNSQQHNA 249

Query: 963  TLAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVKHXXXX 784
             L                           SG VGAIKREVGVVG RRQ  ENAVK     
Sbjct: 250  -LQASSNPVVGVYSSSTDPVHVPSPDSRSSGVVGAIKREVGVVGGRRQSFENAVKDLSSS 308

Query: 783  XXXXXXXXXXXXSNQQSPSAAKSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSKAH-QVVG 607
                          +   + +K+ Q+ Q             +R FL NQY+++ H Q VG
Sbjct: 309  NSFSESF-------RPFTAISKTDQVSQ-TAAIEPMPSVPVNRSFLNNQYNNRPHQQAVG 360

Query: 606  HQKAAQSNMEWKPKSTQKXXXXXXXXXXXXANLISPPS-NHTDTKMESDHLQRKFSGLGI 430
            H KA+Q N EWKPKS+QK                SPP+ N  + ++++ +LQ KFS + I
Sbjct: 361  HPKASQHNKEWKPKSSQKSSVTSPGVIGTPTKSSSPPTDNSKNMELDAANLQDKFSRINI 420

Query: 429  TENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSNGESSASV-- 256
             ENQ+VII QH++VPE DR  LTFGSF V FD+ +      Q  GI+E+SNGES+ S+  
Sbjct: 421  HENQNVIIAQHIRVPETDRCKLTFGSFGVGFDAPR--TPGFQAVGISEESNGESAISLPA 478

Query: 255  SAPLTSNDDAN---EVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXXXRNLENYVDIG 85
            SAP +S+DDA+   +++LL  Q          +   SEH            NL+NY DIG
Sbjct: 479  SAPDSSSDDASGGKQIELLDDQARNYGSDSPAASLESEH--PLPVNSSSPPNLDNYADIG 536

Query: 84   LVRNDSPSYTSVEPQQQQQDVSGLPSF 4
            LVRN SPSY   E  QQQQD   LPSF
Sbjct: 537  LVRNSSPSYAPSE-SQQQQDHPELPSF 562


>ref|XP_007135474.1| hypothetical protein PHAVU_010G132600g [Phaseolus vulgaris]
            gi|561008519|gb|ESW07468.1| hypothetical protein
            PHAVU_010G132600g [Phaseolus vulgaris]
          Length = 864

 Score =  343 bits (880), Expect = 2e-91
 Identities = 239/577 (41%), Positives = 310/577 (53%), Gaps = 20/577 (3%)
 Frame = -3

Query: 1674 VVSGSRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQ 1495
            +V GSR +  T T  +SARVRKTIQSIKEIVGNHS+ADIYV LK++NMDPNET QKLLNQ
Sbjct: 1    MVPGSRTESATGTHLLSARVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQ 60

Query: 1494 DPFHEVRRKRDKKKE--NISYKASAEPRKQTEH--AQAAKSQTFPDRNVRRGSFVRNALP 1327
            DPFHEV+R+RD+KKE  N+    SA+ R+ +E+   Q  K  T  +RNVRR ++ RN LP
Sbjct: 61   DPFHEVKRRRDRKKEPQNVGNNGSADSRRPSENNSGQGVKFHTPSERNVRRANYSRNTLP 120

Query: 1326 GISREFRVVRDNRVNHNASRDINAASFQS-SSANVEVIPHVPAKSPKGILIDQDHLAARN 1150
            GISREFRVVRDNRVN+   +++   S Q  +SA+ E+  ++   S KG      H   R+
Sbjct: 121  GISREFRVVRDNRVNY-IYKEVKPLSQQHLASASEELNVNL---SEKGSSASTSH---RS 173

Query: 1149 SGDHKPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYE----DAWAKVPSSASLTQGLKPRN 982
            SG    S  +N PS S     +DA      RK   E    D  + + ++A   Q +KP +
Sbjct: 174  SGSRNSSQALNGPSDSFARYPKDAVPNIVDRKIASEDKDKDKQSMISNAAERVQPIKPNH 233

Query: 981  FQPSSATLAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAV 802
               + A++A                          S  VGAI+REVGVVGVRRQPS+N V
Sbjct: 234  IHQNPASVA-SSSSAVGVYSSSTDPVHVPSPDSRSSSVVGAIRREVGVVGVRRQPSDNKV 292

Query: 801  KHXXXXXXXXXXXXXXXXSNQQSPSAA--KSTQLGQXXXXXXXXXXXXXSRPFLGNQYSS 628
            K                 ++   P  A  K+ Q  Q             SRP + NQY+ 
Sbjct: 293  KQSFAPSSSYVAGKDGTSADSFQPVGAVLKTEQFSQTKVTEPSLSGVPVSRPSVNNQYNG 352

Query: 627  KAH-QVVGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXANLISPPSNHTDTKMESD--HL 457
            + H Q+VGHQ+ +Q N EWKPKS+QK                + P       +ESD   L
Sbjct: 353  RPHQQLVGHQRVSQQNKEWKPKSSQKPNSNNPGVIGTPKKAAASPPAENSVDIESDAVEL 412

Query: 456  QRKFSGLGITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSN 277
            Q K S L I ENQ+VII QH+QVPE DR  LTFG+   E DS++   S+    G +E+SN
Sbjct: 413  QDKLSQLNIYENQNVIIAQHIQVPETDRCRLTFGTIGTEIDSSR-LQSKYHIVGPSEKSN 471

Query: 276  GESSAS--VSAPLTSNDD---ANEVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXXXR 112
             E +AS  V AP  S DD   + +VDLL             S A SE            +
Sbjct: 472  DELAASLAVPAPELSTDDVSGSKQVDLLDEHIRSSGSDSPVSGAPSEQQLPDNKDSSNTQ 531

Query: 111  NLENYVDIGLVRNDSPSYTSVEPQQQQ-QDVSGLPSF 4
            NL+NY +IGLVR+ SPSY   EPQQQ+  D+ G  ++
Sbjct: 532  NLDNYANIGLVRDSSPSYAPSEPQQQESHDMPGFAAY 568


>ref|XP_003528451.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max]
          Length = 863

 Score =  341 bits (874), Expect = 8e-91
 Identities = 240/578 (41%), Positives = 309/578 (53%), Gaps = 21/578 (3%)
 Frame = -3

Query: 1674 VVSGSRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQ 1495
            +V GSR +GGT T  +SARVRKTIQSIKEIVGNHS+ADIYV LK++NMDPNET QKLLNQ
Sbjct: 1    MVPGSRTEGGTGTHLLSARVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQ 60

Query: 1494 DPFHEVRRKRDKKKE--NISYKA--SAEPRKQTEH--AQAAKSQTFPDRNVRRGSFVRNA 1333
            DPFHEV+R+RD+KKE  N+  K   SA+ R+ +E+   Q  K     +RNVRR ++ RN 
Sbjct: 61   DPFHEVKRRRDRKKETQNVGNKGQPSADSRRSSENNSGQGMKFNAPSERNVRRTNYSRNT 120

Query: 1332 LPGISREFRVVRDNRVNHNASRDINAASFQSSSANVEVIPHVPAKSPKGILIDQDHLAAR 1153
            LPGIS+EFRVVRDNRVNH            S+SA  ++  + P    KG     +H   R
Sbjct: 121  LPGISKEFRVVRDNRVNHIYKEVKPLTQQHSTSATEQLNVNTP---DKGSSTSTNH---R 174

Query: 1152 NSGDHKPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYE--DAWAKVPSSASLTQGLKPRNF 979
            +SG    S   N PS S+   ++DA      RK   E  D    + ++A   Q +KP N 
Sbjct: 175  SSGSRNSSLASNGPSDSHARYLKDAVPNIIDRKIASEDKDKQGMISNAAGRVQPIKPNNA 234

Query: 978  QPSSATLAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVK 799
              +SA++A                          SG VGAI+REVGVVGVRRQ S+N  K
Sbjct: 235  HQNSASVA-STSSAVGVYSSSTDPVHVPSPDSRSSGVVGAIRREVGVVGVRRQSSDNKAK 293

Query: 798  HXXXXXXXXXXXXXXXXSN--QQSPSAAKSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSK 625
                             ++  Q   + +K+ Q  Q             SRP L NQY+++
Sbjct: 294  QSFAPSISYVVGKDGTSADSFQSVGAVSKTEQFSQTNVTEPSLSGMPVSRPSLNNQYNNR 353

Query: 624  AH-QVVGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXAN----LISPPS-NHTDTKMESD 463
             H Q+VGHQ+ +Q N EWKPKS+QK                    SPP+ N  D +  + 
Sbjct: 354  PHQQLVGHQRVSQQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENSGDIESNTT 413

Query: 462  HLQRKFSGLGITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQ 283
             LQ K S + I ENQ+VII QH++VPE DR  LTFG+   E DS++   S+    G +E+
Sbjct: 414  ELQDKLSQVNIYENQNVIIAQHIRVPETDRCQLTFGTIGTELDSSR-LQSKYHIIGASEK 472

Query: 282  SNGESSAS--VSAPLTSNDD---ANEVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXX 118
            SN E +AS  V AP  S DD   + +VDL              S AASE           
Sbjct: 473  SNEELTASLTVPAPELSTDDVSGSKQVDLRDEHIRSSRSDSPVSGAASEQQLPDNKDSSN 532

Query: 117  XRNLENYVDIGLVRNDSPSYTSVEPQQQQQDVSGLPSF 4
             +NL+NY +IGLVR+ SPSY   EP  QQQD   +P F
Sbjct: 533  TQNLDNYANIGLVRDSSPSYAPSEP--QQQDSHDMPGF 568


>gb|EXB29673.1| hypothetical protein L484_013447 [Morus notabilis]
          Length = 854

 Score =  337 bits (865), Expect = 9e-90
 Identities = 233/573 (40%), Positives = 309/573 (53%), Gaps = 16/573 (2%)
 Frame = -3

Query: 1674 VVSGSRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQ 1495
            +VS SR+DGG     +SA VRKTIQSIKEIVGNHS+ DIY+ LK++NMDPNETAQKLLNQ
Sbjct: 1    MVSASRIDGGPQI--LSAGVRKTIQSIKEIVGNHSDIDIYLALKETNMDPNETAQKLLNQ 58

Query: 1494 DPFHEVRRKRDKKKENISYKASAEPRKQTE-HAQAAKSQTFPDRNVRRGSFVRNALP--- 1327
            DPFHEVRRKRDKKKE+    +S +PR  +E   Q +K  TF DRN RRG + RN+LP   
Sbjct: 59   DPFHEVRRKRDKKKESAGNDSSTDPRGHSEVKGQGSKVNTFSDRNARRGGYARNSLPDRI 118

Query: 1326 ----GISREFRVVRDNRVNHNASRDINAASFQSSSANVEVIPHVPAKSPKGILIDQDHLA 1159
                G+SREFRVVRDNRVN + +R+   AS  +S        ++  K   G    +   A
Sbjct: 119  MLHAGVSREFRVVRDNRVNRSLNREAKPAS--ASPTPPSTFENISGKGSTGSSNSEKPTA 176

Query: 1158 ARNSGDHKPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSASLTQGLKPRNF 979
            ++NS     S  +  PS S+  +  D  S G  RK + E+      S AS  Q  K  N 
Sbjct: 177  SKNS-----SQGLYGPSDSHLRIAHDIESTGLVRKEVSEEKRVTFSSVASRVQAGKANNA 231

Query: 978  QPSSATLAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVK 799
            +  SA +A                          SG+VGAIKREVGVVGVRRQ S+N+  
Sbjct: 232  RSQSAMVA-SSSSAIGVYSSSTDPVHVPSPDSRSSGSVGAIKREVGVVGVRRQSSDNSKS 290

Query: 798  HXXXXXXXXXXXXXXXXSN--QQSPSAAKSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSK 625
                             +   Q   + +K+ ++GQ             SR  L + YS++
Sbjct: 291  SVPSSSFSNSLLGGEGSAETLQSFSTISKNDEVGQ--ASESILPSVSVSRSLLSSHYSNR 348

Query: 624  A--HQVVGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXANLISPPSNHTD-TKMESDHLQ 454
                Q VGHQKA+Q N EWKPKS+QK               +SPP+++++ ++ E   + 
Sbjct: 349  QQHQQPVGHQKASQPNKEWKPKSSQKPSLNNPGVIGTPTKSVSPPAHNSEVSESEPAKVL 408

Query: 453  RKFSGLGITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSNG 274
             K S + I ENQ+VII QH++VPE DR  LTFGSF  EF+S+ +  +  Q   I E SNG
Sbjct: 409  EKLSRVNIHENQNVIIAQHIRVPETDRCRLTFGSFGKEFESDSDLVNGYQAGAIGE-SNG 467

Query: 273  ESSASVSAPLTSNDDAN---EVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXXXRNLE 103
            E+++S+SAP +S  DA+   +VDL   Q          S   SE+           +NL+
Sbjct: 468  EAASSLSAPESSIGDASGSKQVDLTDEQIRNSGSDSPTSGGTSENQFPDKKESTSPQNLD 527

Query: 102  NYVDIGLVRNDSPSYTSVEPQQQQQDVSGLPSF 4
            NY DIGLV+ +SPSY   + QQ +     LP F
Sbjct: 528  NYADIGLVQGNSPSYAPADSQQPEH--PELPGF 558


>ref|XP_006583148.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Glycine max]
          Length = 855

 Score =  336 bits (862), Expect = 2e-89
 Identities = 237/578 (41%), Positives = 305/578 (52%), Gaps = 21/578 (3%)
 Frame = -3

Query: 1674 VVSGSRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQ 1495
            +V GSR +GGT T  +SARVRKTIQSIKEIVGNHS+ADIYV LK++NMDPNET QKLLNQ
Sbjct: 1    MVPGSRTEGGTGTHLLSARVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQ 60

Query: 1494 DPFHEVRRKRDKKKE--NISYKA--SAEPRKQTEH--AQAAKSQTFPDRNVRRGSFVRNA 1333
            DPFHEV+R+RD+KKE  N+  K   SA+ R+ +E+   Q  K     +RNVRR ++ RN 
Sbjct: 61   DPFHEVKRRRDRKKETQNVGNKGQPSADSRRSSENNSGQGMKFNAPSERNVRRTNYSRNT 120

Query: 1332 LPGISREFRVVRDNRVNHNASRDINAASFQSSSANVEVIPHVPAKSPKGILIDQDHLAAR 1153
            LPGIS+EFRVVRDNRVNH            S+SA  ++  + P K               
Sbjct: 121  LPGISKEFRVVRDNRVNHIYKEVKPLTQQHSTSATEQLNVNTPDKG-------------- 166

Query: 1152 NSGDHKPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYE--DAWAKVPSSASLTQGLKPRNF 979
            +SG    S   N PS S+   ++DA      RK   E  D    + ++A   Q +KP N 
Sbjct: 167  SSGSRNSSLASNGPSDSHARYLKDAVPNIIDRKIASEDKDKQGMISNAAGRVQPIKPNNA 226

Query: 978  QPSSATLAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVK 799
              +SA++A                          SG VGAI+REVGVVGVRRQ S+N  K
Sbjct: 227  HQNSASVA-STSSAVGVYSSSTDPVHVPSPDSRSSGVVGAIRREVGVVGVRRQSSDNKAK 285

Query: 798  HXXXXXXXXXXXXXXXXSN--QQSPSAAKSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSK 625
                             ++  Q   + +K+ Q  Q             SRP L NQY+++
Sbjct: 286  QSFAPSISYVVGKDGTSADSFQSVGAVSKTEQFSQTNVTEPSLSGMPVSRPSLNNQYNNR 345

Query: 624  AH-QVVGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXAN----LISPPS-NHTDTKMESD 463
             H Q+VGHQ+ +Q N EWKPKS+QK                    SPP+ N  D +  + 
Sbjct: 346  PHQQLVGHQRVSQQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENSGDIESNTT 405

Query: 462  HLQRKFSGLGITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQ 283
             LQ K S + I ENQ+VII QH++VPE DR  LTFG+   E DS++   S+    G +E+
Sbjct: 406  ELQDKLSQVNIYENQNVIIAQHIRVPETDRCQLTFGTIGTELDSSR-LQSKYHIIGASEK 464

Query: 282  SNGESSAS--VSAPLTSNDD---ANEVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXX 118
            SN E +AS  V AP  S DD   + +VDL              S AASE           
Sbjct: 465  SNEELTASLTVPAPELSTDDVSGSKQVDLRDEHIRSSRSDSPVSGAASEQQLPDNKDSSN 524

Query: 117  XRNLENYVDIGLVRNDSPSYTSVEPQQQQQDVSGLPSF 4
             +NL+NY +IGLVR+ SPSY   EP  QQQD   +P F
Sbjct: 525  TQNLDNYANIGLVRDSSPSYAPSEP--QQQDSHDMPGF 560


>ref|XP_002304144.2| hypothetical protein POPTR_0003s06200g [Populus trichocarpa]
            gi|550342535|gb|EEE79123.2| hypothetical protein
            POPTR_0003s06200g [Populus trichocarpa]
          Length = 858

 Score =  335 bits (859), Expect = 4e-89
 Identities = 234/554 (42%), Positives = 302/554 (54%), Gaps = 12/554 (2%)
 Frame = -3

Query: 1629 MSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQDPFHEVRRKRDKKKE 1450
            +SARVRK IQSIKEIVGN S+ADIY+ LK++NMDPNET QKLLNQDPFHEV+RKRDKKKE
Sbjct: 30   LSARVRKIIQSIKEIVGNFSDADIYMVLKETNMDPNETVQKLLNQDPFHEVKRKRDKKKE 89

Query: 1449 NISYKASAEPRKQTEH-AQAAKSQTFPDRNVRRGSFVRN---ALPGISREFRVVRDNRVN 1282
            ++SY+ S + RKQ E+  Q  + +TF DR  +RG   R       G++REFRVVRDNR+N
Sbjct: 90   SMSYRGSVDSRKQPENFDQGMRPRTFLDRYAQRGGHTRTDSIGNRGVNREFRVVRDNRIN 149

Query: 1281 HNASRDINAASFQSSSANVEVIPHVPAKSPKGILIDQDHLAARNSGDHKPSHVVNRPSHS 1102
             NA+R+   A  Q S++  E    V  K   G  I  ++L   N+     S   N P++ 
Sbjct: 150  QNANREPKPALPQGSTSAKEKGSGVTEKGSAG--ISNNNLKPSNA--QSSSQTSNGPTYP 205

Query: 1101 NPGLVQDAYSGGAHRKGLYEDAWAKVP-SSASLTQGLKPRNFQPSSATLAXXXXXXXXXX 925
             P   +DA S    RK + E+  +    ++ S  Q +KP N Q   A+LA          
Sbjct: 206  EPRYNRDAKSRAGDRKVVSEEKRSTASNATTSRAQVVKPNNSQQHDASLA-SSNSVVGVY 264

Query: 924  XXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVKHXXXXXXXXXXXXXXXXS 745
                            SG VGAIKREVGVVG RRQ SENAVK                  
Sbjct: 265  SSSTDPVHVPSPDSRSSGVVGAIKREVGVVGGRRQ-SENAVKDLSSSNSFSESFHPL--- 320

Query: 744  NQQSPSAAKSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSKAH-QVVGHQKAAQSNMEWKP 568
                 +A  +T   +             +R  L NQY+S+ H Q VG+ KA+Q N EWKP
Sbjct: 321  -----TAISNTDQVRQTAVIESMPSVPVNRSLLHNQYNSRPHQQTVGYPKASQHNKEWKP 375

Query: 567  KSTQKXXXXXXXXXXXXANLISPPS-NHTDTKMESDHLQRKFSGLGITENQHVIIPQHLQ 391
            KS+QK                 PP+ N    ++ + +LQ KFS + I ENQ+VII QH++
Sbjct: 376  KSSQKSSITSPGVIGTPTKSSLPPTDNSKSMELNAANLQDKFSRVNIHENQNVIIAQHIR 435

Query: 390  VPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSNGESSASV--SAPLTSNDDA--- 226
            VPE+DR  LTFGSF VEFD ++N     Q  GI+E+SN ES+ S+  S P +S++DA   
Sbjct: 436  VPESDRCKLTFGSFGVEFDPSRNSTPGFQAVGISEESNRESAISLPASCPESSSEDAPGG 495

Query: 225  NEVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXXXRNLENYVDIGLVRNDSPSYTSVE 46
             +++LL  Q          +  ASEH            +L+NY DIGLVRN SPSY   E
Sbjct: 496  KQIELLDDQARNSESDSPEAGLASEH--QLPEKSSSPPDLDNYADIGLVRNSSPSYAPSE 553

Query: 45   PQQQQQDVSGLPSF 4
              QQQQD   LPSF
Sbjct: 554  -SQQQQDHPELPSF 566


>ref|XP_006583149.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Glycine max]
          Length = 830

 Score =  333 bits (854), Expect = 2e-88
 Identities = 240/576 (41%), Positives = 308/576 (53%), Gaps = 19/576 (3%)
 Frame = -3

Query: 1674 VVSGSRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQ 1495
            +V GSR +GGT T  +SARVRKTIQSIKEIVGNHS+ADIYV LK++NMDPNET QKLLNQ
Sbjct: 1    MVPGSRTEGGTGTHLLSARVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQ 60

Query: 1494 DPFHEVRRKRDKKKE--NISYKA--SAEPRKQTEH--AQAAKSQTFPDRNVRRGSFVRNA 1333
            DPFHEV+R+RD+KKE  N+  K   SA+ R+ +E+   Q  K     +RNVRR ++ RN 
Sbjct: 61   DPFHEVKRRRDRKKETQNVGNKGQPSADSRRSSENNSGQGMKFNAPSERNVRRTNYSRNT 120

Query: 1332 LPGISREFRVVRDNRVNHNASRDINAASFQSSSANVEVIPHVPAKSPKGILIDQDHLAAR 1153
            LPGIS+EFRVVRDNRVNH            S+SA  ++  + P    KG     +H   R
Sbjct: 121  LPGISKEFRVVRDNRVNHIYKEVKPLTQQHSTSATEQLNVNTP---DKGSSTSTNH---R 174

Query: 1152 NSGDHKPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYE--DAWAKVPSSASLTQGLKPRNF 979
            +SG    S   N PS S+   ++DA      RK   E  D    + ++A   Q +KP N 
Sbjct: 175  SSGSRNSSLASNGPSDSHARYLKDAVPNIIDRKIASEDKDKQGMISNAAGRVQPIKPNNA 234

Query: 978  QPSSATLAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVK 799
              +SA++A                          SG VGAI+REVGVVGVRRQ S+N  K
Sbjct: 235  HQNSASVA-STSSAVGVYSSSTDPVHVPSPDSRSSGVVGAIRREVGVVGVRRQSSDNKAK 293

Query: 798  HXXXXXXXXXXXXXXXXSNQQSPSAAKSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSKAH 619
                                QS + + S  +G+             SRP L NQY+++ H
Sbjct: 294  --------------------QSFAPSISYVVGK-----------DVSRPSLNNQYNNRPH 322

Query: 618  -QVVGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXAN----LISPPS-NHTDTKMESDHL 457
             Q+VGHQ+ +Q N EWKPKS+QK                    SPP+ N  D +  +  L
Sbjct: 323  QQLVGHQRVSQQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENSGDIESNTTEL 382

Query: 456  QRKFSGLGITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSN 277
            Q K S + I ENQ+VII QH++VPE DR  LTFG+   E DS++   S+    G +E+SN
Sbjct: 383  QDKLSQVNIYENQNVIIAQHIRVPETDRCQLTFGTIGTELDSSR-LQSKYHIIGASEKSN 441

Query: 276  GESSAS--VSAPLTSNDD---ANEVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXXXR 112
             E +AS  V AP  S DD   + +VDL              S AASE            +
Sbjct: 442  EELTASLTVPAPELSTDDVSGSKQVDLRDEHIRSSRSDSPVSGAASEQQLPDNKDSSNTQ 501

Query: 111  NLENYVDIGLVRNDSPSYTSVEPQQQQQDVSGLPSF 4
            NL+NY +IGLVR+ SPSY   EP  QQQD   +P F
Sbjct: 502  NLDNYANIGLVRDSSPSYAPSEP--QQQDSHDMPGF 535


>ref|XP_006426627.1| hypothetical protein CICLE_v10024871mg [Citrus clementina]
            gi|557528617|gb|ESR39867.1| hypothetical protein
            CICLE_v10024871mg [Citrus clementina]
          Length = 867

 Score =  328 bits (841), Expect = 5e-87
 Identities = 240/574 (41%), Positives = 311/574 (54%), Gaps = 21/574 (3%)
 Frame = -3

Query: 1662 SRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQDPFH 1483
            +R++GGT    +SA +R TIQ+IKEIVGNHS+ADIY TLKDSNMDPNETAQKLLNQDPF 
Sbjct: 17   TRIEGGTQI--LSAGMRNTIQTIKEIVGNHSDADIYFTLKDSNMDPNETAQKLLNQDPFL 74

Query: 1482 EVRRKRDKKKENISYKASAEPRKQTE-HAQAAKSQTFPDRNVRRGSFVRNALP--GISRE 1312
            EV+R+RDKKKEN+SYK+  EPRK +E   +  + +T+ DRN RR  + RNALP  GI+RE
Sbjct: 75   EVKRRRDKKKENMSYKSLEEPRKNSEIFGKTMRIRTYADRNARRRGYNRNALPDAGINRE 134

Query: 1311 FRVVRDNRVNHNASRDINAASFQSSSANVEVIPHVPAK-SPKGILIDQDHLAARNSGDHK 1135
            FRVVRDNRVN  A+++  +   QSS +  E + +V  K SP G    +     + SG   
Sbjct: 135  FRVVRDNRVNPEANQETKSPLPQSSISTNEKVTNVKEKGSPTGTTGSE-----KPSGGRS 189

Query: 1134 PSHVVNRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSA------SLTQGLKPRNFQP 973
             S   N  ++ +P    D    G  R    E +  K  +SA      ++T+G        
Sbjct: 190  FSQASNGSTNLHPRHAYDHNITGTDR---IEPSAEKFTTSAVNFIQHNITEGY------- 239

Query: 972  SSATLAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVKHX 793
             SATLA                          S AVGAIKREVGVVG  RQ S+NAVK  
Sbjct: 240  -SATLA--SSNSVGGYFSSKDPVHVPSPDSRASSAVGAIKREVGVVGGGRQCSDNAVKDS 296

Query: 792  XXXXXXXXXXXXXXXSN---QQSPSAAKSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSKA 622
                           ++   +  PS +K+ Q+ Q             +R    NQY+ ++
Sbjct: 297  TAPCSSFSNSILGRDNSDSFRPFPSISKADQINQIAATDSGVAGMPANRALFTNQYTGRS 356

Query: 621  H-QVVGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXANLISPP-SNHTDTKMESDHLQRK 448
            H Q VGHQKA+Q N EWKPKS+QK                SPP  +  D + +   LQ +
Sbjct: 357  HQQSVGHQKASQHNKEWKPKSSQKSNVIGPGVIGTPTKSPSPPVDDSKDLESDVAKLQDE 416

Query: 447  FSGLGITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSNGES 268
             S + I ENQ+VII QH++VPE DR  LTFGSF V+F+S++N  S     G AE+SNGES
Sbjct: 417  LSRVNIHENQNVIIAQHIRVPETDRCRLTFGSFGVDFESSRNLGSGFLAAGSAEESNGES 476

Query: 267  SASV--SAPLTSNDDAN---EVDLLHGQXXXXXXXXXXSDAASEH-XXXXXXXXXXXRNL 106
            +AS+  +A  TS +D +    VD+L             S  ASEH            ++L
Sbjct: 477  AASLTGAASKTSGNDVSGRKPVDILDDLVRNSGSNSPASGEASEHQLPDDIKDASSPQDL 536

Query: 105  ENYVDIGLVRNDSPSYTSVEPQQQQQDVSGLPSF 4
            + Y DIGLVR+  PSY   E  QQQQD S L SF
Sbjct: 537  DGYADIGLVRDTDPSYPLSE-SQQQQDSSELASF 569


>ref|XP_006426626.1| hypothetical protein CICLE_v10024871mg [Citrus clementina]
            gi|557528616|gb|ESR39866.1| hypothetical protein
            CICLE_v10024871mg [Citrus clementina]
          Length = 866

 Score =  328 bits (841), Expect = 5e-87
 Identities = 240/574 (41%), Positives = 311/574 (54%), Gaps = 21/574 (3%)
 Frame = -3

Query: 1662 SRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQDPFH 1483
            +R++GGT    +SA +R TIQ+IKEIVGNHS+ADIY TLKDSNMDPNETAQKLLNQDPF 
Sbjct: 17   TRIEGGTQI--LSAGMRNTIQTIKEIVGNHSDADIYFTLKDSNMDPNETAQKLLNQDPFL 74

Query: 1482 EVRRKRDKKKENISYKASAEPRKQTE-HAQAAKSQTFPDRNVRRGSFVRNALP--GISRE 1312
            EV+R+RDKKKEN+SYK+  EPRK +E   +  + +T+ DRN RR  + RNALP  GI+RE
Sbjct: 75   EVKRRRDKKKENMSYKSLEEPRKNSEIFGKTMRIRTYADRNARRRGYNRNALPDAGINRE 134

Query: 1311 FRVVRDNRVNHNASRDINAASFQSSSANVEVIPHVPAK-SPKGILIDQDHLAARNSGDHK 1135
            FRVVRDNRVN  A+++  +   QSS +  E + +V  K SP G    +     + SG   
Sbjct: 135  FRVVRDNRVNPEANQETKSPLPQSSISTNEKVTNVKEKGSPTGTTGSE-----KPSGGRS 189

Query: 1134 PSHVVNRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSA------SLTQGLKPRNFQP 973
             S   N  ++ +P    D    G  R    E +  K  +SA      ++T+G        
Sbjct: 190  FSQASNGSTNLHPRHAYDHNITGTDR---IEPSAEKFTTSAVNFIQHNITEGY------- 239

Query: 972  SSATLAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVKHX 793
             SATLA                          S AVGAIKREVGVVG  RQ S+NAVK  
Sbjct: 240  -SATLA--SSNSVGGYFSSKDPVHVPSPDSRASSAVGAIKREVGVVGGGRQCSDNAVKDS 296

Query: 792  XXXXXXXXXXXXXXXSN---QQSPSAAKSTQLGQXXXXXXXXXXXXXSRPFLGNQYSSKA 622
                           ++   +  PS +K+ Q+ Q             +R    NQY+ ++
Sbjct: 297  TAPCSSFSNSILGRDNSDSFRPFPSISKADQINQIAATDSGVAGMPANRALFTNQYTGRS 356

Query: 621  H-QVVGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXANLISPP-SNHTDTKMESDHLQRK 448
            H Q VGHQKA+Q N EWKPKS+QK                SPP  +  D + +   LQ +
Sbjct: 357  HQQSVGHQKASQHNKEWKPKSSQKSNVIGPGVIGTPTKSPSPPVDDSKDLESDVAKLQDE 416

Query: 447  FSGLGITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSNGES 268
             S + I ENQ+VII QH++VPE DR  LTFGSF V+F+S++N  S     G AE+SNGES
Sbjct: 417  LSRVNIHENQNVIIAQHIRVPETDRCRLTFGSFGVDFESSRNLGSGFLAAGSAEESNGES 476

Query: 267  SASV--SAPLTSNDDAN---EVDLLHGQXXXXXXXXXXSDAASEH-XXXXXXXXXXXRNL 106
            +AS+  +A  TS +D +    VD+L             S  ASEH            ++L
Sbjct: 477  AASLTGAASKTSGNDVSGRKPVDILDDLVRNSGSNSPASGEASEHQLPDDIKDASSPQDL 536

Query: 105  ENYVDIGLVRNDSPSYTSVEPQQQQQDVSGLPSF 4
            + Y DIGLVR+  PSY   E  QQQQD S L SF
Sbjct: 537  DGYADIGLVRDTDPSYPLSE-SQQQQDSSELASF 569


>ref|XP_004163891.1| PREDICTED: uncharacterized protein LOC101226902 [Cucumis sativus]
          Length = 846

 Score =  325 bits (834), Expect = 4e-86
 Identities = 226/568 (39%), Positives = 300/568 (52%), Gaps = 11/568 (1%)
 Frame = -3

Query: 1674 VVSGSRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQ 1495
            +VSG R+DGGT    + ARVRKTIQSIKEIVGNHS+ADIY TLK++NMDPNETAQKLLNQ
Sbjct: 1    MVSGLRVDGGTHV--LPARVRKTIQSIKEIVGNHSDADIYTTLKETNMDPNETAQKLLNQ 58

Query: 1494 DPFHEVRRKRDKKKENISYKASAEPRKQTEHA-QAAKSQTFPDRNVRRGSFVRNALPGIS 1318
            DPF EV+R+RDKKKEN+ YK S + ++ +E   Q  K  T  DRNVRRG++ +++ PGIS
Sbjct: 59   DPFREVKRRRDKKKENVGYKGSLDAQRNSEDVRQGTKVYTLSDRNVRRGAYAKSSWPGIS 118

Query: 1317 REFRVVRDNRVNHNASRDINAASFQSSSANVEVIPHVPAK--SPKGILIDQDHLAARNSG 1144
            +EFRVVRDNRVN N++R++  AS   + +  EV  +V     +P+G        A   S 
Sbjct: 119  KEFRVVRDNRVNRNSNREVKPASSHLALSTNEVSTNVSKSVITPRG--------AHGGSF 170

Query: 1143 DHKPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSASLTQGLKPRNFQPSSA 964
              + S V  R + S+P   +D +S G  +K L +D    + SS        P + +P S 
Sbjct: 171  GGRISQVSFRKTDSHPSNPRDGHSTGMAQKELRDDVGVSMLSSIPDMHIGNPNDSEPHSP 230

Query: 963  TLAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVKHXXXX 784
             LA                          S  VGAIKREVG VGVRRQ  ++++      
Sbjct: 231  VLA-SNGAAVGLYSSSTDPVHVPSPDSRSSAPVGAIKREVGAVGVRRQLKDSSINQSSGP 289

Query: 783  XXXXXXXXXXXXSNQQSPSAAKSTQLGQ--XXXXXXXXXXXXXSRPFLGNQYSSKAHQ-V 613
                         +  S     ST  G+               SR  L NQ+SS+ HQ  
Sbjct: 290  SVSLANSVSERDGSSDSFQPMSSTSKGEQLSQITESVIPGLVGSRTSLNNQHSSRQHQPT 349

Query: 612  VGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXANLISPPSNHTDTKMESDHLQRKFSGLG 433
            +GHQKA+Q N EWKPKS+QK            +   +P     +   E+ ++Q K + + 
Sbjct: 350  MGHQKASQPNKEWKPKSSQKLSTGNPGVIGTPSKSKAPADESKELHSEAANVQEKLARVD 409

Query: 432  ITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSNGESSA--S 259
            + ENQHVII +H++VP+ D+  L FGSF  E DS+    S +Q     E+ NGESSA  S
Sbjct: 410  LHENQHVIIAEHIRVPDNDQYRLVFGSFGTESDSSGCLVSGLQAIRGPEELNGESSASQS 469

Query: 258  VSAPLTSNDDAN---EVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXXXRNLENYVDI 88
            VSA   S DDA+   +VDLL  Q          S  A+E            + L+ Y +I
Sbjct: 470  VSALEISTDDASGSRQVDLLDDQVRNSESNSPDSGTATELQSADKRESSSPQPLDTYAEI 529

Query: 87   GLVRNDSPSYTSVEPQQQQQDVSGLPSF 4
            GLVR+ +  YT   P  Q QD S L  F
Sbjct: 530  GLVRDRNLKYT---PAPQHQDPSELLGF 554


>ref|XP_004141213.1| PREDICTED: uncharacterized protein LOC101203238 [Cucumis sativus]
          Length = 740

 Score =  325 bits (834), Expect = 4e-86
 Identities = 226/568 (39%), Positives = 300/568 (52%), Gaps = 11/568 (1%)
 Frame = -3

Query: 1674 VVSGSRLDGGTTTPNMSARVRKTIQSIKEIVGNHSEADIYVTLKDSNMDPNETAQKLLNQ 1495
            +VSG R+DGGT    + ARVRKTIQSIKEIVGNHS+ADIY TLK++NMDPNETAQKLLNQ
Sbjct: 1    MVSGLRVDGGTHV--LPARVRKTIQSIKEIVGNHSDADIYTTLKETNMDPNETAQKLLNQ 58

Query: 1494 DPFHEVRRKRDKKKENISYKASAEPRKQTEHA-QAAKSQTFPDRNVRRGSFVRNALPGIS 1318
            DPF EV+R+RDKKKEN+ YK S + ++ +E   Q  K  T  DRNVRRG++ +++ PGIS
Sbjct: 59   DPFREVKRRRDKKKENVGYKGSLDAQRNSEDVRQGTKVYTLSDRNVRRGAYAKSSWPGIS 118

Query: 1317 REFRVVRDNRVNHNASRDINAASFQSSSANVEVIPHVPAK--SPKGILIDQDHLAARNSG 1144
            +EFRVVRDNRVN N++R++  AS   + +  EV  +V     +P+G        A   S 
Sbjct: 119  KEFRVVRDNRVNRNSNREVKPASSHLALSTNEVSTNVSKSVITPRG--------AHGGSF 170

Query: 1143 DHKPSHVVNRPSHSNPGLVQDAYSGGAHRKGLYEDAWAKVPSSASLTQGLKPRNFQPSSA 964
              + S V  R + S+P   +D +S G  +K L +D    + SS        P + +P S 
Sbjct: 171  GGRISQVSFRKTDSHPSNPRDGHSTGMAQKELRDDVGVSMLSSIPDMHIGNPNDSEPHSP 230

Query: 963  TLAXXXXXXXXXXXXXXXXXXXXXXXXXXSGAVGAIKREVGVVGVRRQPSENAVKHXXXX 784
             LA                          S  VGAIKREVG VGVRRQ  ++++      
Sbjct: 231  VLA-SNGAAVGLYSSSTDPVHVPSPDSRSSAPVGAIKREVGAVGVRRQLKDSSINQSSGP 289

Query: 783  XXXXXXXXXXXXSNQQSPSAAKSTQLGQ--XXXXXXXXXXXXXSRPFLGNQYSSKAHQ-V 613
                         +  S     ST  G+               SR  L NQ+SS+ HQ  
Sbjct: 290  SVSLANSVSERDGSSDSFQPMSSTSKGEQLSQITESVIPGLVGSRTSLNNQHSSRQHQPT 349

Query: 612  VGHQKAAQSNMEWKPKSTQKXXXXXXXXXXXXANLISPPSNHTDTKMESDHLQRKFSGLG 433
            +GHQKA+Q N EWKPKS+QK            +   +P     +   E+ ++Q K + + 
Sbjct: 350  MGHQKASQPNKEWKPKSSQKLSTGNPGVIGTPSKSKAPADESKELHSEAANVQEKLARVD 409

Query: 432  ITENQHVIIPQHLQVPEADRTWLTFGSFDVEFDSNKNFASRIQGHGIAEQSNGESSA--S 259
            + ENQHVII +H++VP+ D+  L FGSF  E DS+    S +Q     E+ NGESSA  S
Sbjct: 410  LHENQHVIIAEHIRVPDNDQYRLVFGSFGTESDSSGCLVSGLQAIRGPEELNGESSASQS 469

Query: 258  VSAPLTSNDDAN---EVDLLHGQXXXXXXXXXXSDAASEHXXXXXXXXXXXRNLENYVDI 88
            VSA   S DDA+   +VDLL  Q          S  A+E            + L+ Y +I
Sbjct: 470  VSALEISTDDASGSRQVDLLDDQVRNSESNSPDSGTATELQSADKRESSSPQPLDTYAEI 529

Query: 87   GLVRNDSPSYTSVEPQQQQQDVSGLPSF 4
            GLVR+ +  YT   P  Q QD S L  F
Sbjct: 530  GLVRDRNLKYT---PAPQHQDPSELLGF 554

