BLASTX nr result

ID: Mentha27_contig00005462 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00005462
         (2650 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU21338.1| hypothetical protein MIMGU_mgv1a001286mg [Mimulus...   880   0.0  
ref|XP_002274314.2| PREDICTED: uncharacterized protein LOC100248...   713   0.0  
ref|XP_002521347.1| conserved hypothetical protein [Ricinus comm...   712   0.0  
ref|XP_007024587.1| Uncharacterized protein isoform 4 [Theobroma...   690   0.0  
gb|EXB29673.1| hypothetical protein L484_013447 [Morus notabilis]     687   0.0  
ref|XP_007024585.1| Uncharacterized protein isoform 2 [Theobroma...   682   0.0  
emb|CAN69468.1| hypothetical protein VITISV_042555 [Vitis vinifera]   678   0.0  
ref|XP_007024589.1| Uncharacterized protein isoform 6 [Theobroma...   676   0.0  
ref|XP_006426626.1| hypothetical protein CICLE_v10024871mg [Citr...   672   0.0  
ref|XP_006426627.1| hypothetical protein CICLE_v10024871mg [Citr...   668   0.0  
ref|XP_007024588.1| Uncharacterized protein isoform 5 [Theobroma...   668   0.0  
ref|XP_006465941.1| PREDICTED: dentin sialophosphoprotein-like [...   665   0.0  
ref|XP_007024584.1| Uncharacterized protein isoform 1 [Theobroma...   664   0.0  
emb|CBI35892.3| unnamed protein product [Vitis vinifera]              663   0.0  
ref|XP_003528451.1| PREDICTED: dentin sialophosphoprotein-like i...   653   0.0  
ref|XP_006583148.1| PREDICTED: dentin sialophosphoprotein-like i...   650   0.0  
ref|XP_002299597.2| hydroxyproline-rich glycoprotein [Populus tr...   650   0.0  
ref|XP_007135474.1| hypothetical protein PHAVU_010G132600g [Phas...   649   0.0  
ref|XP_006361347.1| PREDICTED: dentin sialophosphoprotein-like i...   630   e-177
ref|XP_006598817.1| PREDICTED: putative uncharacterized protein ...   628   e-177

>gb|EYU21338.1| hypothetical protein MIMGU_mgv1a001286mg [Mimulus guttatus]
          Length = 847

 Score =  880 bits (2274), Expect = 0.0
 Identities = 488/808 (60%), Positives = 568/808 (70%), Gaps = 11/808 (1%)
 Frame = -1

Query: 2416 SGSRIDGGAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDPFH 2237
            SGSR +GG  V+SAGVRRTIQSIKEIVGNHSDA+IY  L++TNMDPNETAQK+LNQDPFH
Sbjct: 4    SGSRTEGGPLVISAGVRRTIQSIKEIVGNHSDADIYAVLQDTNMDPNETAQKMLNQDPFH 63

Query: 2236 EVKRRRDRKKEIPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNAP--SGVS 2063
            EVKR+RDRK+EI   KS+   +PKK+ D A MPVK+++YSDR++RR   +RNA   +G +
Sbjct: 64   EVKRKRDRKREISGYKSYIAADPKKSADPAHMPVKFSAYSDRNTRRGVSTRNAAPDAGAN 123

Query: 2062 QEFRVVRDNRVNQNSITDSKPGLNXXXXXXXXXXXXSMKSD----SGHQEH-VGQHSSQP 1898
            +EFRVVRDNRVNQN+ TD KP               SMKS     S HQE   GQ S+Q 
Sbjct: 124  REFRVVRDNRVNQNAGTDLKP---VQSSNSTSEDTVSMKSTLTGISEHQEPPAGQRSTQA 180

Query: 1897 IKSSADSQQRQSKDAASVGNDRKEMVGEKRFPVPSATSRVQIKAXXXXXXXXXXXXXXXX 1718
            +   AD Q  Q K A   GND+KEM  EKR    +  SR   KA                
Sbjct: 181  LNRPADLQAPQIKIAKQSGNDKKEMPSEKRLASTNVNSRSHKKANGPQPHSTNSSTSSVV 240

Query: 1717 XXXXXXSDPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXXXXXXXXSG 1538
                  SDPVHVP+  +RPAA+VGAIRREVGVVGPRRQSS+                  G
Sbjct: 241  GVYSSSSDPVHVPA--ARPAASVGAIRREVGVVGPRRQSSDNSAKPSSQNISLPNTQS-G 297

Query: 1537 RDGQSRES-RPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQLNGHQKASQ 1361
            R+  SR+S RP +A  K +QS Q+VA +SA+P LP +RSFSSN YGSR HQL GHQKA Q
Sbjct: 298  RESHSRDSARPLSALPKTDQSVQDVAPESAMPGLPANRSFSSNQYGSRQHQLMGHQKAPQ 357

Query: 1360 PNKEWKPKASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEAAVLQDNMSQLNISENQNV 1181
            PNKEWKPK+SVKP A GPGVIG+PAKT+SPPA N E  +KEAA +QD+MS+LN+SENQNV
Sbjct: 358  PNKEWKPKSSVKPIANGPGVIGTPAKTISPPAVNPEDLKKEAAQMQDSMSRLNLSENQNV 417

Query: 1180 IIAPHIRVSETDRCRLTFGSLGADFDTSANSVGVSTNGVEDLSTDPSGSVSASAAETSGD 1001
            IIA HIRVSETDRCRLTFGSLGA+ D S NSV +S +G E++S +PSGS+S S  E+S D
Sbjct: 418  IIAAHIRVSETDRCRLTFGSLGAELDGSTNSVSMSADGAEEVSAEPSGSISVSVPESSPD 477

Query: 1000 EPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR--TEKKDSTSSQNLNDYADVVLVQGNSPS 827
            +   S+Q+E M                  D   TEKK+ ++ +NL +YA V LV+ NSPS
Sbjct: 478  DLGGSRQVETMDDSVRSSESNSPDSGAVSDHTLTEKKEPSNPKNLENYAAVGLVRVNSPS 537

Query: 826  YTTDSLQQHDTSELPSFSGYDPQMAYEMSYFRPVADETGRGPGLPSSQEYTQVLSVHTSN 647
            YT + LQQ D SELP+FSGYDPQM Y++SYFRP+ DET RG GLPSSQE   +LSVHTSN
Sbjct: 538  YTPELLQQQDASELPTFSGYDPQMGYDISYFRPIVDETFRGSGLPSSQE---ILSVHTSN 594

Query: 646  ALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYPQLHVSHFANMMPYRQF 467
            A+PAS++AMV                            LAQMYPQLHVSHFAN+MPYRQF
Sbjct: 595  AMPASTMAMV-------------------QQQQQQQQQLAQMYPQLHVSHFANLMPYRQF 635

Query: 466  LSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGVKYGIQQFKPVPTGSPT 287
            LS         PGYSNSPAYPHPSNGSSY+LMPGNS+     GVKYGIQQFKPVP GS T
Sbjct: 636  LSPVYVPPMPVPGYSNSPAYPHPSNGSSYVLMPGNSNFQASSGVKYGIQQFKPVPAGSAT 695

Query: 286  GFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKDSLYVPNPQAETSEIWMNPRDVSGMQ 107
            GFGNFTN +GYAI+ PG+V S  GHDDSSRLKYKD+LYVPNPQAETSEIWMNPRD+SGMQ
Sbjct: 696  GFGNFTNQSGYAISTPGIVASAPGHDDSSRLKYKDNLYVPNPQAETSEIWMNPRDLSGMQ 755

Query: 106  S-SYYNMPGQSPHPTAYLTSHSGHASFN 26
            S SYYNMPGQ+PHP AYLTSHSGHASFN
Sbjct: 756  SASYYNMPGQTPHP-AYLTSHSGHASFN 782


>ref|XP_002274314.2| PREDICTED: uncharacterized protein LOC100248075 [Vitis vinifera]
          Length = 860

 Score =  713 bits (1840), Expect = 0.0
 Identities = 422/831 (50%), Positives = 519/831 (62%), Gaps = 24/831 (2%)
 Frame = -1

Query: 2422 MVSGSRIDGGAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDP 2243
            MVSGSR++GG Q+L A VR+TIQSIKEIVGNHSDA+IYV L+ETNMDPNET QKLL QDP
Sbjct: 1    MVSGSRMEGGTQILPARVRKTIQSIKEIVGNHSDADIYVTLRETNMDPNETTQKLLYQDP 60

Query: 2242 FHEVKRRRDRKKEIPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNA----- 2078
            FHEVKR+RD+KKE    K  T  EP+   +      K+ S+ DR+ RR G SR+      
Sbjct: 61   FHEVKRKRDKKKESTGYKRPT--EPRIYIENVGQG-KFRSFPDRNVRRGGYSRSTLMVRI 117

Query: 2077 --PSGVSQEFRVVRDNRVNQNSITDSKP-------GLNXXXXXXXXXXXXSMKSDSGHQE 1925
               +G+ +EFRVVRDNRVNQN+  D KP        +N            S  + +  + 
Sbjct: 118  LLDAGIGREFRVVRDNRVNQNTNRDMKPVSPQLATSVNEQVISNISEKGNSTGTSNNQKP 177

Query: 1924 HVGQHSSQPIKSSADSQQRQSKDAASVGNDRKEMVGEKRFPVPSATSRVQIKAXXXXXXX 1745
              G+ SSQ +    D++    +DA S G++RKE++ E++  +P+A SRVQ          
Sbjct: 178  SSGRQSSQSLNGPTDARPGIPQDANSSGSNRKELLEERQATIPNAVSRVQAVKPNDSQPY 237

Query: 1744 XXXXXXXXXXXXXXXS--DPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXX 1571
                           S  DPVHVPS  SR +A VGAI+REVGVVG RRQS+E        
Sbjct: 238  SASLASNSSVVGVYSSSSDPVHVPSPDSRSSAIVGAIKREVGVVGVRRQSTENSVKHSSA 297

Query: 1570 XXXXXXXXXSGRDGQ--SRESRPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSR 1397
                      GR+    +   RPFNA  K++Q  Q    D  IP +P +RSF  N YGSR
Sbjct: 298  PSSSLPSSLLGRENSPSTEPFRPFNAIPKSDQPRQTTVPDHVIPSMPVNRSFLGNQYGSR 357

Query: 1396 PHQLN-GHQKASQPNKEWKPKASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEAAVLQD 1220
            PHQ   GHQKA QPNKEWKPK+S K S   PGVIG+PAK+VSP A N++  + E A LQD
Sbjct: 358  PHQQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQD 417

Query: 1219 NMSQLNISENQNVIIAPHIRVSETDRCRLTFGSLGADFDTSANSVGVSTNGVEDLSTDPS 1040
             +SQ +ISENQNVIIA HIRV ETDRCRLTFGS GADF +   +VG      ++ S +PS
Sbjct: 418  KLSQASISENQNVIIAQHIRVPETDRCRLTFGSFGADFASGFQAVG----NADEPSAEPS 473

Query: 1039 GSVSASAAETSGDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR-TEKKDSTSSQNLNDY 863
             S+S S  E+S D+   SKQ+++                    +  +KK+S+S QNL +Y
Sbjct: 474  ASLSVSPPESSSDD--GSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSSPQNLENY 531

Query: 862  ADVVLVQGNSPSYTTDSLQQHDTSELPSF-SGYDPQMAYEMSYFRPVADETGRGPGLPSS 686
            AD+ LV+ +SPSYT +S QQ +   LPSF   YDPQ  Y++ YFRP  DET RG GLPS 
Sbjct: 532  ADIGLVRESSPSYTPESQQQQERHVLPSFPHAYDPQAGYDIPYFRPTMDETVRGQGLPSP 591

Query: 685  QEYTQVLSVHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYPQLH 506
            QE    L+ HT+N++PASSIAMV                            + QMY Q+H
Sbjct: 592  QE---ALASHTANSIPASSIAMV--------------------QQQQQQPPVPQMYQQVH 628

Query: 505  VSHFANMMPYRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGVKYG 326
            V HFAN+MPYRQFLS         PGYS++PAY HPSN +SYLLMPG SSHL   G+KYG
Sbjct: 629  VPHFANLMPYRQFLSPVYVPPMAMPGYSSNPAYSHPSNANSYLLMPGGSSHLGANGLKYG 688

Query: 325  IQQFKPVPTGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKD-SLYVPNPQAET 149
            IQQ KPVP GSPTGFGNFTNP GYAINAPGVV S  G +DSSRLKYKD ++YVPNPQAET
Sbjct: 689  IQQLKPVPAGSPTGFGNFTNPTGYAINAPGVVGSATGLEDSSRLKYKDGNIYVPNPQAET 748

Query: 148  SEIWM-NPRDVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFNXXXXXAQS 2
            SEIW+ NPR++ G+QS+ YYNMP Q+PH  AY+ SH+GHASFN     AQS
Sbjct: 749  SEIWIQNPRELPGLQSAPYYNMPAQTPH-AAYMPSHTGHASFNAAAAAAQS 798


>ref|XP_002521347.1| conserved hypothetical protein [Ricinus communis]
            gi|223539425|gb|EEF41015.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 864

 Score =  712 bits (1839), Expect = 0.0
 Identities = 413/803 (51%), Positives = 513/803 (63%), Gaps = 17/803 (2%)
 Frame = -1

Query: 2383 LSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDPFHEVKRRRDRKKE 2204
            LSA VR+TIQSIKEIVGN SDA+IY+ALKETNMDPNETAQKLLNQDPFHEVKR+RD+KKE
Sbjct: 21   LSATVRKTIQSIKEIVGNFSDADIYMALKETNMDPNETAQKLLNQDPFHEVKRKRDKKKE 80

Query: 2203 IPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNAP---SGVSQEFRVVRDNR 2033
                +   +++ +KN +      K+ ++SDR++R+ G  R A    +G+++EFRVVRDNR
Sbjct: 81   SMAYRG--SLDSRKNPENMGQGTKFRTFSDRNTRQGGYIRAAVPGNAGINREFRVVRDNR 138

Query: 2032 VNQNSITDSKPGLNXXXXXXXXXXXXSM-----KSDSGHQEHVG-QHSSQPIKSSADSQQ 1871
            VN N+  + KP +             ++        SG+ +H G + SSQ      DSQ 
Sbjct: 139  VNLNTTREPKPAMQQGSISSDELGISTVTEKGSSGSSGNVKHSGVRSSSQASNGPPDSQS 198

Query: 1870 RQSKDAASVGNDRKEMVGEKRFPVPSATSRVQI-KAXXXXXXXXXXXXXXXXXXXXXXSD 1694
            R ++DA S   DRK M  EKR  VPSA SR+Q+ K                        D
Sbjct: 199  RHTRDATSNFTDRKAMTEEKRAVVPSAASRIQVMKPSSQHHSATLASSNSVVGVYSSSMD 258

Query: 1693 PVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXXXXXXXXSGRDGQSRES 1514
            PVHVPS  SR +A VGAI+REVGVVG RRQSSE                  GRDG   ES
Sbjct: 259  PVHVPSPESRSSAAVGAIKREVGVVGGRRQSSENAVKNSSASSSSFSNSVLGRDGSLPES 318

Query: 1513 -RPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQLN-GHQKASQPNKEWKP 1340
             +PF   SKN+Q ++ VAT+SA+P +   RSF  N Y SR HQ   GHQKA+Q NKEWKP
Sbjct: 319  FQPFPTISKNDQVNEPVATESAMPSISVGRSFLGNQY-SRTHQTAVGHQKATQHNKEWKP 377

Query: 1339 KASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEAAVLQDNMSQLNISENQNVIIAPHIR 1160
            K+S K S   PGVIG+P K+ SPPA N++  + +A  +Q+ + ++NI ENQNVIIA HIR
Sbjct: 378  KSSQKASVGSPGVIGTPTKSSSPPAGNSKDLESDATDMQEKLLRVNIYENQNVIIAQHIR 437

Query: 1159 VSETDRCRLTFGSLGADFDTSAN-SVGVSTNGV-EDLSTDPSGSVSASAAETSGDEPVSS 986
            V ETDRCRLTFGS G +FD+S N   G    GV +D   + + S+SASA E+S D+   +
Sbjct: 438  VPETDRCRLTFGSFGVEFDSSRNMPSGFQAAGVTKDSKAESAASLSASAPESSSDDASGN 497

Query: 985  KQLEMMXXXXXXXXXXXXXXXXXXDRTEKKDSTSSQNLNDYADVVLVQGNSPSYTTDSLQ 806
            KQ+E++                  +      S+S  NL++YAD+ LV+ +SP  +++S  
Sbjct: 498  KQVELLDEQVRNSGSDSPASGAVSEHQSPDKSSSPPNLDNYADIGLVRDSSPFTSSESQH 557

Query: 805  QHDTSELPSFSGYDPQMAYEMSYFRPVADETGRGPGLPSSQEYTQVLSVHTSNALPASSI 626
            Q D  ELPSFS YDPQ  Y+MSYFRP  DET RG GL S+QE    L  H  +++PASSI
Sbjct: 558  QQDPPELPSFSAYDPQTVYDMSYFRPQIDETVRGQGLQSAQE---ALISHRVDSMPASSI 614

Query: 625  AMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYPQLHVSHFANMMPYRQFLSXXXXX 446
             MV                            +AQMYPQ+HVSH+ N+MPYRQFLS     
Sbjct: 615  PMV---------------------QQQQQPPIAQMYPQVHVSHYTNLMPYRQFLSPVYVP 653

Query: 445  XXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGVKYGIQQFKPVPTGSPTGFGNFTN 266
                PGYS++PAYPHPSNGSSYLLMPG SSHL+  G+KYGIQQFKPVP  SPTGFGNFT+
Sbjct: 654  QMAMPGYSSNPAYPHPSNGSSYLLMPGGSSHLSANGLKYGIQQFKPVPGSSPTGFGNFTS 713

Query: 265  PAGYAINAPGVVPSTVGHDDSSRLKYKD-SLYVPNPQAETSEIWM-NPRDVSGMQSS-YY 95
            P GYAINAPGVV S  G +DSSR+KYKD +LYVPNPQAETSEIW+ NPR++ G+QS+ YY
Sbjct: 714  PTGYAINAPGVVGSATGLEDSSRMKYKDGNLYVPNPQAETSEIWVQNPRELPGLQSAPYY 773

Query: 94   NMPGQSPHPTAYLTSHSGHASFN 26
            NMPGQSPH  AYL SH+GHASFN
Sbjct: 774  NMPGQSPH-AAYLPSHTGHASFN 795


>ref|XP_007024587.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508779953|gb|EOY27209.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 849

 Score =  690 bits (1780), Expect = 0.0
 Identities = 406/816 (49%), Positives = 511/816 (62%), Gaps = 17/816 (2%)
 Frame = -1

Query: 2422 MVSGSRIDGGAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDP 2243
            MV+G+RI+G    +SA VR+TIQSIKEIVGNHSDA+IYVALKE NMDPNET QKLL+QD 
Sbjct: 1    MVNGARIEGD---ISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDT 57

Query: 2242 FHEVKRRRDRKKEIPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNAPSGVS 2063
            FHEV+R+RDRKKE    K   +++ +K ++     +K+  Y +R SRR   +RN   GV+
Sbjct: 58   FHEVRRKRDRKKESIEYK--VSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPGVN 115

Query: 2062 QEFRVVRDNRVNQNSITDSKPGLNXXXXXXXXXXXXSM--KSDSGHQEHVGQHSSQPIKS 1889
            +EFRVVRDNRVNQN+  D K   +            ++  K  +G   +    SS+ +  
Sbjct: 116  REFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRSLSQ 175

Query: 1888 SAD----SQQRQSKDAASVGNDRKEMVGEKRFPVPSATSRVQIKAXXXXXXXXXXXXXXX 1721
            +++    SQ R ++DA S G DRKE+  EKR  +P+A  R Q                  
Sbjct: 176  TSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQSSSS 235

Query: 1720 XXXXXXXS--DPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXXXXXXX 1547
                   S  DPVHVPS  SR +  VGAI+REVGVVG RRQ SE                
Sbjct: 236  SVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNS 295

Query: 1546 XSGRDGQSRESRPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQLN-GHQK 1370
              GRD  S   R F + S+ +Q S   AT+S +P +  SRSF SN YGSR +Q   GHQK
Sbjct: 296  LVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGHQK 355

Query: 1369 ASQPNKEWKPKASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEAAVLQDNMSQLNISEN 1190
            A+Q NKEWKPK S K S   PGVIG+P K+ SPPA + +G   E A LQD  SQ+NI EN
Sbjct: 356  ANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYEN 415

Query: 1189 QNVIIAPHIRVSETDRCRLTFGSLGADFDTSANSV-GVSTNGV-EDLSTDPSGSVSASAA 1016
            +NVIIA HIRV E DRCRLTFGS G +FD+  N V G    GV ED + + + S+S SA 
Sbjct: 416  ENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASLSVSAP 475

Query: 1015 ETSGDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR--TEKKDSTSSQNLNDYADVVLVQ 842
            +TS D+    K +E++                  +    + KD++S QNL+ YAD+ LVQ
Sbjct: 476  DTSSDDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQ 535

Query: 841  GNSPSYT-TDSLQQHDTSELPSFSGYDPQMAYEMSYFRPVADETGRGPGLPSSQEYTQVL 665
             NSPSY  ++S +Q D  ELPSFS YDPQ  Y++ YFRP  DET RG GLPS QE    L
Sbjct: 536  DNSPSYAPSESQKQQDPPELPSFSAYDPQTGYDLPYFRPPIDETARGQGLPSPQE---AL 592

Query: 664  SVHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYPQLHVSHFANM 485
            S HT+N +PAS+I M+                            +AQMYPQ+HVSHFAN+
Sbjct: 593  SAHTAN-VPASTIPMM----------------------QQQQPPVAQMYPQVHVSHFANI 629

Query: 484  MPYRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGVKYGIQQFKPV 305
            MPYRQF+S         PGYS++PAYPHPSNGSSY+LMPG SSHL   G+KYGIQQFKPV
Sbjct: 630  MPYRQFVSPIYLPQMAMPGYSSNPAYPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPV 689

Query: 304  PTGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKD-SLYVPNPQAETSEIWM-N 131
            P GSPTGFGNFT+P+GYAINAPGVV +  G +DSSR+KYKD ++YVPN QA+TS++W+ N
Sbjct: 690  PAGSPTGFGNFTSPSGYAINAPGVVGNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQN 749

Query: 130  PRDVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFN 26
            PR++ G+QS+ YYNMP Q+PH   Y+ SH+GHASFN
Sbjct: 750  PRELPGLQSAPYYNMP-QTPH--GYMPSHTGHASFN 782


>gb|EXB29673.1| hypothetical protein L484_013447 [Morus notabilis]
          Length = 854

 Score =  687 bits (1772), Expect = 0.0
 Identities = 407/820 (49%), Positives = 515/820 (62%), Gaps = 21/820 (2%)
 Frame = -1

Query: 2422 MVSGSRIDGGAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDP 2243
            MVS SRIDGG Q+LSAGVR+TIQSIKEIVGNHSD +IY+ALKETNMDPNETAQKLLNQDP
Sbjct: 1    MVSASRIDGGPQILSAGVRKTIQSIKEIVGNHSDIDIYLALKETNMDPNETAQKLLNQDP 60

Query: 2242 FHEVKRRRDRKKEIPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNA----- 2078
            FHEV+R+RD+KKE     S T  +P+ ++++     K N++SDR++RR G +RN+     
Sbjct: 61   FHEVRRKRDKKKESAGNDSST--DPRGHSEVKGQGSKVNTFSDRNARRGGYARNSLPDRI 118

Query: 2077 --PSGVSQEFRVVRDNRVNQNSITDSKPGL---NXXXXXXXXXXXXSMKSDSGHQEHVGQ 1913
               +GVS+EFRVVRDNRVN++   ++KP                  S  S +  +    +
Sbjct: 119  MLHAGVSREFRVVRDNRVNRSLNREAKPASASPTPPSTFENISGKGSTGSSNSEKPTASK 178

Query: 1912 HSSQPIKSSADSQQRQSKDAASVGNDRKEMVGEKRFPVPSATSRVQI-KAXXXXXXXXXX 1736
            +SSQ +   +DS  R + D  S G  RKE+  EKR    S  SRVQ  KA          
Sbjct: 179  NSSQGLYGPSDSHLRIAHDIESTGLVRKEVSEEKRVTFSSVASRVQAGKANNARSQSAMV 238

Query: 1735 XXXXXXXXXXXXS-DPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXXX 1559
                        S DPVHVPS  SR + +VGAI+REVGVVG RRQSS+            
Sbjct: 239  ASSSSAIGVYSSSTDPVHVPSPDSRSSGSVGAIKREVGVVGVRRQSSDNSKSSVPSSSFS 298

Query: 1558 XXXXXSGRDGQSRESRPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPH--QL 1385
                  G +G +   + F+  SKN++  Q  A++S +P +  SRS  S+ Y +R    Q 
Sbjct: 299  NSLL--GGEGSAETLQSFSTISKNDEVGQ--ASESILPSVSVSRSLLSSHYSNRQQHQQP 354

Query: 1384 NGHQKASQPNKEWKPKASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEAAVLQDNMSQL 1205
             GHQKASQPNKEWKPK+S KPS   PGVIG+P K+VSPPAHN+E ++ E A + + +S++
Sbjct: 355  VGHQKASQPNKEWKPKSSQKPSLNNPGVIGTPTKSVSPPAHNSEVSESEPAKVLEKLSRV 414

Query: 1204 NISENQNVIIAPHIRVSETDRCRLTFGSLGADFDTSANSVGVSTNGVEDLSTDPSGSVSA 1025
            NI ENQNVIIA HIRV ETDRCRLTFGS G +F++ ++ V     G    S   + S S 
Sbjct: 415  NIHENQNVIIAQHIRVPETDRCRLTFGSFGKEFESDSDLVNGYQAGAIGESNGEAAS-SL 473

Query: 1024 SAAETSGDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR--TEKKDSTSSQNLNDYADVV 851
            SA E+S  +   SKQ+++                   +    +KK+STS QNL++YAD+ 
Sbjct: 474  SAPESSIGDASGSKQVDLTDEQIRNSGSDSPTSGGTSENQFPDKKESTSPQNLDNYADIG 533

Query: 850  LVQGNSPSYTTDSLQQHDTSELPSFSGYDPQMAYEMSYFRPVA--DETGRGPGLPSSQEY 677
            LVQGNSPSY     QQ +  ELP FS YD Q  Y+  YFRP +  DE  RG GLP+ QE 
Sbjct: 534  LVQGNSPSYAPADSQQPEHPELPGFSAYDSQTGYDFPYFRPASATDEAMRGQGLPTPQE- 592

Query: 676  TQVLSVHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYPQLHVSH 497
                S H +N++P ++I+MV                            +AQMYPQ+HVSH
Sbjct: 593  --AFSSHNTNSVP-TTISMV---------------------QQQQQPPVAQMYPQVHVSH 628

Query: 496  FANMMPYRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGVKYGIQQ 317
            FAN+MPYRQFLS         PGYS+SPAYPHPSNG+SYLLMPG  +HL    +KYG+QQ
Sbjct: 629  FANLMPYRQFLSPVYVPPMAMPGYSSSPAYPHPSNGNSYLLMPGGGTHLNANSLKYGVQQ 688

Query: 316  FKPVPTGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKD-SLYVPNPQAETSEI 140
            FKPVP G+PTGFGNF+NP GYAIN PGVV    G +DSSR+KYKD +LYVPNPQAETSE+
Sbjct: 689  FKPVPAGNPTGFGNFSNPNGYAINTPGVVGGATGLEDSSRIKYKDGNLYVPNPQAETSEM 748

Query: 139  WM-NPRDVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFN 26
            W+ NPR++ G+QS+ YYNMPGQSPH  AYL SH+GHAS+N
Sbjct: 749  WIQNPRELPGLQSTPYYNMPGQSPH-AAYLPSHTGHASYN 787


>ref|XP_007024585.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508779951|gb|EOY27207.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 852

 Score =  682 bits (1759), Expect = 0.0
 Identities = 406/819 (49%), Positives = 512/819 (62%), Gaps = 20/819 (2%)
 Frame = -1

Query: 2422 MVSGSRIDGGAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDP 2243
            MV+G+RI+G    +SA VR+TIQSIKEIVGNHSDA+IYVALKE NMDPNET QKLL+QD 
Sbjct: 1    MVNGARIEGD---ISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDT 57

Query: 2242 FHEVKRRRDRKKEIPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNA--PSG 2069
            FHEV+R+RDRKKE    K   +++ +K ++     +K+  Y +R SRR   +RN    +G
Sbjct: 58   FHEVRRKRDRKKESIEYK--VSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPDAG 115

Query: 2068 VSQEFRVVRDNRVNQNSITDSKPGLNXXXXXXXXXXXXSM--KSDSGHQEHVGQHSSQPI 1895
            V++EFRVVRDNRVNQN+  D K   +            ++  K  +G   +    SS+ +
Sbjct: 116  VNREFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRSL 175

Query: 1894 KSSAD----SQQRQSKDAASVGNDRKEMVGEKRFPVPSATSRVQIKAXXXXXXXXXXXXX 1727
              +++    SQ R ++DA S G DRKE+  EKR  +P+A  R Q                
Sbjct: 176  SQTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQSS 235

Query: 1726 XXXXXXXXXS--DPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXXXXX 1553
                     S  DPVHVPS  SR +  VGAI+REVGVVG RRQ SE              
Sbjct: 236  SSSVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLS 295

Query: 1552 XXXSGRDGQSRESRPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQLN-GH 1376
                GRD  S   R F + S+ +Q S   AT+S +P +  SRSF SN YGSR +Q   GH
Sbjct: 296  NSLVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGH 355

Query: 1375 QKASQPNKEWKPKASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEAAVLQDNMSQLNIS 1196
            QKA+Q NKEWKPK S K S   PGVIG+P K+ SPPA + +G   E A LQD  SQ+NI 
Sbjct: 356  QKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIY 415

Query: 1195 ENQNVIIAPHIRVSETDRCRLTFGSLGADFDTSANSV-GVSTNGV-EDLSTDPSGSVSAS 1022
            EN+NVIIA HIRV E DRCRLTFGS G +FD+  N V G    GV ED + + + S+S S
Sbjct: 416  ENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASLSVS 475

Query: 1021 AAETSGDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR--TEKKDSTSSQNLNDYADVVL 848
            A +TS D+    K +E++                  +    + KD++S QNL+ YAD+ L
Sbjct: 476  APDTSSDDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGL 535

Query: 847  VQGNSPSYT-TDSLQQHDTSELPSFS-GYDPQMAYEMSYFRPVADETGRGPGLPSSQEYT 674
            VQ NSPSY  ++S +Q D  ELPSFS  YDPQ  Y++ YFRP  DET RG GLPS QE  
Sbjct: 536  VQDNSPSYAPSESQKQQDPPELPSFSQAYDPQTGYDLPYFRPPIDETARGQGLPSPQE-- 593

Query: 673  QVLSVHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYPQLHVSHF 494
              LS HT+N +PAS+I M+                            +AQMYPQ+HVSHF
Sbjct: 594  -ALSAHTAN-VPASTIPMM----------------------QQQQPPVAQMYPQVHVSHF 629

Query: 493  ANMMPYRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGVKYGIQQF 314
            AN+MPYRQF+S         PGYS++PAYPHPSNGSSY+LMPG SSHL   G+KYGIQQF
Sbjct: 630  ANIMPYRQFVSPIYLPQMAMPGYSSNPAYPHPSNGSSYVLMPGGSSHLNANGLKYGIQQF 689

Query: 313  KPVPTGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKD-SLYVPNPQAETSEIW 137
            KPVP GSPTGFGNFT+P+GYAINAPGVV +  G +DSSR+KYKD ++YVPN QA+TS++W
Sbjct: 690  KPVPAGSPTGFGNFTSPSGYAINAPGVVGNPTGLEDSSRIKYKDGNIYVPNQQADTSDLW 749

Query: 136  M-NPRDVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFN 26
            + NPR++ G+QS+ YYNMP Q+PH   Y+ SH+GHASFN
Sbjct: 750  IQNPRELPGLQSAPYYNMP-QTPH--GYMPSHTGHASFN 785


>emb|CAN69468.1| hypothetical protein VITISV_042555 [Vitis vinifera]
          Length = 914

 Score =  678 bits (1749), Expect = 0.0
 Identities = 418/885 (47%), Positives = 514/885 (58%), Gaps = 78/885 (8%)
 Frame = -1

Query: 2422 MVSGSRIDGGAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNET--------- 2270
            M   SR++GG Q+L   V +TIQ IKEIVGNHSDA+IYVAL+E NMDPNET         
Sbjct: 1    MAFDSRMEGGMQILPPQVHKTIQLIKEIVGNHSDADIYVALREMNMDPNETVQKLLNQDL 60

Query: 2269 ----------------AQKLLNQDPFHEVKRRRDRKKEIPVQKSFTTVEPKKNNDLARMP 2138
                            AQKLLNQDPFHEVKR+RD+KKE    K  T  EP+   +     
Sbjct: 61   DIHVMLREMNMDPNEVAQKLLNQDPFHEVKRKRDKKKESTGYKRPT--EPRIYIENVGQG 118

Query: 2137 VKYNSYSDRSSRRVGPSRNA------------------------------------PSGV 2066
             K+ S+ DR+ RR G SR+                                      +G+
Sbjct: 119  -KFRSFPDRNVRRGGYSRSTVPGNAKTYQFYHSFVLELLYLTVCFLLSELMVRILLDAGI 177

Query: 2065 SQEFRVVRDNRVNQNSITDSKP-------GLNXXXXXXXXXXXXSMKSDSGHQEHVGQHS 1907
             +EFRVVRDNRVNQN+  D KP         N            S  + +  +   G+ S
Sbjct: 178  GREFRVVRDNRVNQNTNRDMKPVSPQLATSANEQVISNISEKGNSTGTSNNQKPSSGRQS 237

Query: 1906 SQPIKSSADSQQRQSKDAASVGNDRKEMVGEKRFPVPSATSRVQIKAXXXXXXXXXXXXX 1727
            SQ +    D++    +DA S G++RKE++ E++  +P+A SRVQ                
Sbjct: 238  SQSLNGPTDARPGIPQDANSSGSNRKELLEERQATIPNAVSRVQAVKPNDSQPYSASLAS 297

Query: 1726 XXXXXXXXXS--DPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXXXXX 1553
                     S  DPVHVPS  SR +A VGAI+REVGVVG RRQS+E              
Sbjct: 298  NSSVVGVYSSSSDPVHVPSPDSRSSAIVGAIKREVGVVGVRRQSTENSVKHSSAPSSSLP 357

Query: 1552 XXXSGRDGQ--SRESRPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQLN- 1382
                GR+    +   RPFNA  K++Q  Q    D  IP +P +RSF  N YGSRPHQ   
Sbjct: 358  SSLLGRENSPSTEPFRPFNAIPKSDQPRQTTVPDHVIPSMPVNRSFLGNQYGSRPHQQPV 417

Query: 1381 GHQKASQPNKEWKPKASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEAAVLQDNMSQLN 1202
            GHQKA QPNKEWKPK+S K S   PGVIG+PAK+VSP A N++  + E A LQD +SQ +
Sbjct: 418  GHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQDKLSQAS 477

Query: 1201 ISENQNVIIAPHIRVSETDRCRLTFGSLGADFDTSANSVGVSTNGVEDLSTDPSGSVSAS 1022
            ISENQNVIIA HIRV ETDRCRLTFGS GADF +   +VG      ++ S +PS S+S S
Sbjct: 478  ISENQNVIIAQHIRVPETDRCRLTFGSFGADFASGFQAVG----NADEPSAEPSASLSVS 533

Query: 1021 AAETSGDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR-TEKKDSTSSQNLNDYADVVLV 845
              E+S D+   SKQ+++                    +  +KK+S+S QNL +YAD+ LV
Sbjct: 534  PPESSSDD--GSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSSPQNLENYADIGLV 591

Query: 844  QGNSPSYTTDSLQQHDTSELPSF-SGYDPQMAYEMSYFRPVADETGRGPGLPSSQEYTQV 668
            + +SPSYT +S QQ +   LPSF   YDPQ  Y++ YFRP  DET RG GLPS QE    
Sbjct: 592  RESSPSYTPESQQQQERHVLPSFPHAYDPQAGYDIPYFRPTMDETVRGQGLPSPQE---A 648

Query: 667  LSVHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYPQLHVSHFAN 488
            L+ HT+N++PASSIAMV                            + QMY Q+HV HFAN
Sbjct: 649  LASHTANSIPASSIAMV--------------------QQQQQQPPVPQMYQQVHVPHFAN 688

Query: 487  MMPYRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGVKYGIQQFKP 308
            +MPYRQFLS         PGYS++PAY HPSN +SYLLMPG SSHL   G+KYGIQQ KP
Sbjct: 689  LMPYRQFLSPVYVPPMAMPGYSSNPAYSHPSNANSYLLMPGGSSHLGANGLKYGIQQLKP 748

Query: 307  VPTGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKD-SLYVPNPQAETSEIWM- 134
            VP GSPTGFGNFTNP GYAINAPGVV S  G +DSSRLKYKD ++YVPNPQAETSEIW+ 
Sbjct: 749  VPAGSPTGFGNFTNPTGYAINAPGVVGSATGLEDSSRLKYKDGNIYVPNPQAETSEIWIQ 808

Query: 133  NPRDVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFNXXXXXAQS 2
            NPR++ G+QS+ YYNMP Q+PH  AY+ SH+GHASFN     AQS
Sbjct: 809  NPRELPGLQSAPYYNMPAQTPH-AAYMPSHTGHASFNAAAAAAQS 852


>ref|XP_007024589.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508779955|gb|EOY27211.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 839

 Score =  676 bits (1744), Expect = 0.0
 Identities = 402/815 (49%), Positives = 505/815 (61%), Gaps = 16/815 (1%)
 Frame = -1

Query: 2422 MVSGSRIDGGAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDP 2243
            MV+G+RI+G    +SA VR+TIQSIKEIVGNHSDA+IYVALKE NMDPNET QKLL+QD 
Sbjct: 1    MVNGARIEGD---ISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDT 57

Query: 2242 FHEVKRRRDRKKEIPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNAPSGVS 2063
            FHEV+R+RDRKKE    K   +++ +K ++     +K+  Y +R SRR   +RN   GV+
Sbjct: 58   FHEVRRKRDRKKESIEYK--VSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPGVN 115

Query: 2062 QEFRVVRDNRVNQNSITDSKPGLNXXXXXXXXXXXXSM--KSDSGHQEHVGQHSSQPIKS 1889
            +EFRVVRDNRVNQN+  D K   +            ++  K  +G   +    SS+ +  
Sbjct: 116  REFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRSLSQ 175

Query: 1888 SAD----SQQRQSKDAASVGNDRKEMVGEKRFPVPSATSRVQIKAXXXXXXXXXXXXXXX 1721
            +++    SQ R ++DA S G DRKE+  EKR  +P+A  R Q                  
Sbjct: 176  TSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQSSSS 235

Query: 1720 XXXXXXXS--DPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXXXXXXX 1547
                   S  DPVHVPS  SR +  VGAI+REVGVVG RRQ SE                
Sbjct: 236  SVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNS 295

Query: 1546 XSGRDGQSRESRPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQLN-GHQK 1370
              GRD  S   R F + S+ +Q S   AT+S +P +  SRSF SN YGSR +Q   GHQK
Sbjct: 296  LVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGHQK 355

Query: 1369 ASQPNKEWKPKASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEAAVLQDNMSQLNISEN 1190
            A+Q NKEWKPK S K S   PGVIG+P K+ SPPA + +G   E A LQD  SQ+NI EN
Sbjct: 356  ANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIYEN 415

Query: 1189 QNVIIAPHIRVSETDRCRLTFGSLGADFDTSANSV-GVSTNGVEDLSTDPSGSVSASAAE 1013
            +NVIIA HIRV E DRCRLTFGS G +FD+  N V G    GV +   D +G  +AS   
Sbjct: 416  ENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAE---DSNGESAAS--- 469

Query: 1012 TSGDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR--TEKKDSTSSQNLNDYADVVLVQG 839
               D+    K +E++                  +    + KD++S QNL+ YAD+ LVQ 
Sbjct: 470  ---DDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQD 526

Query: 838  NSPSYT-TDSLQQHDTSELPSFSGYDPQMAYEMSYFRPVADETGRGPGLPSSQEYTQVLS 662
            NSPSY  ++S +Q D  ELPSFS YDPQ  Y++ YFRP  DET RG GLPS QE    LS
Sbjct: 527  NSPSYAPSESQKQQDPPELPSFSAYDPQTGYDLPYFRPPIDETARGQGLPSPQE---ALS 583

Query: 661  VHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYPQLHVSHFANMM 482
             HT+N +PAS+I M+                            +AQMYPQ+HVSHFAN+M
Sbjct: 584  AHTAN-VPASTIPMM----------------------QQQQPPVAQMYPQVHVSHFANIM 620

Query: 481  PYRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGVKYGIQQFKPVP 302
            PYRQF+S         PGYS++PAYPHPSNGSSY+LMPG SSHL   G+KYGIQQFKPVP
Sbjct: 621  PYRQFVSPIYLPQMAMPGYSSNPAYPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVP 680

Query: 301  TGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKD-SLYVPNPQAETSEIWM-NP 128
             GSPTGFGNFT+P+GYAINAPGVV +  G +DSSR+KYKD ++YVPN QA+TS++W+ NP
Sbjct: 681  AGSPTGFGNFTSPSGYAINAPGVVGNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNP 740

Query: 127  RDVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFN 26
            R++ G+QS+ YYNMP Q+PH   Y+ SH+GHASFN
Sbjct: 741  RELPGLQSAPYYNMP-QTPH--GYMPSHTGHASFN 772


>ref|XP_006426626.1| hypothetical protein CICLE_v10024871mg [Citrus clementina]
            gi|557528616|gb|ESR39866.1| hypothetical protein
            CICLE_v10024871mg [Citrus clementina]
          Length = 866

 Score =  672 bits (1734), Expect = 0.0
 Identities = 393/814 (48%), Positives = 507/814 (62%), Gaps = 19/814 (2%)
 Frame = -1

Query: 2410 SRIDGGAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDPFHEV 2231
            +RI+GG Q+LSAG+R TIQ+IKEIVGNHSDA+IY  LK++NMDPNETAQKLLNQDPF EV
Sbjct: 17   TRIEGGTQILSAGMRNTIQTIKEIVGNHSDADIYFTLKDSNMDPNETAQKLLNQDPFLEV 76

Query: 2230 KRRRDRKKEIPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNA--PSGVSQE 2057
            KRRRD+KKE    KS    EP+KN+++    ++  +Y+DR++RR G +RNA   +G+++E
Sbjct: 77   KRRRDKKKENMSYKSLE--EPRKNSEIFGKTMRIRTYADRNARRRGYNRNALPDAGINRE 134

Query: 2056 FRVVRDNRVNQNSITDSKPGLNXXXXXXXXXXXXSMKSDS-----GHQEHVGQHS-SQPI 1895
            FRVVRDNRVN  +  ++K  L               +  S     G ++  G  S SQ  
Sbjct: 135  FRVVRDNRVNPEANQETKSPLPQSSISTNEKVTNVKEKGSPTGTTGSEKPSGGRSFSQAS 194

Query: 1894 KSSADSQQRQSKDAASVGNDRKEMVGEKRFPVPSATSRVQ-IKAXXXXXXXXXXXXXXXX 1718
              S +   R + D    G DR E   EK       TS V  I+                 
Sbjct: 195  NGSTNLHPRHAYDHNITGTDRIEPSAEK-----FTTSAVNFIQHNITEGYSATLASSNSV 249

Query: 1717 XXXXXXSDPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXXXXXXXXSG 1538
                   DPVHVPS  SR ++ VGAI+REVGVVG  RQ S+                  G
Sbjct: 250  GGYFSSKDPVHVPSPDSRASSAVGAIKREVGVVGGGRQCSDNAVKDSTAPCSSFSNSILG 309

Query: 1537 RDGQSRESRPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQLN-GHQKASQ 1361
            RD  S   RPF + SK +Q +Q  ATDS +  +P +R+  +N Y  R HQ + GHQKASQ
Sbjct: 310  RDN-SDSFRPFPSISKADQINQIAATDSGVAGMPANRALFTNQYTGRSHQQSVGHQKASQ 368

Query: 1360 PNKEWKPKASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEAAVLQDNMSQLNISENQNV 1181
             NKEWKPK+S K +  GPGVIG+P K+ SPP  +++  + + A LQD +S++NI ENQNV
Sbjct: 369  HNKEWKPKSSQKSNVIGPGVIGTPTKSPSPPVDDSKDLESDVAKLQDELSRVNIHENQNV 428

Query: 1180 IIAPHIRVSETDRCRLTFGSLGADFDTSAN--SVGVSTNGVEDLSTDPSGSVSASAAETS 1007
            IIA HIRV ETDRCRLTFGS G DF++S N  S  ++    E+ + + + S++ +A++TS
Sbjct: 429  IIAQHIRVPETDRCRLTFGSFGVDFESSRNLGSGFLAAGSAEESNGESAASLTGAASKTS 488

Query: 1006 GDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR---TEKKDSTSSQNLNDYADVVLVQGN 836
            G++    K ++++                  +     + KD++S Q+L+ YAD+ LV+  
Sbjct: 489  GNDVSGRKPVDILDDLVRNSGSNSPASGEASEHQLPDDIKDASSPQDLDGYADIGLVRDT 548

Query: 835  SPSY-TTDSLQQHDTSELPSFSGYDPQMAYEMSYFRPVADETGRGPGLPSSQEYTQVLSV 659
             PSY  ++S QQ D+SEL SF  YD Q  Y+MSYFRP  DE+ RG GLPS QE    L+ 
Sbjct: 549  DPSYPLSESQQQQDSSELASFPAYDSQTGYDMSYFRPTMDESVRGQGLPSPQE---ALAS 605

Query: 658  HTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYPQLHVSHFANMMP 479
            H++N++PASSIAM+                            +AQMYPQ+HVSHF NMMP
Sbjct: 606  HSANSIPASSIAML---------------------QHQQQPQMAQMYPQVHVSHFPNMMP 644

Query: 478  YRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGVKYGIQQFKPVPT 299
            YRQ +S         PGYS++PAYPHPSNGSSYLLMPG SSHL+  G+KYGIQQFKPVPT
Sbjct: 645  YRQIISPVYVPQMAMPGYSSNPAYPHPSNGSSYLLMPGGSSHLSTNGLKYGIQQFKPVPT 704

Query: 298  GSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKD-SLYVPNPQAETSEIWM-NPR 125
             SPTGFGNFT+PAGYAINAP VV S  G +DSSR+KYKD +LYV N QA+TSE+W+ NPR
Sbjct: 705  ASPTGFGNFTSPAGYAINAPSVVGSVTGLEDSSRMKYKDGNLYVSNQQADTSELWIHNPR 764

Query: 124  DVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFN 26
            ++ GMQS  YYNMP Q+PH  AYL SH+GHASFN
Sbjct: 765  ELPGMQSGPYYNMPAQTPHAAAYLPSHAGHASFN 798


>ref|XP_006426627.1| hypothetical protein CICLE_v10024871mg [Citrus clementina]
            gi|557528617|gb|ESR39867.1| hypothetical protein
            CICLE_v10024871mg [Citrus clementina]
          Length = 867

 Score =  668 bits (1723), Expect = 0.0
 Identities = 393/815 (48%), Positives = 507/815 (62%), Gaps = 20/815 (2%)
 Frame = -1

Query: 2410 SRIDGGAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDPFHEV 2231
            +RI+GG Q+LSAG+R TIQ+IKEIVGNHSDA+IY  LK++NMDPNETAQKLLNQDPF EV
Sbjct: 17   TRIEGGTQILSAGMRNTIQTIKEIVGNHSDADIYFTLKDSNMDPNETAQKLLNQDPFLEV 76

Query: 2230 KRRRDRKKEIPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNA--PSGVSQE 2057
            KRRRD+KKE    KS    EP+KN+++    ++  +Y+DR++RR G +RNA   +G+++E
Sbjct: 77   KRRRDKKKENMSYKSLE--EPRKNSEIFGKTMRIRTYADRNARRRGYNRNALPDAGINRE 134

Query: 2056 FRVVRDNRVNQNSITDSKPGLNXXXXXXXXXXXXSMKSDS-----GHQEHVGQHS-SQPI 1895
            FRVVRDNRVN  +  ++K  L               +  S     G ++  G  S SQ  
Sbjct: 135  FRVVRDNRVNPEANQETKSPLPQSSISTNEKVTNVKEKGSPTGTTGSEKPSGGRSFSQAS 194

Query: 1894 KSSADSQQRQSKDAASVGNDRKEMVGEKRFPVPSATSRVQ-IKAXXXXXXXXXXXXXXXX 1718
              S +   R + D    G DR E   EK       TS V  I+                 
Sbjct: 195  NGSTNLHPRHAYDHNITGTDRIEPSAEK-----FTTSAVNFIQHNITEGYSATLASSNSV 249

Query: 1717 XXXXXXSDPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXXXXXXXXSG 1538
                   DPVHVPS  SR ++ VGAI+REVGVVG  RQ S+                  G
Sbjct: 250  GGYFSSKDPVHVPSPDSRASSAVGAIKREVGVVGGGRQCSDNAVKDSTAPCSSFSNSILG 309

Query: 1537 RDGQSRESRPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQLN-GHQKASQ 1361
            RD  S   RPF + SK +Q +Q  ATDS +  +P +R+  +N Y  R HQ + GHQKASQ
Sbjct: 310  RDN-SDSFRPFPSISKADQINQIAATDSGVAGMPANRALFTNQYTGRSHQQSVGHQKASQ 368

Query: 1360 PNKEWKPKASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEAAVLQDNMSQLNISENQNV 1181
             NKEWKPK+S K +  GPGVIG+P K+ SPP  +++  + + A LQD +S++NI ENQNV
Sbjct: 369  HNKEWKPKSSQKSNVIGPGVIGTPTKSPSPPVDDSKDLESDVAKLQDELSRVNIHENQNV 428

Query: 1180 IIAPHIRVSETDRCRLTFGSLGADFDTSAN--SVGVSTNGVEDLSTDPSGSVSASAAETS 1007
            IIA HIRV ETDRCRLTFGS G DF++S N  S  ++    E+ + + + S++ +A++TS
Sbjct: 429  IIAQHIRVPETDRCRLTFGSFGVDFESSRNLGSGFLAAGSAEESNGESAASLTGAASKTS 488

Query: 1006 GDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR---TEKKDSTSSQNLNDYADVVLVQGN 836
            G++    K ++++                  +     + KD++S Q+L+ YAD+ LV+  
Sbjct: 489  GNDVSGRKPVDILDDLVRNSGSNSPASGEASEHQLPDDIKDASSPQDLDGYADIGLVRDT 548

Query: 835  SPSY-TTDSLQQHDTSELPSF-SGYDPQMAYEMSYFRPVADETGRGPGLPSSQEYTQVLS 662
             PSY  ++S QQ D+SEL SF   YD Q  Y+MSYFRP  DE+ RG GLPS QE    L+
Sbjct: 549  DPSYPLSESQQQQDSSELASFPQAYDSQTGYDMSYFRPTMDESVRGQGLPSPQE---ALA 605

Query: 661  VHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYPQLHVSHFANMM 482
             H++N++PASSIAM+                            +AQMYPQ+HVSHF NMM
Sbjct: 606  SHSANSIPASSIAML---------------------QHQQQPQMAQMYPQVHVSHFPNMM 644

Query: 481  PYRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGVKYGIQQFKPVP 302
            PYRQ +S         PGYS++PAYPHPSNGSSYLLMPG SSHL+  G+KYGIQQFKPVP
Sbjct: 645  PYRQIISPVYVPQMAMPGYSSNPAYPHPSNGSSYLLMPGGSSHLSTNGLKYGIQQFKPVP 704

Query: 301  TGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKD-SLYVPNPQAETSEIWM-NP 128
            T SPTGFGNFT+PAGYAINAP VV S  G +DSSR+KYKD +LYV N QA+TSE+W+ NP
Sbjct: 705  TASPTGFGNFTSPAGYAINAPSVVGSVTGLEDSSRMKYKDGNLYVSNQQADTSELWIHNP 764

Query: 127  RDVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFN 26
            R++ GMQS  YYNMP Q+PH  AYL SH+GHASFN
Sbjct: 765  RELPGMQSGPYYNMPAQTPHAAAYLPSHAGHASFN 799


>ref|XP_007024588.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508779954|gb|EOY27210.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 842

 Score =  668 bits (1723), Expect = 0.0
 Identities = 402/818 (49%), Positives = 506/818 (61%), Gaps = 19/818 (2%)
 Frame = -1

Query: 2422 MVSGSRIDGGAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDP 2243
            MV+G+RI+G    +SA VR+TIQSIKEIVGNHSDA+IYVALKE NMDPNET QKLL+QD 
Sbjct: 1    MVNGARIEGD---ISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDT 57

Query: 2242 FHEVKRRRDRKKEIPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNA--PSG 2069
            FHEV+R+RDRKKE    K   +++ +K ++     +K+  Y +R SRR   +RN    +G
Sbjct: 58   FHEVRRKRDRKKESIEYK--VSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPDAG 115

Query: 2068 VSQEFRVVRDNRVNQNSITDSKPGLNXXXXXXXXXXXXSM--KSDSGHQEHVGQHSSQPI 1895
            V++EFRVVRDNRVNQN+  D K   +            ++  K  +G   +    SS+ +
Sbjct: 116  VNREFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRSL 175

Query: 1894 KSSAD----SQQRQSKDAASVGNDRKEMVGEKRFPVPSATSRVQIKAXXXXXXXXXXXXX 1727
              +++    SQ R ++DA S G DRKE+  EKR  +P+A  R Q                
Sbjct: 176  SQTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQSS 235

Query: 1726 XXXXXXXXXS--DPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXXXXX 1553
                     S  DPVHVPS  SR +  VGAI+REVGVVG RRQ SE              
Sbjct: 236  SSSVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLS 295

Query: 1552 XXXSGRDGQSRESRPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQLN-GH 1376
                GRD  S   R F + S+ +Q S   AT+S +P +  SRSF SN YGSR +Q   GH
Sbjct: 296  NSLVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGH 355

Query: 1375 QKASQPNKEWKPKASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEAAVLQDNMSQLNIS 1196
            QKA+Q NKEWKPK S K S   PGVIG+P K+ SPPA + +G   E A LQD  SQ+NI 
Sbjct: 356  QKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIY 415

Query: 1195 ENQNVIIAPHIRVSETDRCRLTFGSLGADFDTSANSV-GVSTNGVEDLSTDPSGSVSASA 1019
            EN+NVIIA HIRV E DRCRLTFGS G +FD+  N V G    GV +   D +G  +AS 
Sbjct: 416  ENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAE---DSNGESAAS- 471

Query: 1018 AETSGDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR--TEKKDSTSSQNLNDYADVVLV 845
                 D+    K +E++                  +    + KD++S QNL+ YAD+ LV
Sbjct: 472  -----DDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLV 526

Query: 844  QGNSPSYT-TDSLQQHDTSELPSFS-GYDPQMAYEMSYFRPVADETGRGPGLPSSQEYTQ 671
            Q NSPSY  ++S +Q D  ELPSFS  YDPQ  Y++ YFRP  DET RG GLPS QE   
Sbjct: 527  QDNSPSYAPSESQKQQDPPELPSFSQAYDPQTGYDLPYFRPPIDETARGQGLPSPQE--- 583

Query: 670  VLSVHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYPQLHVSHFA 491
             LS HT+N +PAS+I M+                            +AQMYPQ+HVSHFA
Sbjct: 584  ALSAHTAN-VPASTIPMM----------------------QQQQPPVAQMYPQVHVSHFA 620

Query: 490  NMMPYRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGVKYGIQQFK 311
            N+MPYRQF+S         PGYS++PAYPHPSNGSSY+LMPG SSHL   G+KYGIQQFK
Sbjct: 621  NIMPYRQFVSPIYLPQMAMPGYSSNPAYPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFK 680

Query: 310  PVPTGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKD-SLYVPNPQAETSEIWM 134
            PVP GSPTGFGNFT+P+GYAINAPGVV +  G +DSSR+KYKD ++YVPN QA+TS++W+
Sbjct: 681  PVPAGSPTGFGNFTSPSGYAINAPGVVGNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWI 740

Query: 133  -NPRDVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFN 26
             NPR++ G+QS+ YYNMP Q+PH   Y+ SH+GHASFN
Sbjct: 741  QNPRELPGLQSAPYYNMP-QTPH--GYMPSHTGHASFN 775


>ref|XP_006465941.1| PREDICTED: dentin sialophosphoprotein-like [Citrus sinensis]
          Length = 862

 Score =  665 bits (1715), Expect = 0.0
 Identities = 390/814 (47%), Positives = 504/814 (61%), Gaps = 19/814 (2%)
 Frame = -1

Query: 2410 SRIDGGAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDPFHEV 2231
            +RI+GG Q+LSAG+R TIQ+IKEIVGNHSDA+IY  LK++NMDPNETAQKLLNQDPF EV
Sbjct: 17   TRIEGGTQILSAGMRNTIQTIKEIVGNHSDADIYFTLKDSNMDPNETAQKLLNQDPFLEV 76

Query: 2230 KRRRDRKKEIPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNA--PSGVSQE 2057
            KRRRD+KKE    KS    EP+KN+++    ++  +Y+DR++RR G +RNA   +G+++E
Sbjct: 77   KRRRDKKKENMSYKSLE--EPRKNSEIFGKTMRIRTYADRNARRRGYNRNALPDAGINRE 134

Query: 2056 FRVVRDNRVNQNSITDSKPGLNXXXXXXXXXXXXSMKSDS------GHQEHVGQHSSQPI 1895
            FRVVRDNRVN  +  ++K  L               +  S        +   G+  SQ  
Sbjct: 135  FRVVRDNRVNPEANQETKSPLPQSSISTNEKVTNVKEKGSPTGTTGSERPSGGRSFSQAS 194

Query: 1894 KSSADSQQRQSKDAASVGNDRKEMVGEKRFPVPSATSRVQ-IKAXXXXXXXXXXXXXXXX 1718
              S +   R + D    G DR E   EK       TS V  I+                 
Sbjct: 195  NGSTNLHPRHAYDHNITGTDRIEPSAEK-----FTTSAVNFIQHNITEGHSATLASSNSV 249

Query: 1717 XXXXXXSDPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXXXXXXXXSG 1538
                   DPVHVPS  SR ++ VGAI+REVGVVG  RQ S+                  G
Sbjct: 250  GGYFSSKDPVHVPSPDSRASSAVGAIKREVGVVGGGRQCSDNAVRDSTAPRSSFSNSILG 309

Query: 1537 RDGQSRESRPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQLN-GHQKASQ 1361
            RD  S   RPF + SK +Q +Q  ATDS +     +R+  +N Y  R HQ + GHQKASQ
Sbjct: 310  RDN-SDSFRPFPSISKADQINQIAATDSGV----ANRALFTNQYTGRSHQQSVGHQKASQ 364

Query: 1360 PNKEWKPKASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEAAVLQDNMSQLNISENQNV 1181
             NKEWKPK+S K +  GPGVIG+P K+ SPP  +++  + + A LQD +S++NI+ENQNV
Sbjct: 365  HNKEWKPKSSQKSNVIGPGVIGTPTKSPSPPVDDSKDLESDVAKLQDELSRVNINENQNV 424

Query: 1180 IIAPHIRVSETDRCRLTFGSLGADFDTSAN--SVGVSTNGVEDLSTDPSGSVSASAAETS 1007
            IIA HIRV ETDRCRLTFGS G DF++S N  S  ++    E+ + + + S++ +A++TS
Sbjct: 425  IIAQHIRVPETDRCRLTFGSFGVDFESSRNLGSGFLAAGSAEESNGESAASLTGAASKTS 484

Query: 1006 GDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR---TEKKDSTSSQNLNDYADVVLVQGN 836
            G++    K ++++                  +     + KD++S Q+L+ YAD+ LV+  
Sbjct: 485  GNDVSGRKPVDILDDLVRNSGSNSPASGEASEHQLPDDIKDASSPQDLDGYADIGLVRDT 544

Query: 835  SPSY-TTDSLQQHDTSELPSFSGYDPQMAYEMSYFRPVADETGRGPGLPSSQEYTQVLSV 659
             PSY  ++S QQ D+SEL SF  YD Q  Y+MSYFRP  DE+ RG GLPS QE    L+ 
Sbjct: 545  DPSYPLSESQQQQDSSELASFPAYDSQTGYDMSYFRPTMDESVRGQGLPSPQE---ALAS 601

Query: 658  HTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYPQLHVSHFANMMP 479
            H++N++PASSIAM+                            +AQMYPQ+HVSHF NMMP
Sbjct: 602  HSANSIPASSIAML---------------------QHQQQPQMAQMYPQVHVSHFPNMMP 640

Query: 478  YRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGVKYGIQQFKPVPT 299
            YRQ +S         PGYS++PAYPHPSNGSSYLLMPG SSHL+  G+KYGIQQFKPVPT
Sbjct: 641  YRQIISPVYVPQMAMPGYSSNPAYPHPSNGSSYLLMPGGSSHLSTNGLKYGIQQFKPVPT 700

Query: 298  GSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKD-SLYVPNPQAETSEIWM-NPR 125
             SPTGFGNFT+PAGYAINAP VV S  G +DSSR+KYKD +LYV N QA+TSE+W+ NPR
Sbjct: 701  ASPTGFGNFTSPAGYAINAPSVVGSVTGLEDSSRMKYKDGNLYVSNQQADTSELWIHNPR 760

Query: 124  DVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFN 26
            ++ GMQS  YYNMP Q+PH  AYL SH+GHASFN
Sbjct: 761  ELPGMQSGPYYNMPAQTPHAAAYLPSHAGHASFN 794


>ref|XP_007024584.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508779950|gb|EOY27206.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 883

 Score =  664 bits (1714), Expect = 0.0
 Identities = 405/850 (47%), Positives = 509/850 (59%), Gaps = 51/850 (6%)
 Frame = -1

Query: 2422 MVSGSRIDGGAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDP 2243
            MV+G+RI+G    +SA VR+TIQSIKEIVGNHSDA+IYVALKE NMDPNET QKLL+QD 
Sbjct: 1    MVNGARIEGD---ISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDT 57

Query: 2242 FHEVKRRRDRKKEIPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNAPSGVS 2063
            FHEV+R+RDRKKE    K   +++ +K ++     +K+  Y +R SRR   +RN   GV+
Sbjct: 58   FHEVRRKRDRKKESIEYK--VSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRNTLPGVN 115

Query: 2062 QEFRVVRDNRVNQNSITDSKPGLNXXXXXXXXXXXXSM--KSDSGHQEHVGQHSSQPIKS 1889
            +EFRVVRDNRVNQN+  D K   +            ++  K  +G   +    SS+ +  
Sbjct: 116  REFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNVAEKGSTGTSSNQRPFSSRSLSQ 175

Query: 1888 SAD----SQQRQSKDAASVGNDRKEMVGEKRFPVPSATSRVQIKAXXXXXXXXXXXXXXX 1721
            +++    SQ R ++DA S G DRKE+  EKR  +P+A  R Q                  
Sbjct: 176  TSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQSSSS 235

Query: 1720 XXXXXXXS--DPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXXXXXXX 1547
                   S  DPVHVPS  SR +  VGAI+REVGVVG RRQ SE                
Sbjct: 236  SVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNS 295

Query: 1546 XSGRDGQSRESRPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQLN-GHQK 1370
              GRD  S   R F + S+ +Q S   AT+S +P +  SRSF SN YGSR +Q   GHQK
Sbjct: 296  LVGRDNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGHQK 355

Query: 1369 ---------------------------ASQPNKEWKPKASVKPSAKGPGVIGSPAKTVSP 1271
                                       A+Q NKEWKPK S K S   PGVIG+P K+ SP
Sbjct: 356  EASYCSAFHPFIDQISLWESLSCIFDAANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASP 415

Query: 1270 PAHNTEGTQKEAAVLQDNMSQLNISENQNVIIAPHIRVSETDRCRLTFGSLGADFDTSAN 1091
            PA + +G   E A LQD  SQ+NI EN+NVIIA HIRV E DRCRLTFGS G +FD+  N
Sbjct: 416  PADDAKGLDSETAKLQDKFSQVNIYENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRN 475

Query: 1090 SV-GVSTNGVEDLSTDPSG-------SVSASAAETSGDEPVSSKQLEMMXXXXXXXXXXX 935
             V G    GV + S   S        ++S SA +TS D+    K +E++           
Sbjct: 476  FVPGFQATGVAEDSNGESAARLVFSPNLSVSAPDTSSDDAAGGKPIEILDDQIGNSGSDS 535

Query: 934  XXXXXXXDR--TEKKDSTSSQNLNDYADVVLVQGNSPSYT-TDSLQQHDTSELPSFS-GY 767
                   +    + KD++S QNL+ YAD+ LVQ NSPSY  ++S +Q D  ELPSFS  Y
Sbjct: 536  PLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQDNSPSYAPSESQKQQDPPELPSFSQAY 595

Query: 766  DPQMAYEMSYFRPVADETGRGPGLPSSQEYTQVLSVHTSNALPASSIAMVXXXXXXXXXX 587
            DPQ  Y++ YFRP  DET RG GLPS QE    LS HT+N +PAS+I M+          
Sbjct: 596  DPQTGYDLPYFRPPIDETARGQGLPSPQE---ALSAHTAN-VPASTIPMM---------- 641

Query: 586  XXXXXXXXXXXXXXXXXXLAQMYPQLHVSHFANMMPYRQFLSXXXXXXXXXPGYSNSPAY 407
                              +AQMYPQ+HVSHFAN+MPYRQF+S         PGYS++PAY
Sbjct: 642  ------------QQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNPAY 689

Query: 406  PHPSNGSSYLLMPGNSSHLTQGGVKYGIQQFKPVPTGSPTGFGNFTNPAGYAINAPGVVP 227
            PHPSNGSSY+LMPG SSHL   G+KYGIQQFKPVP GSPTGFGNFT+P+GYAINAPGVV 
Sbjct: 690  PHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGVVG 749

Query: 226  STVGHDDSSRLKYKD-SLYVPNPQAETSEIWM-NPRDVSGMQSS-YYNMPGQSPHPTAYL 56
            +  G +DSSR+KYKD ++YVPN QA+TS++W+ NPR++ G+QS+ YYNMP Q+PH   Y+
Sbjct: 750  NPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNMP-QTPH--GYM 806

Query: 55   TSHSGHASFN 26
             SH+GHASFN
Sbjct: 807  PSHTGHASFN 816


>emb|CBI35892.3| unnamed protein product [Vitis vinifera]
          Length = 809

 Score =  663 bits (1711), Expect = 0.0
 Identities = 405/836 (48%), Positives = 500/836 (59%), Gaps = 29/836 (3%)
 Frame = -1

Query: 2422 MVSGSRIDGGAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDP 2243
            MVSGSR++GG Q+L A VR+TIQSIKEIVGNHSDA+IYV L+ETNMDPNET QKLL QDP
Sbjct: 1    MVSGSRMEGGTQILPARVRKTIQSIKEIVGNHSDADIYVTLRETNMDPNETTQKLLYQDP 60

Query: 2242 FHEVKRRRDRKKEIPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNA----- 2078
            FHEVKR+RD+KKE    K  T  EP+   +      K+ S+ DR+ RR G SR+      
Sbjct: 61   FHEVKRKRDKKKESTGYKRPT--EPRIYIENVGQG-KFRSFPDRNVRRGGYSRSTVPGNA 117

Query: 2077 -----------PSGVSQEFRVVRDNRVNQNSITDSKP-------GLNXXXXXXXXXXXXS 1952
                        +G+ +EFRVVRDNRVNQN+  D KP        +N            S
Sbjct: 118  KTYQFYHSILLDAGIGREFRVVRDNRVNQNTNRDMKPVSPQLATSVNEQVISNISEKGNS 177

Query: 1951 MKSDSGHQEHVGQHSSQPIKSSADSQQRQSKDAASVGNDRKEMVGEKRFPVPSATSRVQI 1772
              + +  +   G+ SSQ +    D++    +DA S+  +  +        + S +S V +
Sbjct: 178  TGTSNNQKPSSGRQSSQSLNGPTDARPGIPQDANSMKPNDSQPYSAS---LASNSSVVGV 234

Query: 1771 KAXXXXXXXXXXXXXXXXXXXXXXSDPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEX 1592
             +                       DPVHVPS  SR +A VGAI+REVGVVG RRQS+E 
Sbjct: 235  YSSSS--------------------DPVHVPSPDSRSSAIVGAIKREVGVVGVRRQSTE- 273

Query: 1591 XXXXXXXXXXXXXXXXSGRDGQSRESRPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSN 1412
                                            + ++Q  Q    D  IP +P +RSF  N
Sbjct: 274  --------------------------------NSSDQPRQTTVPDHVIPSMPVNRSFLGN 301

Query: 1411 AYGSRPHQLN-GHQKASQPNKEWKPKASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEA 1235
             YGSRPHQ   GHQKA QPNKEWKPK+S K S   PGVIG+PAK+VSP A N++  + E 
Sbjct: 302  QYGSRPHQQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESET 361

Query: 1234 AVLQDNMSQLNISENQNVIIAPHIRVSETDRCRLTFGSLGADFDTSANSVGVSTNGVEDL 1055
            A LQD +SQ +ISENQNVIIA HIRV ETDRCRLTFGS GADF +   +VG      ++ 
Sbjct: 362  AKLQDKLSQASISENQNVIIAQHIRVPETDRCRLTFGSFGADFASGFQAVG----NADEP 417

Query: 1054 STDPSGSVSASAAETSGDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR-TEKKDSTSSQ 878
            S +PS S+S S  E+S D+   SKQ+++                    +  +KK+S+S Q
Sbjct: 418  SAEPSASLSVSPPESSSDD--GSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSSPQ 475

Query: 877  NLNDYADVVLVQGNSPSYTTDSLQQHDTSELPSF-SGYDPQMAYEMSYFRPVADETGRGP 701
            NL +YAD+ LV+ +SPSYT +S QQ +   LPSF   YDPQ  Y++ YFRP  DET RG 
Sbjct: 476  NLENYADIGLVRESSPSYTPESQQQQERHVLPSFPHAYDPQAGYDIPYFRPTMDETVRGQ 535

Query: 700  GLPSSQEYTQVLSVHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQM 521
            GLPS QE    L+ HT+N++PASSIAMV                            + QM
Sbjct: 536  GLPSPQE---ALASHTANSIPASSIAMV--------------------QQQQQQPPVPQM 572

Query: 520  YPQLHVSHFANMMPYRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQG 341
            Y Q+HV HFAN+MPYRQFLS         PGYS++PAY HPSN +SYLLMPG SSHL   
Sbjct: 573  YQQVHVPHFANLMPYRQFLSPVYVPPMAMPGYSSNPAYSHPSNANSYLLMPGGSSHLGAN 632

Query: 340  GVKYGIQQFKPVPTGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKD-SLYVPN 164
            G+KYGIQQ KPVP GSPTGFGNFTNP GYAINAPGVV S  G +DSSRLKYKD ++YVPN
Sbjct: 633  GLKYGIQQLKPVPAGSPTGFGNFTNPTGYAINAPGVVGSATGLEDSSRLKYKDGNIYVPN 692

Query: 163  PQAETSEIWM-NPRDVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFNXXXXXAQS 2
            PQAETSEIW+ NPR++ G+QS+ YYNMP Q+PH  AY+ SH+GHASFN     AQS
Sbjct: 693  PQAETSEIWIQNPRELPGLQSAPYYNMPAQTPH-AAYMPSHTGHASFNAAAAAAQS 747


>ref|XP_003528451.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max]
          Length = 863

 Score =  653 bits (1685), Expect = 0.0
 Identities = 399/835 (47%), Positives = 495/835 (59%), Gaps = 36/835 (4%)
 Frame = -1

Query: 2422 MVSGSRIDGGA--QVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQ 2249
            MV GSR +GG    +LSA VR+TIQSIKEIVGNHSDA+IYVALKETNMDPNET QKLLNQ
Sbjct: 1    MVPGSRTEGGTGTHLLSARVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQ 60

Query: 2248 DPFHEVKRRRDRKKEIPV-----QKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSR 2084
            DPFHEVKRRRDRKKE        Q S  +    +NN    M  K+N+ S+R+ RR   SR
Sbjct: 61   DPFHEVKRRRDRKKETQNVGNKGQPSADSRRSSENNSGQGM--KFNAPSERNVRRTNYSR 118

Query: 2083 NAPSGVSQEFRVVRDNRVN----------QNSITDSKPGLNXXXXXXXXXXXXSMKSDSG 1934
            N   G+S+EFRVVRDNRVN          Q   T +   LN               + + 
Sbjct: 119  NTLPGISKEFRVVRDNRVNHIYKEVKPLTQQHSTSATEQLNVNTPDKGS------STSTN 172

Query: 1933 HQEHVGQHSSQPIKSSADSQQRQSKDAASVGNDRK--EMVGEKRFPVPSATSRVQ-IKAX 1763
            H+    ++SS      +DS  R  KDA     DRK      +K+  + +A  RVQ IK  
Sbjct: 173  HRSSGSRNSSLASNGPSDSHARYLKDAVPNIIDRKIASEDKDKQGMISNAAGRVQPIKPN 232

Query: 1762 XXXXXXXXXXXXXXXXXXXXXS-DPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXX 1586
                                 S DPVHVPS  SR +  VGAIRREVGVVG RRQSS+   
Sbjct: 233  NAHQNSASVASTSSAVGVYSSSTDPVHVPSPDSRSSGVVGAIRREVGVVGVRRQSSDNKA 292

Query: 1585 XXXXXXXXXXXXXXSGRDGQSRES-RPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNA 1409
                           G+DG S +S +   A SK EQ SQ   T+ ++  +P SR   +N 
Sbjct: 293  KQSFAPSISYVV---GKDGTSADSFQSVGAVSKTEQFSQTNVTEPSLSGMPVSRPSLNNQ 349

Query: 1408 YGSRPHQ-LNGHQKASQPNKEWKPKASVKPSAKGPGVIGSPAKTV----SPPAHNTEGTQ 1244
            Y +RPHQ L GHQ+ SQ NKEWKPK+S KP++  PGVIG+P K      SPPA N+   +
Sbjct: 350  YNNRPHQQLVGHQRVSQQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENSGDIE 409

Query: 1243 KEAAVLQDNMSQLNISENQNVIIAPHIRVSETDRCRLTFGSLGADFDTSA-----NSVGV 1079
                 LQD +SQ+NI ENQNVIIA HIRV ETDRC+LTFG++G + D+S      + +G 
Sbjct: 410  SNTTELQDKLSQVNIYENQNVIIAQHIRVPETDRCQLTFGTIGTELDSSRLQSKYHIIGA 469

Query: 1078 STNGVEDLSTDPSGSVSASAAETSGDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDRT-- 905
            S    E+L+     S++  A E S D+   SKQ+++                   ++   
Sbjct: 470  SEKSNEELTA----SLTVPAPELSTDDVSGSKQVDLRDEHIRSSRSDSPVSGAASEQQLP 525

Query: 904  EKKDSTSSQNLNDYADVVLVQGNSPSYTTDSLQQHDTSELPSFSGYDPQMAYEMSYFRPV 725
            + KDS+++QNL++YA++ LV+ +SPSY     QQ D+ ++P F+ YDP   Y++ YFRP 
Sbjct: 526  DNKDSSNTQNLDNYANIGLVRDSSPSYAPSEPQQQDSHDMPGFAAYDPPAGYDIPYFRPT 585

Query: 724  ADETGRGPGLPSSQEYTQVLSVHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXX 545
             DET RG GL S QE    L  H +N  PAS+IAMV                        
Sbjct: 586  IDETVRGQGLSSPQE---ALISHATNNPPASTIAMV----------------------QQ 620

Query: 544  XXXXLAQMYPQLHVSHFANMMPYRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPG 365
                + QMYPQ+HVSHFAN+MPYRQFLS         PGYS++P YPHP+NGSSYLLMPG
Sbjct: 621  QQPPVPQMYPQVHVSHFANLMPYRQFLSPVYVPPMAMPGYSSNPPYPHPTNGSSYLLMPG 680

Query: 364  NSSHLTQGGVKYGIQQFKPVPTGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYK 185
              SHL    +KYG+QQFKPVP GSPTGFGNF NP GYA+  PGVV      +DSSR+KYK
Sbjct: 681  GGSHLNANNLKYGVQQFKPVPAGSPTGFGNFANPTGYAMITPGVVGGATALEDSSRVKYK 740

Query: 184  DSLYVPNPQAETSEIWM-NPRDVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFN 26
            D+LYVPNPQAETSEIW+ NPRD+ GMQS+ YYNMPGQ+PH  AY+ SH+GHASFN
Sbjct: 741  DNLYVPNPQAETSEIWLQNPRDLPGMQSTPYYNMPGQTPH-AAYMPSHTGHASFN 794


>ref|XP_006583148.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Glycine max]
          Length = 855

 Score =  650 bits (1678), Expect = 0.0
 Identities = 399/825 (48%), Positives = 493/825 (59%), Gaps = 26/825 (3%)
 Frame = -1

Query: 2422 MVSGSRIDGGA--QVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQ 2249
            MV GSR +GG    +LSA VR+TIQSIKEIVGNHSDA+IYVALKETNMDPNET QKLLNQ
Sbjct: 1    MVPGSRTEGGTGTHLLSARVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQ 60

Query: 2248 DPFHEVKRRRDRKKEIPV-----QKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSR 2084
            DPFHEVKRRRDRKKE        Q S  +    +NN    M  K+N+ S+R+ RR   SR
Sbjct: 61   DPFHEVKRRRDRKKETQNVGNKGQPSADSRRSSENNSGQGM--KFNAPSERNVRRTNYSR 118

Query: 2083 NAPSGVSQEFRVVRDNRVNQNSITDSKPGLNXXXXXXXXXXXXSMKSDSGHQEHVGQHSS 1904
            N   G+S+EFRVVRDNRVN +   + KP L                 D G      ++SS
Sbjct: 119  NTLPGISKEFRVVRDNRVN-HIYKEVKP-LTQQHSTSATEQLNVNTPDKGSSG--SRNSS 174

Query: 1903 QPIKSSADSQQRQSKDAASVGNDRK--EMVGEKRFPVPSATSRVQ-IKAXXXXXXXXXXX 1733
                  +DS  R  KDA     DRK      +K+  + +A  RVQ IK            
Sbjct: 175  LASNGPSDSHARYLKDAVPNIIDRKIASEDKDKQGMISNAAGRVQPIKPNNAHQNSASVA 234

Query: 1732 XXXXXXXXXXXS-DPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXXXX 1556
                       S DPVHVPS  SR +  VGAIRREVGVVG RRQSS+             
Sbjct: 235  STSSAVGVYSSSTDPVHVPSPDSRSSGVVGAIRREVGVVGVRRQSSDNKAKQSFAPSISY 294

Query: 1555 XXXXSGRDGQSRES-RPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQ-LN 1382
                 G+DG S +S +   A SK EQ SQ   T+ ++  +P SR   +N Y +RPHQ L 
Sbjct: 295  VV---GKDGTSADSFQSVGAVSKTEQFSQTNVTEPSLSGMPVSRPSLNNQYNNRPHQQLV 351

Query: 1381 GHQKASQPNKEWKPKASVKPSAKGPGVIGSPAKTV----SPPAHNTEGTQKEAAVLQDNM 1214
            GHQ+ SQ NKEWKPK+S KP++  PGVIG+P K      SPPA N+   +     LQD +
Sbjct: 352  GHQRVSQQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENSGDIESNTTELQDKL 411

Query: 1213 SQLNISENQNVIIAPHIRVSETDRCRLTFGSLGADFDTSA-----NSVGVSTNGVEDLST 1049
            SQ+NI ENQNVIIA HIRV ETDRC+LTFG++G + D+S      + +G S    E+L+ 
Sbjct: 412  SQVNIYENQNVIIAQHIRVPETDRCQLTFGTIGTELDSSRLQSKYHIIGASEKSNEELTA 471

Query: 1048 DPSGSVSASAAETSGDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDRT--EKKDSTSSQN 875
                S++  A E S D+   SKQ+++                   ++   + KDS+++QN
Sbjct: 472  ----SLTVPAPELSTDDVSGSKQVDLRDEHIRSSRSDSPVSGAASEQQLPDNKDSSNTQN 527

Query: 874  LNDYADVVLVQGNSPSYTTDSLQQHDTSELPSFSGYDPQMAYEMSYFRPVADETGRGPGL 695
            L++YA++ LV+ +SPSY     QQ D+ ++P F+ YDP   Y++ YFRP  DET RG GL
Sbjct: 528  LDNYANIGLVRDSSPSYAPSEPQQQDSHDMPGFAAYDPPAGYDIPYFRPTIDETVRGQGL 587

Query: 694  PSSQEYTQVLSVHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYP 515
             S QE    L  H +N  PAS+IAMV                            + QMYP
Sbjct: 588  SSPQE---ALISHATNNPPASTIAMV----------------------QQQQPPVPQMYP 622

Query: 514  QLHVSHFANMMPYRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGV 335
            Q+HVSHFAN+MPYRQFLS         PGYS++P YPHP+NGSSYLLMPG  SHL    +
Sbjct: 623  QVHVSHFANLMPYRQFLSPVYVPPMAMPGYSSNPPYPHPTNGSSYLLMPGGGSHLNANNL 682

Query: 334  KYGIQQFKPVPTGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKDSLYVPNPQA 155
            KYG+QQFKPVP GSPTGFGNF NP GYA+  PGVV      +DSSR+KYKD+LYVPNPQA
Sbjct: 683  KYGVQQFKPVPAGSPTGFGNFANPTGYAMITPGVVGGATALEDSSRVKYKDNLYVPNPQA 742

Query: 154  ETSEIWM-NPRDVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFN 26
            ETSEIW+ NPRD+ GMQS+ YYNMPGQ+PH  AY+ SH+GHASFN
Sbjct: 743  ETSEIWLQNPRDLPGMQSTPYYNMPGQTPH-AAYMPSHTGHASFN 786


>ref|XP_002299597.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550347518|gb|EEE84402.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 854

 Score =  650 bits (1676), Expect = 0.0
 Identities = 392/817 (47%), Positives = 501/817 (61%), Gaps = 21/817 (2%)
 Frame = -1

Query: 2413 GSRIDGGAQ---VLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDP 2243
            G+    G Q    LSA VR+TIQSIKEIVGN SDA+IY+ LKETNMDPNETAQKLLNQDP
Sbjct: 14   GASTSSGQQQTHTLSAKVRKTIQSIKEIVGNFSDADIYMVLKETNMDPNETAQKLLNQDP 73

Query: 2242 FHEVKRRRDRKKEIPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNAPSG-- 2069
            FHEVKR+R++KKE    +   +V+ +K+++     ++ +++SDR+++R G +R A  G  
Sbjct: 74   FHEVKRKREKKKENTSYRG--SVDSRKHSENFGQGMRPHTFSDRNAQRGGYTRTASPGNR 131

Query: 2068 -VSQEFRVVRDNRVNQNSITDSKPGLNXXXXXXXXXXXXSM--KSDSGHQEHV----GQH 1910
             +++EFRVVRDNRVNQN+  + KP L              +  K  +G   ++     + 
Sbjct: 132  GINREFRVVRDNRVNQNTSREPKPALLHGSTSAKEQGSGVVTEKGSTGISSNLKPSDARS 191

Query: 1909 SSQPIKSSADSQQRQSKDAASVGNDRKEMVGEKRFPVPSAT-SRVQIKAXXXXXXXXXXX 1733
            S Q      DS+ R ++DA S   DRK +  EKR    +AT SRVQ+             
Sbjct: 192  SHQASNGPIDSEPRHNRDANSSVGDRKVVSEEKRSVASNATTSRVQVAKSNNSQQHNALQ 251

Query: 1732 XXXXXXXXXXXS--DPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXXX 1559
                       S  DPVHVPS  SR +  VGAI+REVGVVG RRQS E            
Sbjct: 252  ASSNPVVGVYSSSTDPVHVPSPDSRSSGVVGAIKREVGVVGGRRQSFENAVKDLS----- 306

Query: 1558 XXXXXSGRDGQSRESRPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQLN- 1382
                    +  S   RPF A SK +Q SQ  A +  +P +P +RSF +N Y +RPHQ   
Sbjct: 307  ------SSNSFSESFRPFTAISKTDQVSQTAAIEP-MPSVPVNRSFLNNQYNNRPHQQAV 359

Query: 1381 GHQKASQPNKEWKPKASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEAAVLQDNMSQLN 1202
            GH KASQ NKEWKPK+S K S   PGVIG+P K+ SPP  N++  + +AA LQD  S++N
Sbjct: 360  GHPKASQHNKEWKPKSSQKSSVTSPGVIGTPTKSSSPPTDNSKNMELDAANLQDKFSRIN 419

Query: 1201 ISENQNVIIAPHIRVSETDRCRLTFGSLGADFDTSANSVGVSTNGVEDLSTDPSG-SVSA 1025
            I ENQNVIIA HIRV ETDRC+LTFGS G  FD +  + G    G+ + S   S  S+ A
Sbjct: 420  IHENQNVIIAQHIRVPETDRCKLTFGSFGVGFD-APRTPGFQAVGISEESNGESAISLPA 478

Query: 1024 SAAETSGDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDRTEKKDSTSSQNLNDYADVVLV 845
            SA ++S D+    KQ+E++                  +     +S+S  NL++YAD+ LV
Sbjct: 479  SAPDSSSDDASGGKQIELLDDQARNYGSDSPAASLESEHPLPVNSSSPPNLDNYADIGLV 538

Query: 844  QGNSPSYT-TDSLQQHDTSELPSFSGYDPQMAYEMSYFRPVADETGRGPGLPSSQEYTQV 668
            + +SPSY  ++S QQ D  ELPSFS YDPQ  Y++SYFRP  DET RG GLPS QE    
Sbjct: 539  RNSSPSYAPSESQQQQDHPELPSFSAYDPQTGYDISYFRPQIDETVRGQGLPSPQE---A 595

Query: 667  LSVHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYPQLHVSHFAN 488
            L+ HT+N +PAS+++ V                            +AQMYPQ+HVS F N
Sbjct: 596  LTTHTAN-VPASTMSTV-----------------------QQQPPMAQMYPQVHVSQFTN 631

Query: 487  MMPYRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGVKYGIQQFKP 308
            ++PYRQF+S         PGYS+SPAYPHPSNG+SYLLMPG  SHL   G+KYGIQ +KP
Sbjct: 632  LVPYRQFISPVYVPPMPMPGYSSSPAYPHPSNGNSYLLMPGGGSHLNANGLKYGIQHYKP 691

Query: 307  VPTGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKD-SLYVPNPQAETSEIWM- 134
            VP  +P GFGNF +P+GYAINAPGVV S  G +DSSR+KYKD +LYVPNPQAE SEIW+ 
Sbjct: 692  VPGNNPAGFGNFVSPSGYAINAPGVVGSATGLEDSSRMKYKDGNLYVPNPQAEASEIWIQ 751

Query: 133  NPRDVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFN 26
            NPR++ GMQS+ YYNMPGQ+   TAYL SH+GHASFN
Sbjct: 752  NPREIPGMQSAPYYNMPGQT--HTAYLPSHTGHASFN 786


>ref|XP_007135474.1| hypothetical protein PHAVU_010G132600g [Phaseolus vulgaris]
            gi|561008519|gb|ESW07468.1| hypothetical protein
            PHAVU_010G132600g [Phaseolus vulgaris]
          Length = 864

 Score =  649 bits (1674), Expect = 0.0
 Identities = 391/829 (47%), Positives = 498/829 (60%), Gaps = 30/829 (3%)
 Frame = -1

Query: 2422 MVSGSRIDG--GAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQ 2249
            MV GSR +   G  +LSA VR+TIQSIKEIVGNHSDA+IYVALKETNMDPNET QKLLNQ
Sbjct: 1    MVPGSRTESATGTHLLSARVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQ 60

Query: 2248 DPFHEVKRRRDRKKE---IPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNA 2078
            DPFHEVKRRRDRKKE   +    S  +  P +NN  +   VK+++ S+R+ RR   SRN 
Sbjct: 61   DPFHEVKRRRDRKKEPQNVGNNGSADSRRPSENN--SGQGVKFHTPSERNVRRANYSRNT 118

Query: 2077 PSGVSQEFRVVRDNRVN----------QNSITDSKPGLNXXXXXXXXXXXXSMKSDSGHQ 1928
              G+S+EFRVVRDNRVN          Q  +  +   LN               + + H+
Sbjct: 119  LPGISREFRVVRDNRVNYIYKEVKPLSQQHLASASEELNVNLSEKGS------SASTSHR 172

Query: 1927 EHVGQHSSQPIKSSADSQQRQSKDAASVGNDRK----EMVGEKRFPVPSATSRVQ-IKAX 1763
                ++SSQ +   +DS  R  KDA     DRK    +   +K+  + +A  RVQ IK  
Sbjct: 173  SSGSRNSSQALNGPSDSFARYPKDAVPNIVDRKIASEDKDKDKQSMISNAAERVQPIKPN 232

Query: 1762 XXXXXXXXXXXXXXXXXXXXXS-DPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXX 1586
                                 S DPVHVPS  SR ++ VGAIRREVGVVG RRQ S+   
Sbjct: 233  HIHQNPASVASSSSAVGVYSSSTDPVHVPSPDSRSSSVVGAIRREVGVVGVRRQPSDNKV 292

Query: 1585 XXXXXXXXXXXXXXSGRDGQSRES-RPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNA 1409
                           G+DG S +S +P  A  K EQ SQ   T+ ++  +P SR   +N 
Sbjct: 293  KQSFAPSSSYVA---GKDGTSADSFQPVGAVLKTEQFSQTKVTEPSLSGVPVSRPSVNNQ 349

Query: 1408 YGSRPHQ-LNGHQKASQPNKEWKPKASVKPSAKGPGVIGSPAKTV-SPPAHNTEGTQKEA 1235
            Y  RPHQ L GHQ+ SQ NKEWKPK+S KP++  PGVIG+P K   SPPA N+   + +A
Sbjct: 350  YNGRPHQQLVGHQRVSQQNKEWKPKSSQKPNSNNPGVIGTPKKAAASPPAENSVDIESDA 409

Query: 1234 AVLQDNMSQLNISENQNVIIAPHIRVSETDRCRLTFGSLGADFDTSANSVGVSTNGVEDL 1055
              LQD +SQLNI ENQNVIIA HI+V ETDRCRLTFG++G + D+S         G  + 
Sbjct: 410  VELQDKLSQLNIYENQNVIIAQHIQVPETDRCRLTFGTIGTEIDSSRLQSKYHIVGPSEK 469

Query: 1054 STDP-SGSVSASAAETSGDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR--TEKKDSTS 884
            S D  + S++  A E S D+   SKQ++++                  ++   + KDS++
Sbjct: 470  SNDELAASLAVPAPELSTDDVSGSKQVDLLDEHIRSSGSDSPVSGAPSEQQLPDNKDSSN 529

Query: 883  SQNLNDYADVVLVQGNSPSYTTDSLQQHDTSELPSFSGYDPQMAYEMSYFRPVADETGRG 704
            +QNL++YA++ LV+ +SPSY     QQ ++ ++P F+ YDP   Y++ YFRP  DET RG
Sbjct: 530  TQNLDNYANIGLVRDSSPSYAPSEPQQQESHDMPGFAAYDPPTGYDIPYFRPTIDETVRG 589

Query: 703  PGLPSSQEYTQVLSVHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQ 524
             GL S QE    L  H +N  PAS+IAMV                            + Q
Sbjct: 590  QGLSSPQE---ALISHGTNNTPASTIAMV-------------------QQQQQQQPPVPQ 627

Query: 523  MYPQLHVSHFANMMPYRQFLS-XXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLT 347
            MYPQ+HVSHFAN+MPYRQFLS          PGYS++P YPHP+NG+SY+LMPG  SHL 
Sbjct: 628  MYPQMHVSHFANLMPYRQFLSPVYVPPPMAMPGYSSNPPYPHPTNGNSYVLMPGGGSHLN 687

Query: 346  QGGVKYGIQQFKPVPTGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKDSLYVP 167
               +KYG+QQ+KPVP G+P GFGNF +PAGYA+  PGVV      +DSSR+KYKD+LYVP
Sbjct: 688  ANNLKYGVQQYKPVPAGNPAGFGNFASPAGYAMITPGVVGGATALEDSSRVKYKDNLYVP 747

Query: 166  NPQAETSEIWM-NPRDVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFN 26
            NPQAETSEIW+ NPRD+ GMQS+ YYNMPGQ+PH  AY+ SH+GHASFN
Sbjct: 748  NPQAETSEIWLQNPRDLPGMQSAPYYNMPGQTPH-AAYMPSHTGHASFN 795


>ref|XP_006361347.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Solanum
            tuberosum]
          Length = 837

 Score =  630 bits (1625), Expect = e-177
 Identities = 387/816 (47%), Positives = 482/816 (59%), Gaps = 17/816 (2%)
 Frame = -1

Query: 2422 MVSGSRIDGGAQVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLLNQDP 2243
            MVS S+ + G  VLSAGVR  ++SIKE+VGNHSDA+IYVALKETNMDPNETAQKLLNQDP
Sbjct: 1    MVSSSKPESGTHVLSAGVREILESIKEVVGNHSDADIYVALKETNMDPNETAQKLLNQDP 60

Query: 2242 FHEVKRRRDRKKEIPVQKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGPSRNAPSGVS 2063
            FHEVKR+RDRKKE    KS T  E   +++     ++ N++ + + RR   +R A  G +
Sbjct: 61   FHEVKRKRDRKKENHGYKSSTAPENGSHSEHGAQEMRVNTHINHNIRRGSYNRTALPGFT 120

Query: 2062 QEFRVVRDNRVNQN---------SITDSKPGLNXXXXXXXXXXXXSMKSDSGHQEHVGQH 1910
            +EFRVVRDNRVNQN         + T ++P ++            S K  SG+    G  
Sbjct: 121  REFRVVRDNRVNQNVNRVGKAVQTSTSAEPAIS------NTSVQSSSKGTSGNTLSTGGR 174

Query: 1909 SSQPIKSSADSQQRQSKDAASVGNDRKEMVGEKRFPVPSATSRV-QIKAXXXXXXXXXXX 1733
            SSQ    + +SQ   S DA     + + + GE    V +A S++  +K            
Sbjct: 175  SSQ--APNRNSQHTHSNDANLSSTNGQGLSGEMHASVSNAASQIGGVKPNGSRPHSITSS 232

Query: 1732 XXXXXXXXXXXSDPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSE--XXXXXXXXXXXX 1559
                       SDPVHVPSL SRPAA VGAI+REVGVVG RRQS+E              
Sbjct: 233  SNSVIGVYSSFSDPVHVPSLDSRPAAKVGAIKREVGVVGARRQSAETFAKSSSSQSRSSS 292

Query: 1558 XXXXXSGRDGQSRESRPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQLNG 1379
                   R             S N +S Q+  +DS    LP S+S S N + +R HQ  G
Sbjct: 293  NSHMEQARQDVGNSKGSLRPLSSNSRSDQSGVSDSPKSNLPMSKSLSGNQHMNRLHQSVG 352

Query: 1378 HQKASQPNKEWKPKASVKPSAKGPGVIGSPAKTVSPPAHNTEGTQKEAAVLQDNMSQLNI 1199
            HQKA Q    WKPK + K S   PGVIG P++ VS  +  +E  +KE + LQD MS+LNI
Sbjct: 353  HQKAVQ----WKPKLTKKSSVTDPGVIGKPSEGVSLTS-KSEDLEKEGSQLQDKMSRLNI 407

Query: 1198 SENQNVIIAPHIRVSETDRCRLTFGSLGADFDTSANSVGVSTNGVEDLSTDPSGSVSASA 1019
            SE  NVIIA HIRVSETDRCRLTFGS GA+F         S   +E+ S   S  +S   
Sbjct: 408  SE--NVIIAEHIRVSETDRCRLTFGSFGAEFK--------SAKDLEEESQTESSRLSVLV 457

Query: 1018 AETSGDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR--TEKKDSTSSQNLNDYADVVLV 845
            +E+S D+PV SKQL++                   D+  ++ ++ +S ++L +YADV LV
Sbjct: 458  SESSTDDPVGSKQLDLADDRVQIPESTSPGSDVILDQKLSDNRECSSPEDLGNYADVGLV 517

Query: 844  QGNSPSYT-TDSLQQHDTSELPSFSGYDPQMAYEMSYFRPVADETGRGPGLPSSQEYTQV 668
            Q NS SYT  +S QQ + S L SFS YDPQ  Y++ YFRP  DE  R  G        + 
Sbjct: 518  QDNSASYTPPESQQQQNASNLSSFSAYDPQTGYDIPYFRPAVDEALRDQG------PQEA 571

Query: 667  LSVHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQMYPQLHVSHFAN 488
            LS H +N++PASSI MV                            +AQMYPQ+HVSH+AN
Sbjct: 572  LSSHAANSMPASSIPMV--------------------QQVQQHQPIAQMYPQVHVSHYAN 611

Query: 487  MMPYRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQGGVKYGIQQFKP 308
            +MPYR   S         PGYS++ AYPHP NGS+YLLMPG  SHL+  G+KYGIQQFKP
Sbjct: 612  LMPYRHVFSPVYVPPMAMPGYSSNAAYPHPPNGSNYLLMPGGGSHLSANGLKYGIQQFKP 671

Query: 307  VPTGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKD-SLYVPNPQAETSEIWMN 131
            VPTGS +GFGNFT+P GYAIN PGV+ S  G +DSSR+KYKD +LYVPNPQAETSE+WMN
Sbjct: 672  VPTGSASGFGNFTSPTGYAINTPGVIGSATGLEDSSRMKYKDGNLYVPNPQAETSEMWMN 731

Query: 130  PRDVSGMQS-SYYNMPGQSPHPTAYLTSHSGHASFN 26
            PRD+S MQS SYY+M GQ+PH  AYL SHSGHASFN
Sbjct: 732  PRDISTMQSGSYYSMSGQTPH-AAYLPSHSGHASFN 766


>ref|XP_006598817.1| PREDICTED: putative uncharacterized protein DDB_G0277255-like
            [Glycine max]
          Length = 852

 Score =  628 bits (1619), Expect = e-177
 Identities = 391/827 (47%), Positives = 488/827 (59%), Gaps = 28/827 (3%)
 Frame = -1

Query: 2422 MVSGSRIDGGA----QVLSAGVRRTIQSIKEIVGNHSDAEIYVALKETNMDPNETAQKLL 2255
            MV GS+ +GG      +LSA VR+TIQSIKEIVGNHSDA+IYVALKE NMDPNET QKLL
Sbjct: 1    MVPGSKTEGGGTGTTHLLSARVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLL 60

Query: 2254 NQDPFHEVKRRRDRKKEIPV-----QKSFTTVEPKKNNDLARMPVKYNSYSDRSSRRVGP 2090
            NQDPFHEVKRRRDRKKE        Q S  +  P +NN    M  K++++S+R+ RR   
Sbjct: 61   NQDPFHEVKRRRDRKKETQNVGNRGQPSADSRRPSENNSGQGM--KFHTHSERNVRRTNY 118

Query: 2089 SRNAPSGVSQEFRVVRDNRVNQNSITDSKPGLNXXXXXXXXXXXXSMKSDSGHQEHVGQH 1910
            SR+   G+S+EFRVVRDNRVN   I      L+               SD G      ++
Sbjct: 119  SRSTFPGISREFRVVRDNRVNH--IYKEVTPLSQQHSTSVTEQLNVNISDKGSSG--SRN 174

Query: 1909 SSQPIKSSADSQQRQSKDAASVGNDRKEMVGEK--RFPVPSATSRVQ-IKAXXXXXXXXX 1739
            SSQ     +DS  R +        DRK +  +K  +  + +A  RVQ IK          
Sbjct: 175  SSQASNGPSDSHARYAPKTI----DRKIVYEDKDKQGMISNAAGRVQPIKPNSVHQNSAL 230

Query: 1738 XXXXXXXXXXXXXS-DPVHVPSLHSRPAANVGAIRREVGVVGPRRQSSEXXXXXXXXXXX 1562
                         S DPVHVPS  SR    VGAIRREVG VG RRQSS+           
Sbjct: 231  VASTSSAVGVYSSSTDPVHVPSPDSRSPGVVGAIRREVGFVGVRRQSSDNKAKQSFAPSS 290

Query: 1561 XXXXXXSGRDGQSRES-RPFNAASKNEQSSQNVATDSAIPVLPTSRSFSSNAYGSRPHQ- 1388
                   G+DG S +S +   A SK EQ SQ   T+ ++  +P SR   +N + +RPHQ 
Sbjct: 291  PHVV---GKDGTSADSFQSVGAVSKTEQFSQTNVTEPSLSGMPVSRPSLNNQHNNRPHQQ 347

Query: 1387 LNGHQKASQPNKEWKPKASVKPSAKG-PGVIGSPAKTV---SPPAHNTEGTQKEAAVLQD 1220
            L GHQ+ SQ NKEWKPK+S KP+    PGVIG+P K     SPPA N+   +     LQD
Sbjct: 348  LVGHQRVSQQNKEWKPKSSQKPNCNNSPGVIGTPKKAAAAASPPAENSGDIESNTVELQD 407

Query: 1219 NMSQLNISENQNVIIAPHIRVSETDRCRLTFGSLGADFDTSA-----NSVGVSTNGVEDL 1055
             +SQ+NI ENQNVIIA HIRV ETDRCRLTFG++G + D+S      + +G S    E+L
Sbjct: 408  KLSQVNIYENQNVIIAQHIRVPETDRCRLTFGTIGTELDSSRPQSKYHIIGASEKSNEEL 467

Query: 1054 STDPSGSVSASAAETSGDEPVSSKQLEMMXXXXXXXXXXXXXXXXXXDR--TEKKDSTSS 881
                + S++  A E S D+   SKQ+++                   ++   + KDS+++
Sbjct: 468  ----TASLTVPAPELSTDDVSGSKQVDLRDEHIRSLGSDSPVSGATSEQQLPDNKDSSNT 523

Query: 880  QNLNDYADVVLVQGNSPSYTTDSLQQHDTSELPSFSGYDPQMAYEMSYFRPVADETGRGP 701
            +NL++YA++ LV+ +SPSY     QQ D+ ++P F+ YD    Y++ YFRP  DET RG 
Sbjct: 524  KNLDNYANIGLVRDSSPSYAPSEQQQQDSHDMPGFAAYDSPAGYDIPYFRPTIDETVRGQ 583

Query: 700  GLPSSQEYTQVLSVHTSNALPASSIAMVXXXXXXXXXXXXXXXXXXXXXXXXXXXXLAQM 521
            GL S QE    L  H +N  PAS+IAMV                            + QM
Sbjct: 584  GLSSPQE---ALISHPTNT-PASTIAMV----------------------QQQQPPVPQM 617

Query: 520  YPQLHVSHFANMMPYRQFLSXXXXXXXXXPGYSNSPAYPHPSNGSSYLLMPGNSSHLTQG 341
            YPQ+HVSHFAN+MPYRQFLS         PGYS++P YPHP+NGSSYLLMPG  SHL   
Sbjct: 618  YPQVHVSHFANLMPYRQFLSPVYVPPMAMPGYSSNPPYPHPTNGSSYLLMPGGGSHLNAN 677

Query: 340  GVKYGIQQFKPVPTGSPTGFGNFTNPAGYAINAPGVVPSTVGHDDSSRLKYKDSLYVPNP 161
             +KYG+QQFKPVP GSPTGFGNF NP GYA+  PGVV      +DSSR+KYKD+LYVPNP
Sbjct: 678  NLKYGVQQFKPVPAGSPTGFGNFANPTGYAMITPGVVGGATALEDSSRVKYKDNLYVPNP 737

Query: 160  QAETSEIWM-NPRDVSGMQSS-YYNMPGQSPHPTAYLTSHSGHASFN 26
            QAETSEIW+ NPRD  GMQS+ YYNMPGQ+PH  AY+ SH+GHASFN
Sbjct: 738  QAETSEIWLQNPRDHPGMQSTPYYNMPGQTPH-AAYMPSHTGHASFN 783


Top