BLASTX nr result

ID: Akebia23_contig00004944 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00004944
         (2917 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002274314.2| PREDICTED: uncharacterized protein LOC100248...   776   0.0  
emb|CAN69468.1| hypothetical protein VITISV_042555 [Vitis vinifera]   738   0.0  
emb|CBI35892.3| unnamed protein product [Vitis vinifera]              738   0.0  
ref|XP_002299597.2| hydroxyproline-rich glycoprotein [Populus tr...   733   0.0  
gb|EXB29673.1| hypothetical protein L484_013447 [Morus notabilis]     728   0.0  
ref|XP_007024587.1| Uncharacterized protein isoform 4 [Theobroma...   727   0.0  
ref|XP_007024589.1| Uncharacterized protein isoform 6 [Theobroma...   719   0.0  
ref|XP_002521347.1| conserved hypothetical protein [Ricinus comm...   718   0.0  
ref|XP_007024585.1| Uncharacterized protein isoform 2 [Theobroma...   717   0.0  
ref|XP_007024588.1| Uncharacterized protein isoform 5 [Theobroma...   709   0.0  
ref|XP_002304144.2| hypothetical protein POPTR_0003s06200g [Popu...   709   0.0  
ref|XP_007024584.1| Uncharacterized protein isoform 1 [Theobroma...   706   0.0  
ref|XP_007135474.1| hypothetical protein PHAVU_010G132600g [Phas...   690   0.0  
ref|XP_003528451.1| PREDICTED: dentin sialophosphoprotein-like i...   686   0.0  
ref|XP_006583148.1| PREDICTED: dentin sialophosphoprotein-like i...   685   0.0  
ref|XP_004163891.1| PREDICTED: uncharacterized protein LOC101226...   685   0.0  
ref|XP_006598817.1| PREDICTED: putative uncharacterized protein ...   677   0.0  
ref|XP_006426626.1| hypothetical protein CICLE_v10024871mg [Citr...   669   0.0  
ref|XP_006465941.1| PREDICTED: dentin sialophosphoprotein-like [...   664   0.0  
ref|XP_006426627.1| hypothetical protein CICLE_v10024871mg [Citr...   664   0.0  

>ref|XP_002274314.2| PREDICTED: uncharacterized protein LOC100248075 [Vitis vinifera]
          Length = 860

 Score =  776 bits (2005), Expect = 0.0
 Identities = 451/869 (51%), Positives = 546/869 (62%), Gaps = 31/869 (3%)
 Frame = -3

Query: 2684 MVLGSQVEVETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDP 2505
            MV GS++E  T+IL ARVRKTIQSIKE+VGNHSD+DIYV L+ETNMDPNET QKLL QDP
Sbjct: 1    MVSGSRMEGGTQILPARVRKTIQSIKEIVGNHSDADIYVTLRETNMDPNETTQKLLYQDP 60

Query: 2504 FHEVXXXXXXXKENTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNAL----- 2340
            FHEV       KE+T +    EPR  IENV Q  K  +F D             +     
Sbjct: 61   FHEVKRKRDKKKESTGYKRPTEPRIYIENVGQG-KFRSFPDRNVRRGGYSRSTLMVRILL 119

Query: 2339 -PGISREFXXXXXXXXXXXXXREIKPASLQSVPVVGVSPVAILSDQEHL-----GLRSSE 2178
              GI REF             R++KP S Q    V    ++ +S++ +        + S 
Sbjct: 120  DAGIGREFRVVRDNRVNQNTNRDMKPVSPQLATSVNEQVISNISEKGNSTGTSNNQKPSS 179

Query: 2177 GQKSSEALNGPSDSSSGRGQG--------KEMFEDVGVKVPSSGSQSRGQKXXXXXXXXX 2022
            G++SS++LNGP+D+  G  Q         KE+ E+    +P++ S+ +  K         
Sbjct: 180  GRQSSQSLNGPTDARPGIPQDANSSGSNRKELLEERQATIPNAVSRVQAVKPNDSQPYSA 239

Query: 2021 XXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXX 1842
                                           VGAIKREVGVVGVRR              
Sbjct: 240  SLASNSSVVGVYSSSSDPVHVPSPDSRSSAIVGAIKREVGVVGVRRQSTENSVKHSSAPS 299

Query: 1841 XSHPVTVSGKGISPSPESFQAPTSVTKRDQG-QTTATESVLPSISVNRSFLSNQYGSKPH 1665
             S P ++ G+  SPS E F+   ++ K DQ  QTT  + V+PS+ VNRSFL NQYGS+PH
Sbjct: 300  SSLPSSLLGRENSPSTEPFRPFNAIPKSDQPRQTTVPDHVIPSMPVNRSFLGNQYGSRPH 359

Query: 1664 QQLMGHQKAPQSNMEWKPKASQKSSHIRPGIIGATTPN-SPPTDISTESKIE-ANLLQKF 1491
            QQ +GHQKAPQ N EWKPK+SQKSSHI PG+IG    + SP  D S + + E A L  K 
Sbjct: 360  QQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQDKL 419

Query: 1490 SQVNISENDHVIIPLHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQANAEQSNGERSAS 1311
            SQ +ISEN +VII  H+RVPE  R +LTFGSFGA+F S     ++   NA++ + E SAS
Sbjct: 420  SQASISENQNVIIAQHIRVPETDRCRLTFGSFGADFAS----GFQAVGNADEPSAEPSAS 475

Query: 1310 --VSAPSTPREDVNQVDLLDGQXXXXXXXXXXXXXXSEHQLPYKNDSS-PENLENYADIG 1140
              VS P +  +D ++   LD Q              SEHQLP K +SS P+NLENYADIG
Sbjct: 476  LSVSPPESSSDDGSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSSPQNLENYADIG 535

Query: 1139 LVRNDSPSYSPAEPQHQQDAAGLPSFP-AYDHQTGYDMPFFRPMIDESVHVQGAPSPAEA 963
            LVR  SPSY+P E Q QQ+   LPSFP AYD Q GYD+P+FRP +DE+V  QG PSP EA
Sbjct: 536  LVRESSPSYTP-ESQQQQERHVLPSFPHAYDPQAGYDIPYFRPTMDETVRGQGLPSPQEA 594

Query: 962  LSSHAINSIPASTXXXXXXXXXXQ----LYPQVHVPHFPNFMPYRQFVSPVFVPPMAMPG 795
            L+SH  NSIPAS+               +Y QVHVPHF N MPYRQF+SPV+VPPMAMPG
Sbjct: 595  LASHTANSIPASSIAMVQQQQQQPPVPQMYQQVHVPHFANLMPYRQFLSPVYVPPMAMPG 654

Query: 794  YSSNPAAYPHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQGGYA 615
            YSSNPA Y HPSNANSY+LMPGGSSHL A GLKYG  Q KP+PAG+PTGFGN+TN  GYA
Sbjct: 655  YSSNPA-YSHPSNANSYLLMPGGSSHLGANGLKYGIQQLKPVPAGSPTGFGNFTNPTGYA 713

Query: 614  MNHQGTVGGATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNIPGQ 435
            +N  G VG ATGLEDS+R+KYK+GNIYVPNPQAETSE+WIQ PRELPG+QSAPYYN+P Q
Sbjct: 714  INAPGVVGSATGLEDSSRLKYKDGNIYVPNPQAETSEIWIQNPRELPGLQSAPYYNMPAQ 773

Query: 434  APHAASYMPSHTGHASFN-AAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXXXX 258
             PHAA YMPSHTGHASFN AAAAAQ +H+ + GLYHP PQPAA+A+P HH+         
Sbjct: 774  TPHAA-YMPSHTGHASFNAAAAAAQSSHMQFPGLYHPPPQPAAMASP-HHLGPPMGGNVG 831

Query: 257  XXXXXXXXGTQVGAYQQPQVGHMNWSTNF 171
                    G QVGAYQQPQ+GH+NW+TNF
Sbjct: 832  VGVAAAAPGPQVGAYQQPQLGHLNWTTNF 860


>emb|CAN69468.1| hypothetical protein VITISV_042555 [Vitis vinifera]
          Length = 914

 Score =  738 bits (1906), Expect = 0.0
 Identities = 444/922 (48%), Positives = 540/922 (58%), Gaps = 84/922 (9%)
 Frame = -3

Query: 2684 MVLGSQVEVETKILSARVRKTIQSIKEVVGNHSDSDIYV--------------------- 2568
            M   S++E   +IL  +V KTIQ IKE+VGNHSD+DIYV                     
Sbjct: 1    MAFDSRMEGGMQILPPQVHKTIQLIKEIVGNHSDADIYVALREMNMDPNETVQKLLNQDL 60

Query: 2567 ----MLKETNMDPNETAQKLLNQDPFHEVXXXXXXXKENTAHMGSAEPRKPIENVRQW-- 2406
                ML+E NMDPNE AQKLLNQDPFHEV       KE+T +    EPR  IENV Q   
Sbjct: 61   DIHVMLREMNMDPNEVAQKLLNQDPFHEVKRKRDKKKESTGYKRPTEPRIYIENVGQGKF 120

Query: 2405 ---------------------TKSHTFSDXXXXXXXXXXXNAL-----------PGISRE 2322
                                  K++ F               L            GI RE
Sbjct: 121  RSFPDRNVRRGGYSRSTVPGNAKTYQFYHSFVLELLYLTVCFLLSELMVRILLDAGIGRE 180

Query: 2321 FXXXXXXXXXXXXXREIKPASLQSVPVVGVSPVAILSDQEHL-----GLRSSEGQKSSEA 2157
            F             R++KP S Q         ++ +S++ +        + S G++SS++
Sbjct: 181  FRVVRDNRVNQNTNRDMKPVSPQLATSANEQVISNISEKGNSTGTSNNQKPSSGRQSSQS 240

Query: 2156 LNGPSDSSSGRGQG--------KEMFEDVGVKVPSSGSQSRGQKXXXXXXXXXXXXXXXX 2001
            LNGP+D+  G  Q         KE+ E+    +P++ S+ +  K                
Sbjct: 241  LNGPTDARPGIPQDANSSGSNRKELLEERQATIPNAVSRVQAVKPNDSQPYSASLASNSS 300

Query: 2000 XXXXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXXXSHPVTV 1821
                                    VGAIKREVGVVGVRR               S P ++
Sbjct: 301  VVGVYSSSSDPVHVPSPDSRSSAIVGAIKREVGVVGVRRQSTENSVKHSSAPSSSLPSSL 360

Query: 1820 SGKGISPSPESFQAPTSVTKRDQG-QTTATESVLPSISVNRSFLSNQYGSKPHQQLMGHQ 1644
             G+  SPS E F+   ++ K DQ  QTT  + V+PS+ VNRSFL NQYGS+PHQQ +GHQ
Sbjct: 361  LGRENSPSTEPFRPFNAIPKSDQPRQTTVPDHVIPSMPVNRSFLGNQYGSRPHQQPVGHQ 420

Query: 1643 KAPQSNMEWKPKASQKSSHIRPGIIGATTPN-SPPTDISTESKIE-ANLLQKFSQVNISE 1470
            KAPQ N EWKPK+SQKSSHI PG+IG    + SP  D S + + E A L  K SQ +ISE
Sbjct: 421  KAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQDKLSQASISE 480

Query: 1469 NDHVIIPLHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQANAEQSNGERSAS--VSAPS 1296
            N +VII  H+RVPE  R +LTFGSFGA+F S     ++   NA++ + E SAS  VS P 
Sbjct: 481  NQNVIIAQHIRVPETDRCRLTFGSFGADFAS----GFQAVGNADEPSAEPSASLSVSPPE 536

Query: 1295 TPREDVNQVDLLDGQXXXXXXXXXXXXXXSEHQLPYKNDSS-PENLENYADIGLVRNDSP 1119
            +  +D ++   LD Q              SEHQLP K +SS P+NLENYADIGLVR  SP
Sbjct: 537  SSSDDGSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSSPQNLENYADIGLVRESSP 596

Query: 1118 SYSPAEPQHQQDAAGLPSFP-AYDHQTGYDMPFFRPMIDESVHVQGAPSPAEALSSHAIN 942
            SY+P E Q QQ+   LPSFP AYD Q GYD+P+FRP +DE+V  QG PSP EAL+SH  N
Sbjct: 597  SYTP-ESQQQQERHVLPSFPHAYDPQAGYDIPYFRPTMDETVRGQGLPSPQEALASHTAN 655

Query: 941  SIPASTXXXXXXXXXXQ----LYPQVHVPHFPNFMPYRQFVSPVFVPPMAMPGYSSNPAA 774
            SIPAS+               +Y QVHVPHF N MPYRQF+SPV+VPPMAMPGYSSNPA 
Sbjct: 656  SIPASSIAMVQQQQQQPPVPQMYQQVHVPHFANLMPYRQFLSPVYVPPMAMPGYSSNPA- 714

Query: 773  YPHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQGGYAMNHQGTV 594
            Y HPSNANSY+LMPGGSSHL A GLKYG  Q KP+PAG+PTGFGN+TN  GYA+N  G V
Sbjct: 715  YSHPSNANSYLLMPGGSSHLGANGLKYGIQQLKPVPAGSPTGFGNFTNPTGYAINAPGVV 774

Query: 593  GGATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNIPGQAPHAASY 414
            G ATGLEDS+R+KYK+GNIYVPNPQAETSE+WIQ PRELPG+QSAPYYN+P Q PHAA Y
Sbjct: 775  GSATGLEDSSRLKYKDGNIYVPNPQAETSEIWIQNPRELPGLQSAPYYNMPAQTPHAA-Y 833

Query: 413  MPSHTGHASFN-AAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXXXXXXXXXXX 237
            MPSHTGHASFN AAAAAQ +H+ + GLYHP PQPAA+A+P HH+                
Sbjct: 834  MPSHTGHASFNAAAAAAQSSHMQFPGLYHPPPQPAAMASP-HHLGPPMGGNVGVGVAAAA 892

Query: 236  XGTQVGAYQQPQVGHMNWSTNF 171
             G QVGAYQQPQ+GH+NW+TNF
Sbjct: 893  PGPQVGAYQQPQLGHLNWTTNF 914


>emb|CBI35892.3| unnamed protein product [Vitis vinifera]
          Length = 809

 Score =  738 bits (1905), Expect = 0.0
 Identities = 439/869 (50%), Positives = 524/869 (60%), Gaps = 31/869 (3%)
 Frame = -3

Query: 2684 MVLGSQVEVETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDP 2505
            MV GS++E  T+IL ARVRKTIQSIKE+VGNHSD+DIYV L+ETNMDPNET QKLL QDP
Sbjct: 1    MVSGSRMEGGTQILPARVRKTIQSIKEIVGNHSDADIYVTLRETNMDPNETTQKLLYQDP 60

Query: 2504 FHEVXXXXXXXKENTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNAL----- 2340
            FHEV       KE+T +    EPR  IENV Q  K  +F D                   
Sbjct: 61   FHEVKRKRDKKKESTGYKRPTEPRIYIENVGQG-KFRSFPDRNVRRGGYSRSTVPGNAKT 119

Query: 2339 ----------PGISREFXXXXXXXXXXXXXREIKPASLQSVPVVGVSPVAILSDQEHL-- 2196
                       GI REF             R++KP S Q    V    ++ +S++ +   
Sbjct: 120  YQFYHSILLDAGIGREFRVVRDNRVNQNTNRDMKPVSPQLATSVNEQVISNISEKGNSTG 179

Query: 2195 ---GLRSSEGQKSSEALNGPSDSSSGRGQGKEMFEDVGVKVPSSGSQSRGQKXXXXXXXX 2025
                 + S G++SS++LNGP+D+  G  Q     +    +  S+   S            
Sbjct: 180  TSNNQKPSSGRQSSQSLNGPTDARPGIPQDANSMKPNDSQPYSASLASNSS--------- 230

Query: 2024 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXX 1845
                                            VGAIKREVGVVGVRR             
Sbjct: 231  --------VVGVYSSSSDPVHVPSPDSRSSAIVGAIKREVGVVGVRRQ------------ 270

Query: 1844 XXSHPVTVSGKGISPSPESFQAPTSVTKRDQGQTTATESVLPSISVNRSFLSNQYGSKPH 1665
                    S +  S  P               QTT  + V+PS+ VNRSFL NQYGS+PH
Sbjct: 271  --------STENSSDQPR--------------QTTVPDHVIPSMPVNRSFLGNQYGSRPH 308

Query: 1664 QQLMGHQKAPQSNMEWKPKASQKSSHIRPGIIGATTPN-SPPTDISTESKIE-ANLLQKF 1491
            QQ +GHQKAPQ N EWKPK+SQKSSHI PG+IG    + SP  D S + + E A L  K 
Sbjct: 309  QQPVGHQKAPQPNKEWKPKSSQKSSHIIPGVIGTPAKSVSPRADNSKDLESETAKLQDKL 368

Query: 1490 SQVNISENDHVIIPLHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQANAEQSNGERSAS 1311
            SQ +ISEN +VII  H+RVPE  R +LTFGSFGA+F S     ++   NA++ + E SAS
Sbjct: 369  SQASISENQNVIIAQHIRVPETDRCRLTFGSFGADFAS----GFQAVGNADEPSAEPSAS 424

Query: 1310 --VSAPSTPREDVNQVDLLDGQXXXXXXXXXXXXXXSEHQLPYKNDSS-PENLENYADIG 1140
              VS P +  +D ++   LD Q              SEHQLP K +SS P+NLENYADIG
Sbjct: 425  LSVSPPESSSDDGSKQVDLDDQYINSGTASPESGEASEHQLPDKKESSSPQNLENYADIG 484

Query: 1139 LVRNDSPSYSPAEPQHQQDAAGLPSFP-AYDHQTGYDMPFFRPMIDESVHVQGAPSPAEA 963
            LVR  SPSY+P E Q QQ+   LPSFP AYD Q GYD+P+FRP +DE+V  QG PSP EA
Sbjct: 485  LVRESSPSYTP-ESQQQQERHVLPSFPHAYDPQAGYDIPYFRPTMDETVRGQGLPSPQEA 543

Query: 962  LSSHAINSIPASTXXXXXXXXXXQ----LYPQVHVPHFPNFMPYRQFVSPVFVPPMAMPG 795
            L+SH  NSIPAS+               +Y QVHVPHF N MPYRQF+SPV+VPPMAMPG
Sbjct: 544  LASHTANSIPASSIAMVQQQQQQPPVPQMYQQVHVPHFANLMPYRQFLSPVYVPPMAMPG 603

Query: 794  YSSNPAAYPHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQGGYA 615
            YSSNPA Y HPSNANSY+LMPGGSSHL A GLKYG  Q KP+PAG+PTGFGN+TN  GYA
Sbjct: 604  YSSNPA-YSHPSNANSYLLMPGGSSHLGANGLKYGIQQLKPVPAGSPTGFGNFTNPTGYA 662

Query: 614  MNHQGTVGGATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNIPGQ 435
            +N  G VG ATGLEDS+R+KYK+GNIYVPNPQAETSE+WIQ PRELPG+QSAPYYN+P Q
Sbjct: 663  INAPGVVGSATGLEDSSRLKYKDGNIYVPNPQAETSEIWIQNPRELPGLQSAPYYNMPAQ 722

Query: 434  APHAASYMPSHTGHASFN-AAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXXXX 258
             PHAA YMPSHTGHASFN AAAAAQ +H+ + GLYHP PQPAA+A+P HH+         
Sbjct: 723  TPHAA-YMPSHTGHASFNAAAAAAQSSHMQFPGLYHPPPQPAAMASP-HHLGPPMGGNVG 780

Query: 257  XXXXXXXXGTQVGAYQQPQVGHMNWSTNF 171
                    G QVGAYQQPQ+GH+NW+TNF
Sbjct: 781  VGVAAAAPGPQVGAYQQPQLGHLNWTTNF 809


>ref|XP_002299597.2| hydroxyproline-rich glycoprotein [Populus trichocarpa]
            gi|550347518|gb|EEE84402.2| hydroxyproline-rich
            glycoprotein [Populus trichocarpa]
          Length = 854

 Score =  733 bits (1893), Expect = 0.0
 Identities = 428/862 (49%), Positives = 526/862 (61%), Gaps = 28/862 (3%)
 Frame = -3

Query: 2672 SQVEVETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDPFHEV 2493
            S  + +T  LSA+VRKTIQSIKE+VGN SD+DIY++LKETNMDPNETAQKLLNQDPFHEV
Sbjct: 18   SSGQQQTHTLSAKVRKTIQSIKEIVGNFSDADIYMVLKETNMDPNETAQKLLNQDPFHEV 77

Query: 2492 XXXXXXXKENTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNALPG---ISRE 2322
                   KENT++ GS + RK  EN  Q  + HTFSD            A PG   I+RE
Sbjct: 78   KRKREKKKENTSYRGSVDSRKHSENFGQGMRPHTFSDRNAQRGGYTRT-ASPGNRGINRE 136

Query: 2321 FXXXXXXXXXXXXXREIKPASLQSVPVVGVSPVAILSDQEHLGLRS----SEGQKSSEAL 2154
            F             RE KPA L            +++++   G+ S    S+ + S +A 
Sbjct: 137  FRVVRDNRVNQNTSREPKPALLHGSTSAKEQGSGVVTEKGSTGISSNLKPSDARSSHQAS 196

Query: 2153 NGPSDS--------SSGRGQGKEMFEDVGVKVPSSGSQSRGQ--KXXXXXXXXXXXXXXX 2004
            NGP DS        +S  G  K + E+    V S+ + SR Q  K               
Sbjct: 197  NGPIDSEPRHNRDANSSVGDRKVVSEEKR-SVASNATTSRVQVAKSNNSQQHNALQASSN 255

Query: 2003 XXXXXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXXXSHPVT 1824
                                   G VGAIKREVGVVG RR                    
Sbjct: 256  PVVGVYSSSTDPVHVPSPDSRSSGVVGAIKREVGVVGGRRQSFENAV------------- 302

Query: 1823 VSGKGISPS---PESFQAPTSVTKRDQGQTTATESVLPSISVNRSFLSNQYGSKPHQQLM 1653
               K +S S    ESF+  T+++K DQ   TA    +PS+ VNRSFL+NQY ++PHQQ +
Sbjct: 303  ---KDLSSSNSFSESFRPFTAISKTDQVSQTAAIEPMPSVPVNRSFLNNQYNNRPHQQAV 359

Query: 1652 GHQKAPQSNMEWKPKASQKSSHIRPGIIGATTPNS-PPTDISTESKIEA-NLLQKFSQVN 1479
            GH KA Q N EWKPK+SQKSS   PG+IG  T +S PPTD S   +++A NL  KFS++N
Sbjct: 360  GHPKASQHNKEWKPKSSQKSSVTSPGVIGTPTKSSSPPTDNSKNMELDAANLQDKFSRIN 419

Query: 1478 ISENDHVIIPLHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQANAEQSNGERSASV--S 1305
            I EN +VII  H+RVPE  R +LTFGSFG  FD+  +  ++    +E+SNGE + S+  S
Sbjct: 420  IHENQNVIIAQHIRVPETDRCKLTFGSFGVGFDAPRTPGFQAVGISEESNGESAISLPAS 479

Query: 1304 APSTPREDVN---QVDLLDGQXXXXXXXXXXXXXXSEHQLPYKNDSSPENLENYADIGLV 1134
            AP +  +D +   Q++LLD Q              SEH LP  N SSP NL+NYADIGLV
Sbjct: 480  APDSSSDDASGGKQIELLDDQARNYGSDSPAASLESEHPLPV-NSSSPPNLDNYADIGLV 538

Query: 1133 RNDSPSYSPAEPQHQQDAAGLPSFPAYDHQTGYDMPFFRPMIDESVHVQGAPSPAEALSS 954
            RN SPSY+P+E Q QQD   LPSF AYD QTGYD+ +FRP IDE+V  QG PSP EAL++
Sbjct: 539  RNSSPSYAPSESQQQQDHPELPSFSAYDPQTGYDISYFRPQIDETVRGQGLPSPQEALTT 598

Query: 953  HAINSIPASTXXXXXXXXXXQ-LYPQVHVPHFPNFMPYRQFVSPVFVPPMAMPGYSSNPA 777
            H  N +PAST            +YPQVHV  F N +PYRQF+SPV+VPPM MPGYSS+PA
Sbjct: 599  HTAN-VPASTMSTVQQQPPMAQMYPQVHVSQFTNLVPYRQFISPVYVPPMPMPGYSSSPA 657

Query: 776  AYPHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQGGYAMNHQGT 597
             YPHPSN NSY+LMPGG SHL A GLKYG   YKP+P   P GFGN+ +  GYA+N  G 
Sbjct: 658  -YPHPSNGNSYLLMPGGGSHLNANGLKYGIQHYKPVPGNNPAGFGNFVSPSGYAINAPGV 716

Query: 596  VGGATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNIPGQAPHAAS 417
            VG ATGLEDS+R+KYK+GN+YVPNPQAE SE+WIQ PRE+PGMQSAPYYN+PGQ  H A 
Sbjct: 717  VGSATGLEDSSRMKYKDGNLYVPNPQAEASEIWIQNPREIPGMQSAPYYNMPGQT-HTA- 774

Query: 416  YMPSHTGHASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXXXXXXXXXXX 237
            Y+PSHTGHASFN AAAAQ +H+ + GLY P+PQP A+ +P HH+                
Sbjct: 775  YLPSHTGHASFN-AAAAQSSHMQFPGLYPPTPQPTAMPSP-HHLGPVMGGNVGVGVAPSA 832

Query: 236  XGTQVGAYQQPQVGHMNWSTNF 171
             G QVGAYQQPQ+GH+NW+TNF
Sbjct: 833  PGAQVGAYQQPQLGHLNWTTNF 854


>gb|EXB29673.1| hypothetical protein L484_013447 [Morus notabilis]
          Length = 854

 Score =  728 bits (1878), Expect = 0.0
 Identities = 439/871 (50%), Positives = 543/871 (62%), Gaps = 33/871 (3%)
 Frame = -3

Query: 2684 MVLGSQVEVETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDP 2505
            MV  S+++   +ILSA VRKTIQSIKE+VGNHSD DIY+ LKETNMDPNETAQKLLNQDP
Sbjct: 1    MVSASRIDGGPQILSAGVRKTIQSIKEIVGNHSDIDIYLALKETNMDPNETAQKLLNQDP 60

Query: 2504 FHEVXXXXXXXKENTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNALP---- 2337
            FHEV       KE+  +  S +PR   E   Q +K +TFSD            +LP    
Sbjct: 61   FHEVRRKRDKKKESAGNDSSTDPRGHSEVKGQGSKVNTFSDRNARRGGYARN-SLPDRIM 119

Query: 2336 ---GISREFXXXXXXXXXXXXXREIKPASLQSVPVVGVSPVAILSDQEHLGLRSSE---- 2178
               G+SREF             RE KPAS    P    S    +S +   G  +SE    
Sbjct: 120  LHAGVSREFRVVRDNRVNRSLNREAKPASASPTPP---STFENISGKGSTGSSNSEKPTA 176

Query: 2177 GQKSSEALNGPSDS---------SSGRGQGKEMFEDVGVKVPSSGSQSRGQKXXXXXXXX 2025
             + SS+ L GPSDS         S+G  + KE+ E+  V   S  S+ +  K        
Sbjct: 177  SKNSSQGLYGPSDSHLRIAHDIESTGLVR-KEVSEEKRVTFSSVASRVQAGKANNARSQS 235

Query: 2024 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXX 1845
                                          G VGAIKREVGVVGVRR             
Sbjct: 236  AMVASSSSAIGVYSSSTDPVHVPSPDSRSSGSVGAIKREVGVVGVRRQSSDNSKSSVPSS 295

Query: 1844 XXSHPVTVSGKGISPSPESFQAPTSVTKRDQ-GQTTATESVLPSISVNRSFLSNQYGSKP 1668
              S+ + + G+G   S E+ Q+ ++++K D+ GQ  A+ES+LPS+SV+RS LS+ Y ++ 
Sbjct: 296  SFSNSL-LGGEG---SAETLQSFSTISKNDEVGQ--ASESILPSVSVSRSLLSSHYSNRQ 349

Query: 1667 -HQQLMGHQKAPQSNMEWKPKASQKSSHIRPGIIGATTPN-SPPTDISTESKIE-ANLLQ 1497
             HQQ +GHQKA Q N EWKPK+SQK S   PG+IG  T + SPP   S  S+ E A +L+
Sbjct: 350  QHQQPVGHQKASQPNKEWKPKSSQKPSLNNPGVIGTPTKSVSPPAHNSEVSESEPAKVLE 409

Query: 1496 KFSQVNISENDHVIIPLHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQANA-EQSNGER 1320
            K S+VNI EN +VII  H+RVPE  R +LTFGSFG EF+S + +   +QA A  +SNGE 
Sbjct: 410  KLSRVNIHENQNVIIAQHIRVPETDRCRLTFGSFGKEFESDSDLVNGYQAGAIGESNGEA 469

Query: 1319 SASVSAPSTPREDVN---QVDLLDGQXXXXXXXXXXXXXXSEHQLPYKNDS-SPENLENY 1152
            ++S+SAP +   D +   QVDL D Q              SE+Q P K +S SP+NL+NY
Sbjct: 470  ASSLSAPESSIGDASGSKQVDLTDEQIRNSGSDSPTSGGTSENQFPDKKESTSPQNLDNY 529

Query: 1151 ADIGLVRNDSPSYSPAEPQHQQDAAGLPSFPAYDHQTGYDMPFFRPM--IDESVHVQGAP 978
            ADIGLV+ +SPSY+PA+ Q Q +   LP F AYD QTGYD P+FRP    DE++  QG P
Sbjct: 530  ADIGLVQGNSPSYAPADSQ-QPEHPELPGFSAYDSQTGYDFPYFRPASATDEAMRGQGLP 588

Query: 977  SPAEALSSHAINSIPA--STXXXXXXXXXXQLYPQVHVPHFPNFMPYRQFVSPVFVPPMA 804
            +P EA SSH  NS+P   S           Q+YPQVHV HF N MPYRQF+SPV+VPPMA
Sbjct: 589  TPQEAFSSHNTNSVPTTISMVQQQQQPPVAQMYPQVHVSHFANLMPYRQFLSPVYVPPMA 648

Query: 803  MPGYSSNPAAYPHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQG 624
            MPGYSS+PA YPHPSN NSY+LMPGG +HL A  LKYG  Q+KP+PAG PTGFGN++N  
Sbjct: 649  MPGYSSSPA-YPHPSNGNSYLLMPGGGTHLNANSLKYGVQQFKPVPAGNPTGFGNFSNPN 707

Query: 623  GYAMNHQGTVGGATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNI 444
            GYA+N  G VGGATGLEDS+RIKYK+GN+YVPNPQAETSE+WIQ PRELPG+QS PYYN+
Sbjct: 708  GYAINTPGVVGGATGLEDSSRIKYKDGNLYVPNPQAETSEMWIQNPRELPGLQSTPYYNM 767

Query: 443  PGQAPHAASYMPSHTGHASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXX 264
            PGQ+PHAA Y+PSHTGHAS+N AAAAQ +H+ + GLYHP PQPAAIANP HH+       
Sbjct: 768  PGQSPHAA-YLPSHTGHASYN-AAAAQSSHMQFPGLYHP-PQPAAIANP-HHLGPAMGGN 823

Query: 263  XXXXXXXXXXGTQVGAYQQPQVGHMNWSTNF 171
                      G QVGAYQQPQ+GH+NW+TNF
Sbjct: 824  VGVGVAAAAPGAQVGAYQQPQLGHLNWTTNF 854


>ref|XP_007024587.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508779953|gb|EOY27209.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 849

 Score =  727 bits (1877), Expect = 0.0
 Identities = 434/864 (50%), Positives = 537/864 (62%), Gaps = 26/864 (3%)
 Frame = -3

Query: 2684 MVLGSQVEVETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDP 2505
            MV G+++E +   +SA VRKTIQSIKE+VGNHSD+DIYV LKE NMDPNET QKLL+QD 
Sbjct: 1    MVNGARIEGD---ISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDT 57

Query: 2504 FHEVXXXXXXXKENTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNALPGISR 2325
            FHEV       KE+  +  S + RK  ENV Q  K   + +             LPG++R
Sbjct: 58   FHEVRRKRDRKKESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRN-TLPGVNR 116

Query: 2324 EFXXXXXXXXXXXXXREIK-PASLQSVPVVGVSPVAILSDQEHLGLRSSEGQKSSEAL-- 2154
            EF             +++K P S  S       PV + +++   G  S++   SS +L  
Sbjct: 117  EFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNV-AEKGSTGTSSNQRPFSSRSLSQ 175

Query: 2153 --NGPS--------DSSSGRGQGKEMFEDVGVKVPSSGSQSRGQKXXXXXXXXXXXXXXX 2004
              NGPS        D++S     KE+ E+    +P++  +S+  K               
Sbjct: 176  TSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQSSSS 235

Query: 2003 XXXXXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXXXSHPVT 1824
                                   G VGAIKREVGVVGVRR               S   +
Sbjct: 236  SVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNS 295

Query: 1823 VSGKGISPSPESFQAPTSVTKRDQ-GQTTATESVLPSISVNRSFLSNQYGSKPHQQLMGH 1647
            + G+    S E+F++  S+++ DQ   T+ATES++P IS +RSFLSNQYGS+ +QQ +GH
Sbjct: 296  LVGR--DNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGH 353

Query: 1646 QKAPQSNMEWKPKASQKSSHIRPGIIGATTPN-SPPTDISTESKIE-ANLLQKFSQVNIS 1473
            QKA Q N EWKPK SQKSS   PG+IG    + SPP D +     E A L  KFSQVNI 
Sbjct: 354  QKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIY 413

Query: 1472 ENDHVIIPLHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQAN--AEQSNGERSAS--VS 1305
            EN++VII  H+RVPE  R +LTFGSFG EFDS  +    FQA   AE SNGE +AS  VS
Sbjct: 414  ENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASLSVS 473

Query: 1304 APSTPREDV---NQVDLLDGQXXXXXXXXXXXXXXSEHQLP-YKNDSSPENLENYADIGL 1137
            AP T  +D      +++LD Q              SEHQLP  K+ SSP+NL++YADIGL
Sbjct: 474  APDTSSDDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGL 533

Query: 1136 VRNDSPSYSPAEPQHQQDAAGLPSFPAYDHQTGYDMPFFRPMIDESVHVQGAPSPAEALS 957
            V+++SPSY+P+E Q QQD   LPSF AYD QTGYD+P+FRP IDE+   QG PSP EALS
Sbjct: 534  VQDNSPSYAPSESQKQQDPPELPSFSAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALS 593

Query: 956  SHAINSIPASTXXXXXXXXXXQ--LYPQVHVPHFPNFMPYRQFVSPVFVPPMAMPGYSSN 783
            +H  N +PAST             +YPQVHV HF N MPYRQFVSP+++P MAMPGYSSN
Sbjct: 594  AHTAN-VPASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSN 652

Query: 782  PAAYPHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQGGYAMNHQ 603
            PA YPHPSN +SY+LMPGGSSHL A GLKYG  Q+KP+PAG+PTGFGN+T+  GYA+N  
Sbjct: 653  PA-YPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAP 711

Query: 602  GTVGGATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNIPGQAPHA 423
            G VG  TGLEDS+RIKYK+GNIYVPN QA+TS+LWIQ PRELPG+QSAPYYN+P Q PH 
Sbjct: 712  GVVGNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNMP-QTPH- 769

Query: 422  ASYMPSHTGHASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXXXXXXXXX 243
              YMPSHTGHASFN AAAAQ +H+ + GLYHP PQPAA+ANP  H+              
Sbjct: 770  -GYMPSHTGHASFN-AAAAQSSHMQFPGLYHPPPQPAAMANP--HLGPAMGANVGVGVAP 825

Query: 242  XXXGTQVGAYQQPQVGHMNWSTNF 171
               G QVGAYQQPQ+GH+NW+TNF
Sbjct: 826  AAPGAQVGAYQQPQLGHLNWTTNF 849


>ref|XP_007024589.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508779955|gb|EOY27211.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 839

 Score =  719 bits (1856), Expect = 0.0
 Identities = 429/859 (49%), Positives = 531/859 (61%), Gaps = 21/859 (2%)
 Frame = -3

Query: 2684 MVLGSQVEVETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDP 2505
            MV G+++E +   +SA VRKTIQSIKE+VGNHSD+DIYV LKE NMDPNET QKLL+QD 
Sbjct: 1    MVNGARIEGD---ISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDT 57

Query: 2504 FHEVXXXXXXXKENTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNALPGISR 2325
            FHEV       KE+  +  S + RK  ENV Q  K   + +             LPG++R
Sbjct: 58   FHEVRRKRDRKKESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRN-TLPGVNR 116

Query: 2324 EFXXXXXXXXXXXXXREIK-PASLQSVPVVGVSPVAILSDQEHLGLRSSEGQKSSEAL-- 2154
            EF             +++K P S  S       PV + +++   G  S++   SS +L  
Sbjct: 117  EFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNV-AEKGSTGTSSNQRPFSSRSLSQ 175

Query: 2153 --NGPS--------DSSSGRGQGKEMFEDVGVKVPSSGSQSRGQKXXXXXXXXXXXXXXX 2004
              NGPS        D++S     KE+ E+    +P++  +S+  K               
Sbjct: 176  TSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQSSSS 235

Query: 2003 XXXXXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXXXSHPVT 1824
                                   G VGAIKREVGVVGVRR               S   +
Sbjct: 236  SVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNS 295

Query: 1823 VSGKGISPSPESFQAPTSVTKRDQ-GQTTATESVLPSISVNRSFLSNQYGSKPHQQLMGH 1647
            + G+    S E+F++  S+++ DQ   T+ATES++P IS +RSFLSNQYGS+ +QQ +GH
Sbjct: 296  LVGR--DNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGH 353

Query: 1646 QKAPQSNMEWKPKASQKSSHIRPGIIGATTPN-SPPTDISTESKIE-ANLLQKFSQVNIS 1473
            QKA Q N EWKPK SQKSS   PG+IG    + SPP D +     E A L  KFSQVNI 
Sbjct: 354  QKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVNIY 413

Query: 1472 ENDHVIIPLHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQAN--AEQSNGERSASVSAP 1299
            EN++VII  H+RVPE  R +LTFGSFG EFDS  +    FQA   AE SNGE +AS  A 
Sbjct: 414  ENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASDDAA 473

Query: 1298 STPREDVNQVDLLDGQXXXXXXXXXXXXXXSEHQLP-YKNDSSPENLENYADIGLVRNDS 1122
                     +++LD Q              SEHQLP  K+ SSP+NL++YADIGLV+++S
Sbjct: 474  GG-----KPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQDNS 528

Query: 1121 PSYSPAEPQHQQDAAGLPSFPAYDHQTGYDMPFFRPMIDESVHVQGAPSPAEALSSHAIN 942
            PSY+P+E Q QQD   LPSF AYD QTGYD+P+FRP IDE+   QG PSP EALS+H  N
Sbjct: 529  PSYAPSESQKQQDPPELPSFSAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSAHTAN 588

Query: 941  SIPASTXXXXXXXXXXQ--LYPQVHVPHFPNFMPYRQFVSPVFVPPMAMPGYSSNPAAYP 768
             +PAST             +YPQVHV HF N MPYRQFVSP+++P MAMPGYSSNPA YP
Sbjct: 589  -VPASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNPA-YP 646

Query: 767  HPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQGGYAMNHQGTVGG 588
            HPSN +SY+LMPGGSSHL A GLKYG  Q+KP+PAG+PTGFGN+T+  GYA+N  G VG 
Sbjct: 647  HPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGVVGN 706

Query: 587  ATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNIPGQAPHAASYMP 408
             TGLEDS+RIKYK+GNIYVPN QA+TS+LWIQ PRELPG+QSAPYYN+P Q PH   YMP
Sbjct: 707  PTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNMP-QTPH--GYMP 763

Query: 407  SHTGHASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXXXXXXXXXXXXGT 228
            SHTGHASFN AAAAQ +H+ + GLYHP PQPAA+ANP  H+                 G 
Sbjct: 764  SHTGHASFN-AAAAQSSHMQFPGLYHPPPQPAAMANP--HLGPAMGANVGVGVAPAAPGA 820

Query: 227  QVGAYQQPQVGHMNWSTNF 171
            QVGAYQQPQ+GH+NW+TNF
Sbjct: 821  QVGAYQQPQLGHLNWTTNF 839


>ref|XP_002521347.1| conserved hypothetical protein [Ricinus communis]
            gi|223539425|gb|EEF41015.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 864

 Score =  718 bits (1854), Expect = 0.0
 Identities = 437/857 (50%), Positives = 530/857 (61%), Gaps = 29/857 (3%)
 Frame = -3

Query: 2654 TKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDPFHEVXXXXXX 2475
            T  LSA VRKTIQSIKE+VGN SD+DIY+ LKETNMDPNETAQKLLNQDPFHEV      
Sbjct: 18   THTLSATVRKTIQSIKEIVGNFSDADIYMALKETNMDPNETAQKLLNQDPFHEVKRKRDK 77

Query: 2474 XKENTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNALPG---ISREFXXXXX 2304
             KE+ A+ GS + RK  EN+ Q TK  TFSD            A+PG   I+REF     
Sbjct: 78   KKESMAYRGSLDSRKNPENMGQGTKFRTFSDRNTRQGGYIRA-AVPGNAGINREFRVVRD 136

Query: 2303 XXXXXXXXREIKPASLQ---SVPVVGVSPVAILSDQEHLG-LRSSEGQKSSEALNGPSDS 2136
                    RE KPA  Q   S   +G+S V         G ++ S  + SS+A NGP DS
Sbjct: 137  NRVNLNTTREPKPAMQQGSISSDELGISTVTEKGSSGSSGNVKHSGVRSSSQASNGPPDS 196

Query: 2135 SSGRGQG--------KEMFEDVGVKVPSSGSQSRGQKXXXXXXXXXXXXXXXXXXXXXXX 1980
             S   +         K M E+    VPS+ S+ +  K                       
Sbjct: 197  QSRHTRDATSNFTDRKAMTEEKRAVVPSAASRIQVMKPSSQHHSATLASSNSVVGVYSSS 256

Query: 1979 XXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXXXSHPVTVSGKGISP 1800
                             VGAIKREVGVVG RR               S   +V G+  S 
Sbjct: 257  MDPVHVPSPESRSSAA-VGAIKREVGVVGGRRQSSENAVKNSSASSSSFSNSVLGRDGS- 314

Query: 1799 SPESFQAPTSVTKRDQ-GQTTATESVLPSISVNRSFLSNQYGSKPHQQLMGHQKAPQSNM 1623
             PESFQ   +++K DQ  +  ATES +PSISV RSFL NQY S+ HQ  +GHQKA Q N 
Sbjct: 315  LPESFQPFPTISKNDQVNEPVATESAMPSISVGRSFLGNQY-SRTHQTAVGHQKATQHNK 373

Query: 1622 EWKPKASQKSSHIRPGIIGATTPNS-PPTDISTESKIEANLLQ-KFSQVNISENDHVIIP 1449
            EWKPK+SQK+S   PG+IG  T +S PP   S + + +A  +Q K  +VNI EN +VII 
Sbjct: 374  EWKPKSSQKASVGSPGVIGTPTKSSSPPAGNSKDLESDATDMQEKLLRVNIYENQNVIIA 433

Query: 1448 LHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQANA--EQSNGERSASVSA--PSTPRED 1281
             H+RVPE  R +LTFGSFG EFDS+ ++   FQA    + S  E +AS+SA  P +  +D
Sbjct: 434  QHIRVPETDRCRLTFGSFGVEFDSSRNMPSGFQAAGVTKDSKAESAASLSASAPESSSDD 493

Query: 1280 VN---QVDLLDGQXXXXXXXXXXXXXXSEHQLPYKNDSSPENLENYADIGLVRNDSPSYS 1110
             +   QV+LLD Q              SEHQ P K+ SSP NL+NYADIGLVR+ SP ++
Sbjct: 494  ASGNKQVELLDEQVRNSGSDSPASGAVSEHQSPDKS-SSPPNLDNYADIGLVRDSSP-FT 551

Query: 1109 PAEPQHQQDAAGLPSFPAYDHQTGYDMPFFRPMIDESVHVQGAPSPAEALSSHAINSIPA 930
             +E QHQQD   LPSF AYD QT YDM +FRP IDE+V  QG  S  EAL SH ++S+PA
Sbjct: 552  SSESQHQQDPPELPSFSAYDPQTVYDMSYFRPQIDETVRGQGLQSAQEALISHRVDSMPA 611

Query: 929  STXXXXXXXXXXQL---YPQVHVPHFPNFMPYRQFVSPVFVPPMAMPGYSSNPAAYPHPS 759
            S+           +   YPQVHV H+ N MPYRQF+SPV+VP MAMPGYSSNPA YPHPS
Sbjct: 612  SSIPMVQQQQQPPIAQMYPQVHVSHYTNLMPYRQFLSPVYVPQMAMPGYSSNPA-YPHPS 670

Query: 758  NANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQGGYAMNHQGTVGGATG 579
            N +SY+LMPGGSSHL A GLKYG  Q+KP+P  +PTGFGN+T+  GYA+N  G VG ATG
Sbjct: 671  NGSSYLLMPGGSSHLSANGLKYGIQQFKPVPGSSPTGFGNFTSPTGYAINAPGVVGSATG 730

Query: 578  LEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNIPGQAPHAASYMPSHT 399
            LEDS+R+KYK+GN+YVPNPQAETSE+W+Q PRELPG+QSAPYYN+PGQ+PHAA Y+PSHT
Sbjct: 731  LEDSSRMKYKDGNLYVPNPQAETSEIWVQNPRELPGLQSAPYYNMPGQSPHAA-YLPSHT 789

Query: 398  GHASFNAAAAAQPTHLHYSGLY-HPSPQPAAIANPHHHIXXXXXXXXXXXXXXXXXGTQV 222
            GHASFN AAAAQ +H+ +SGLY  P P PAA+ANP HH+                 G QV
Sbjct: 790  GHASFN-AAAAQSSHMQFSGLYPPPPPTPAAMANP-HHLGPVMGGNVGVGVAPAAPGAQV 847

Query: 221  GAYQQPQVGHMNWSTNF 171
            GAYQQPQ+GH+NW+TNF
Sbjct: 848  GAYQQPQLGHLNWTTNF 864


>ref|XP_007024585.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508779951|gb|EOY27207.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 852

 Score =  717 bits (1852), Expect = 0.0
 Identities = 434/867 (50%), Positives = 537/867 (61%), Gaps = 29/867 (3%)
 Frame = -3

Query: 2684 MVLGSQVEVETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDP 2505
            MV G+++E +   +SA VRKTIQSIKE+VGNHSD+DIYV LKE NMDPNET QKLL+QD 
Sbjct: 1    MVNGARIEGD---ISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDT 57

Query: 2504 FHEVXXXXXXXKENTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNALP--GI 2331
            FHEV       KE+  +  S + RK  ENV Q  K   + +             LP  G+
Sbjct: 58   FHEVRRKRDRKKESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRN-TLPDAGV 116

Query: 2330 SREFXXXXXXXXXXXXXREIK-PASLQSVPVVGVSPVAILSDQEHLGLRSSEGQKSSEAL 2154
            +REF             +++K P S  S       PV + +++   G  S++   SS +L
Sbjct: 117  NREFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNV-AEKGSTGTSSNQRPFSSRSL 175

Query: 2153 ----NGPS--------DSSSGRGQGKEMFEDVGVKVPSSGSQSRGQKXXXXXXXXXXXXX 2010
                NGPS        D++S     KE+ E+    +P++  +S+  K             
Sbjct: 176  SQTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQSS 235

Query: 2009 XXXXXXXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXXXSHP 1830
                                     G VGAIKREVGVVGVRR               S  
Sbjct: 236  SSSVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLS 295

Query: 1829 VTVSGKGISPSPESFQAPTSVTKRDQ-GQTTATESVLPSISVNRSFLSNQYGSKPHQQLM 1653
             ++ G+    S E+F++  S+++ DQ   T+ATES++P IS +RSFLSNQYGS+ +QQ +
Sbjct: 296  NSLVGR--DNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQAL 353

Query: 1652 GHQKAPQSNMEWKPKASQKSSHIRPGIIGATTPN-SPPTDISTESKIE-ANLLQKFSQVN 1479
            GHQKA Q N EWKPK SQKSS   PG+IG    + SPP D +     E A L  KFSQVN
Sbjct: 354  GHQKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVN 413

Query: 1478 ISENDHVIIPLHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQAN--AEQSNGERSAS-- 1311
            I EN++VII  H+RVPE  R +LTFGSFG EFDS  +    FQA   AE SNGE +AS  
Sbjct: 414  IYENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASLS 473

Query: 1310 VSAPSTPREDV---NQVDLLDGQXXXXXXXXXXXXXXSEHQLP-YKNDSSPENLENYADI 1143
            VSAP T  +D      +++LD Q              SEHQLP  K+ SSP+NL++YADI
Sbjct: 474  VSAPDTSSDDAAGGKPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADI 533

Query: 1142 GLVRNDSPSYSPAEPQHQQDAAGLPSFP-AYDHQTGYDMPFFRPMIDESVHVQGAPSPAE 966
            GLV+++SPSY+P+E Q QQD   LPSF  AYD QTGYD+P+FRP IDE+   QG PSP E
Sbjct: 534  GLVQDNSPSYAPSESQKQQDPPELPSFSQAYDPQTGYDLPYFRPPIDETARGQGLPSPQE 593

Query: 965  ALSSHAINSIPASTXXXXXXXXXXQ--LYPQVHVPHFPNFMPYRQFVSPVFVPPMAMPGY 792
            ALS+H  N +PAST             +YPQVHV HF N MPYRQFVSP+++P MAMPGY
Sbjct: 594  ALSAHTAN-VPASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGY 652

Query: 791  SSNPAAYPHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQGGYAM 612
            SSNPA YPHPSN +SY+LMPGGSSHL A GLKYG  Q+KP+PAG+PTGFGN+T+  GYA+
Sbjct: 653  SSNPA-YPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAI 711

Query: 611  NHQGTVGGATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNIPGQA 432
            N  G VG  TGLEDS+RIKYK+GNIYVPN QA+TS+LWIQ PRELPG+QSAPYYN+P Q 
Sbjct: 712  NAPGVVGNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNMP-QT 770

Query: 431  PHAASYMPSHTGHASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXXXXXX 252
            PH   YMPSHTGHASFN AAAAQ +H+ + GLYHP PQPAA+ANP  H+           
Sbjct: 771  PH--GYMPSHTGHASFN-AAAAQSSHMQFPGLYHPPPQPAAMANP--HLGPAMGANVGVG 825

Query: 251  XXXXXXGTQVGAYQQPQVGHMNWSTNF 171
                  G QVGAYQQPQ+GH+NW+TNF
Sbjct: 826  VAPAAPGAQVGAYQQPQLGHLNWTTNF 852


>ref|XP_007024588.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508779954|gb|EOY27210.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 842

 Score =  709 bits (1831), Expect = 0.0
 Identities = 429/862 (49%), Positives = 531/862 (61%), Gaps = 24/862 (2%)
 Frame = -3

Query: 2684 MVLGSQVEVETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDP 2505
            MV G+++E +   +SA VRKTIQSIKE+VGNHSD+DIYV LKE NMDPNET QKLL+QD 
Sbjct: 1    MVNGARIEGD---ISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDT 57

Query: 2504 FHEVXXXXXXXKENTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNALP--GI 2331
            FHEV       KE+  +  S + RK  ENV Q  K   + +             LP  G+
Sbjct: 58   FHEVRRKRDRKKESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRN-TLPDAGV 116

Query: 2330 SREFXXXXXXXXXXXXXREIK-PASLQSVPVVGVSPVAILSDQEHLGLRSSEGQKSSEAL 2154
            +REF             +++K P S  S       PV + +++   G  S++   SS +L
Sbjct: 117  NREFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNV-AEKGSTGTSSNQRPFSSRSL 175

Query: 2153 ----NGPS--------DSSSGRGQGKEMFEDVGVKVPSSGSQSRGQKXXXXXXXXXXXXX 2010
                NGPS        D++S     KE+ E+    +P++  +S+  K             
Sbjct: 176  SQTSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQSS 235

Query: 2009 XXXXXXXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXXXSHP 1830
                                     G VGAIKREVGVVGVRR               S  
Sbjct: 236  SSSVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLS 295

Query: 1829 VTVSGKGISPSPESFQAPTSVTKRDQ-GQTTATESVLPSISVNRSFLSNQYGSKPHQQLM 1653
             ++ G+    S E+F++  S+++ DQ   T+ATES++P IS +RSFLSNQYGS+ +QQ +
Sbjct: 296  NSLVGR--DNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQAL 353

Query: 1652 GHQKAPQSNMEWKPKASQKSSHIRPGIIGATTPN-SPPTDISTESKIE-ANLLQKFSQVN 1479
            GHQKA Q N EWKPK SQKSS   PG+IG    + SPP D +     E A L  KFSQVN
Sbjct: 354  GHQKANQHNKEWKPKLSQKSSVNNPGVIGTPKKSASPPADDAKGLDSETAKLQDKFSQVN 413

Query: 1478 ISENDHVIIPLHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQAN--AEQSNGERSASVS 1305
            I EN++VII  H+RVPE  R +LTFGSFG EFDS  +    FQA   AE SNGE +AS  
Sbjct: 414  IYENENVIIAQHIRVPENDRCRLTFGSFGVEFDSLRNFVPGFQATGVAEDSNGESAASDD 473

Query: 1304 APSTPREDVNQVDLLDGQXXXXXXXXXXXXXXSEHQLP-YKNDSSPENLENYADIGLVRN 1128
            A          +++LD Q              SEHQLP  K+ SSP+NL++YADIGLV++
Sbjct: 474  AAGG-----KPIEILDDQIGNSGSDSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQD 528

Query: 1127 DSPSYSPAEPQHQQDAAGLPSFP-AYDHQTGYDMPFFRPMIDESVHVQGAPSPAEALSSH 951
            +SPSY+P+E Q QQD   LPSF  AYD QTGYD+P+FRP IDE+   QG PSP EALS+H
Sbjct: 529  NSPSYAPSESQKQQDPPELPSFSQAYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSAH 588

Query: 950  AINSIPASTXXXXXXXXXXQ--LYPQVHVPHFPNFMPYRQFVSPVFVPPMAMPGYSSNPA 777
              N +PAST             +YPQVHV HF N MPYRQFVSP+++P MAMPGYSSNPA
Sbjct: 589  TAN-VPASTIPMMQQQQPPVAQMYPQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNPA 647

Query: 776  AYPHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQGGYAMNHQGT 597
             YPHPSN +SY+LMPGGSSHL A GLKYG  Q+KP+PAG+PTGFGN+T+  GYA+N  G 
Sbjct: 648  -YPHPSNGSSYVLMPGGSSHLNANGLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGV 706

Query: 596  VGGATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNIPGQAPHAAS 417
            VG  TGLEDS+RIKYK+GNIYVPN QA+TS+LWIQ PRELPG+QSAPYYN+P Q PH   
Sbjct: 707  VGNPTGLEDSSRIKYKDGNIYVPNQQADTSDLWIQNPRELPGLQSAPYYNMP-QTPH--G 763

Query: 416  YMPSHTGHASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXXXXXXXXXXX 237
            YMPSHTGHASFN AAAAQ +H+ + GLYHP PQPAA+ANP  H+                
Sbjct: 764  YMPSHTGHASFN-AAAAQSSHMQFPGLYHPPPQPAAMANP--HLGPAMGANVGVGVAPAA 820

Query: 236  XGTQVGAYQQPQVGHMNWSTNF 171
             G QVGAYQQPQ+GH+NW+TNF
Sbjct: 821  PGAQVGAYQQPQLGHLNWTTNF 842


>ref|XP_002304144.2| hypothetical protein POPTR_0003s06200g [Populus trichocarpa]
            gi|550342535|gb|EEE79123.2| hypothetical protein
            POPTR_0003s06200g [Populus trichocarpa]
          Length = 858

 Score =  709 bits (1829), Expect = 0.0
 Identities = 424/855 (49%), Positives = 513/855 (60%), Gaps = 30/855 (3%)
 Frame = -3

Query: 2645 LSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDPFHEVXXXXXXXKE 2466
            LSARVRK IQSIKE+VGN SD+DIY++LKETNMDPNET QKLLNQDPFHEV       KE
Sbjct: 30   LSARVRKIIQSIKEIVGNFSDADIYMVLKETNMDPNETVQKLLNQDPFHEVKRKRDKKKE 89

Query: 2465 NTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNALP--GISREFXXXXXXXXX 2292
            + ++ GS + RK  EN  Q  +  TF D           +++   G++REF         
Sbjct: 90   SMSYRGSVDSRKQPENFDQGMRPRTFLDRYAQRGGHTRTDSIGNRGVNREFRVVRDNRIN 149

Query: 2291 XXXXREIKPASLQSVPVV-----GVSPVAILSDQEHLGLRSSEGQKSSEALNGPS----- 2142
                RE KPA  Q          GV+         +  L+ S  Q SS+  NGP+     
Sbjct: 150  QNANREPKPALPQGSTSAKEKGSGVTEKGSAGISNN-NLKPSNAQSSSQTSNGPTYPEPR 208

Query: 2141 ---DSSSGRGQGKEMFEDVGVKVPSSGSQSRGQ--KXXXXXXXXXXXXXXXXXXXXXXXX 1977
               D+ S  G  K + E+      S+ + SR Q  K                        
Sbjct: 209  YNRDAKSRAGDRKVVSEEKR-STASNATTSRAQVVKPNNSQQHDASLASSNSVVGVYSSS 267

Query: 1976 XXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXXXSHPVTVSGKGISPS 1797
                          G VGAIKREVGVVG RR                       K +S S
Sbjct: 268  TDPVHVPSPDSRSSGVVGAIKREVGVVGGRRQSENAV-----------------KDLSSS 310

Query: 1796 ---PESFQAPTSVTKRDQGQTTATESVLPSISVNRSFLSNQYGSKPHQQLMGHQKAPQSN 1626
                ESF   T+++  DQ + TA    +PS+ VNRS L NQY S+PHQQ +G+ KA Q N
Sbjct: 311  NSFSESFHPLTAISNTDQVRQTAVIESMPSVPVNRSLLHNQYNSRPHQQTVGYPKASQHN 370

Query: 1625 MEWKPKASQKSSHIRPGIIGATTPNS-PPTDISTESKIEA-NLLQKFSQVNISENDHVII 1452
             EWKPK+SQKSS   PG+IG  T +S PPTD S   ++ A NL  KFS+VNI EN +VII
Sbjct: 371  KEWKPKSSQKSSITSPGVIGTPTKSSLPPTDNSKSMELNAANLQDKFSRVNIHENQNVII 430

Query: 1451 PLHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQAN--AEQSNGERSASV--SAPSTPRE 1284
              H+RVPE+ R +LTFGSFG EFD + +    FQA   +E+SN E + S+  S P +  E
Sbjct: 431  AQHIRVPESDRCKLTFGSFGVEFDPSRNSTPGFQAVGISEESNRESAISLPASCPESSSE 490

Query: 1283 DV---NQVDLLDGQXXXXXXXXXXXXXXSEHQLPYKNDSSPENLENYADIGLVRNDSPSY 1113
            D     Q++LLD Q              SEHQLP K+ SSP +L+NYADIGLVRN SPSY
Sbjct: 491  DAPGGKQIELLDDQARNSESDSPEAGLASEHQLPEKS-SSPPDLDNYADIGLVRNSSPSY 549

Query: 1112 SPAEPQHQQDAAGLPSFPAYDHQTGYDMPFFRPMIDESVHVQGAPSPAEALSSHAINSIP 933
            +P+E Q QQD   LPSF AYD QTGYDM +F+P IDE+V  QG PSP EAL++H  N IP
Sbjct: 550  APSESQQQQDHPELPSFSAYDPQTGYDMSYFQPPIDETVQGQGQPSPREALTAHTGNHIP 609

Query: 932  ASTXXXXXXXXXXQ-LYPQVHVPHFPNFMPYRQFVSPVFVPPMAMPGYSSNPAAYPHPSN 756
             ST            +YPQVHV  F N MPYRQF+SPV+VPPM MPGYSSNPA YPHPSN
Sbjct: 610  TSTMPTMQQQPPMAQMYPQVHVSPFTNLMPYRQFISPVYVPPMPMPGYSSNPA-YPHPSN 668

Query: 755  ANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQGGYAMNHQGTVGGATGL 576
             NSYMLMPGG SHL A GLKYG   YKP+P+  P GFGN+T+  GYA+N  G VG A GL
Sbjct: 669  GNSYMLMPGGGSHLNANGLKYGIQHYKPVPSSNPAGFGNFTSPSGYAINAPGVVGSAAGL 728

Query: 575  EDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNIPGQAPHAASYMPSHTG 396
            ED +R+KYK+GNIYVPNPQAE+SE+WIQ PR+LPG+QS+PYYNIPGQ  HAA Y+PSHTG
Sbjct: 729  EDPSRMKYKDGNIYVPNPQAESSEIWIQNPRDLPGLQSSPYYNIPGQT-HAA-YLPSHTG 786

Query: 395  HASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXXXXXXXXXXXXGTQVGA 216
            HASFN AAAAQ +H+ + GLY P PQP A+A+P HH+                 G QVGA
Sbjct: 787  HASFN-AAAAQSSHMQFPGLY-PPPQPTAMASP-HHLGPVMGNNVGVGVAPSAPGAQVGA 843

Query: 215  YQQPQVGHMNWSTNF 171
            YQQPQ+GH+NW+TNF
Sbjct: 844  YQQPQLGHLNWTTNF 858


>ref|XP_007024584.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508779950|gb|EOY27206.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 883

 Score =  706 bits (1821), Expect = 0.0
 Identities = 434/898 (48%), Positives = 537/898 (59%), Gaps = 60/898 (6%)
 Frame = -3

Query: 2684 MVLGSQVEVETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDP 2505
            MV G+++E +   +SA VRKTIQSIKE+VGNHSD+DIYV LKE NMDPNET QKLL+QD 
Sbjct: 1    MVNGARIEGD---ISAPVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLLHQDT 57

Query: 2504 FHEVXXXXXXXKENTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNALPGISR 2325
            FHEV       KE+  +  S + RK  ENV Q  K   + +             LPG++R
Sbjct: 58   FHEVRRKRDRKKESIEYKVSLDSRKRSENVGQGMKFRPYPERGSRRGSYTRN-TLPGVNR 116

Query: 2324 EFXXXXXXXXXXXXXREIK-PASLQSVPVVGVSPVAILSDQEHLGLRSSEGQKSSEAL-- 2154
            EF             +++K P S  S       PV + +++   G  S++   SS +L  
Sbjct: 117  EFRVVRDNRVNQNANKDMKTPFSQCSTSANEQVPVNV-AEKGSTGTSSNQRPFSSRSLSQ 175

Query: 2153 --NGPS--------DSSSGRGQGKEMFEDVGVKVPSSGSQSRGQKXXXXXXXXXXXXXXX 2004
              NGPS        D++S     KE+ E+    +P++  +S+  K               
Sbjct: 176  TSNGPSSSQTRHARDANSSGIDRKEISEEKRNFIPNAVLRSQAVKPNNSQAHAATQSSSS 235

Query: 2003 XXXXXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXXXSHPVT 1824
                                   G VGAIKREVGVVGVRR               S   +
Sbjct: 236  SVVGVYSSSTDPVHVPSPDSRSSGAVGAIKREVGVVGVRRQPSENAVKDSSGSSGSLSNS 295

Query: 1823 VSGKGISPSPESFQAPTSVTKRDQ-GQTTATESVLPSISVNRSFLSNQYGSKPHQQLMGH 1647
            + G+    S E+F++  S+++ DQ   T+ATES++P IS +RSFLSNQYGS+ +QQ +GH
Sbjct: 296  LVGR--DNSSEAFRSFPSISRADQLSHTSATESIMPGISGSRSFLSNQYGSRQNQQALGH 353

Query: 1646 QK---------------------------APQSNMEWKPKASQKSSHIRPGIIGATTPN- 1551
            QK                           A Q N EWKPK SQKSS   PG+IG    + 
Sbjct: 354  QKEASYCSAFHPFIDQISLWESLSCIFDAANQHNKEWKPKLSQKSSVNNPGVIGTPKKSA 413

Query: 1550 SPPTDISTESKIE-ANLLQKFSQVNISENDHVIIPLHLRVPEAARTQLTFGSFGAEFDST 1374
            SPP D +     E A L  KFSQVNI EN++VII  H+RVPE  R +LTFGSFG EFDS 
Sbjct: 414  SPPADDAKGLDSETAKLQDKFSQVNIYENENVIIAQHIRVPENDRCRLTFGSFGVEFDSL 473

Query: 1373 NSVAYRFQAN--AEQSNGERSA--------SVSAPSTPREDV---NQVDLLDGQXXXXXX 1233
             +    FQA   AE SNGE +A        SVSAP T  +D      +++LD Q      
Sbjct: 474  RNFVPGFQATGVAEDSNGESAARLVFSPNLSVSAPDTSSDDAAGGKPIEILDDQIGNSGS 533

Query: 1232 XXXXXXXXSEHQLP-YKNDSSPENLENYADIGLVRNDSPSYSPAEPQHQQDAAGLPSFP- 1059
                    SEHQLP  K+ SSP+NL++YADIGLV+++SPSY+P+E Q QQD   LPSF  
Sbjct: 534  DSPLSGTASEHQLPDTKDTSSPQNLDSYADIGLVQDNSPSYAPSESQKQQDPPELPSFSQ 593

Query: 1058 AYDHQTGYDMPFFRPMIDESVHVQGAPSPAEALSSHAINSIPASTXXXXXXXXXXQ--LY 885
            AYD QTGYD+P+FRP IDE+   QG PSP EALS+H  N +PAST             +Y
Sbjct: 594  AYDPQTGYDLPYFRPPIDETARGQGLPSPQEALSAHTAN-VPASTIPMMQQQQPPVAQMY 652

Query: 884  PQVHVPHFPNFMPYRQFVSPVFVPPMAMPGYSSNPAAYPHPSNANSYMLMPGGSSHLPAG 705
            PQVHV HF N MPYRQFVSP+++P MAMPGYSSNPA YPHPSN +SY+LMPGGSSHL A 
Sbjct: 653  PQVHVSHFANIMPYRQFVSPIYLPQMAMPGYSSNPA-YPHPSNGSSYVLMPGGSSHLNAN 711

Query: 704  GLKYGASQYKPIPAGTPTGFGNYTNQGGYAMNHQGTVGGATGLEDSNRIKYKEGNIYVPN 525
            GLKYG  Q+KP+PAG+PTGFGN+T+  GYA+N  G VG  TGLEDS+RIKYK+GNIYVPN
Sbjct: 712  GLKYGIQQFKPVPAGSPTGFGNFTSPSGYAINAPGVVGNPTGLEDSSRIKYKDGNIYVPN 771

Query: 524  PQAETSELWIQTPRELPGMQSAPYYNIPGQAPHAASYMPSHTGHASFNAAAAAQPTHLHY 345
             QA+TS+LWIQ PRELPG+QSAPYYN+P Q PH   YMPSHTGHASFN AAAAQ +H+ +
Sbjct: 772  QQADTSDLWIQNPRELPGLQSAPYYNMP-QTPH--GYMPSHTGHASFN-AAAAQSSHMQF 827

Query: 344  SGLYHPSPQPAAIANPHHHIXXXXXXXXXXXXXXXXXGTQVGAYQQPQVGHMNWSTNF 171
             GLYHP PQPAA+ANP  H+                 G QVGAYQQPQ+GH+NW+TNF
Sbjct: 828  PGLYHPPPQPAAMANP--HLGPAMGANVGVGVAPAAPGAQVGAYQQPQLGHLNWTTNF 883


>ref|XP_007135474.1| hypothetical protein PHAVU_010G132600g [Phaseolus vulgaris]
            gi|561008519|gb|ESW07468.1| hypothetical protein
            PHAVU_010G132600g [Phaseolus vulgaris]
          Length = 864

 Score =  690 bits (1780), Expect = 0.0
 Identities = 431/877 (49%), Positives = 519/877 (59%), Gaps = 39/877 (4%)
 Frame = -3

Query: 2684 MVLGSQVE--VETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQ 2511
            MV GS+ E    T +LSARVRKTIQSIKE+VGNHSD+DIYV LKETNMDPNET QKLLNQ
Sbjct: 1    MVPGSRTESATGTHLLSARVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQ 60

Query: 2510 DPFHEVXXXXXXXKE--NTAHMGSAEPRKPIENVR-QWTKSHTFSDXXXXXXXXXXXNAL 2340
            DPFHEV       KE  N  + GSA+ R+P EN   Q  K HT S+             L
Sbjct: 61   DPFHEVKRRRDRKKEPQNVGNNGSADSRRPSENNSGQGVKFHTPSERNVRRANYSRN-TL 119

Query: 2339 PGISREFXXXXXXXXXXXXXREIKPASLQSVPVVGVSPVAILSDQ---EHLGLRSSEGQK 2169
            PGISREF              E+KP S Q +          LS++        RSS  + 
Sbjct: 120  PGISREFRVVRDNRVNYIYK-EVKPLSQQHLASASEELNVNLSEKGSSASTSHRSSGSRN 178

Query: 2168 SSEALNG--------PSDSSSGRGQGKEMFEDVGVKVPS--SGSQSRGQ--KXXXXXXXX 2025
            SS+ALNG        P D+       K   ED      S  S +  R Q  K        
Sbjct: 179  SSQALNGPSDSFARYPKDAVPNIVDRKIASEDKDKDKQSMISNAAERVQPIKPNHIHQNP 238

Query: 2024 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXX 1845
                                            VGAI+REVGVVGVRR             
Sbjct: 239  ASVASSSSAVGVYSSSTDPVHVPSPDSRSSSVVGAIRREVGVVGVRRQPSDNKVKQSFAP 298

Query: 1844 XXSHPVTVSGKGISPSPESFQAPTSVTKRDQ-GQTTATESVLPSISVNRSFLSNQYGSKP 1668
              S+   V+GK    S +SFQ   +V K +Q  QT  TE  L  + V+R  ++NQY  +P
Sbjct: 299  SSSY---VAGKD-GTSADSFQPVGAVLKTEQFSQTKVTEPSLSGVPVSRPSVNNQYNGRP 354

Query: 1667 HQQLMGHQKAPQSNMEWKPKASQKSSHIRPGIIGATTPN----SPPTDISTESKIEANLL 1500
            HQQL+GHQ+  Q N EWKPK+SQK +   PG+IG  TP     SPP + S + + +A  L
Sbjct: 355  HQQLVGHQRVSQQNKEWKPKSSQKPNSNNPGVIG--TPKKAAASPPAENSVDIESDAVEL 412

Query: 1499 Q-KFSQVNISENDHVIIPLHLRVPEAARTQLTFGSFGAEFDSTNSVA-YRFQANAEQSNG 1326
            Q K SQ+NI EN +VII  H++VPE  R +LTFG+ G E DS+   + Y     +E+SN 
Sbjct: 413  QDKLSQLNIYENQNVIIAQHIQVPETDRCRLTFGTIGTEIDSSRLQSKYHIVGPSEKSND 472

Query: 1325 ERSAS--VSAPSTPREDVN---QVDLLDGQXXXXXXXXXXXXXXSEHQLPYKNDSS-PEN 1164
            E +AS  V AP    +DV+   QVDLLD                SE QLP   DSS  +N
Sbjct: 473  ELAASLAVPAPELSTDDVSGSKQVDLLDEHIRSSGSDSPVSGAPSEQQLPDNKDSSNTQN 532

Query: 1163 LENYADIGLVRNDSPSYSPAEPQHQQDAAGLPSFPAYDHQTGYDMPFFRPMIDESVHVQG 984
            L+NYA+IGLVR+ SPSY+P+EPQ QQ++  +P F AYD  TGYD+P+FRP IDE+V  QG
Sbjct: 533  LDNYANIGLVRDSSPSYAPSEPQ-QQESHDMPGFAAYDPPTGYDIPYFRPTIDETVRGQG 591

Query: 983  APSPAEALSSHAINSIPASTXXXXXXXXXXQ-----LYPQVHVPHFPNFMPYRQFVSPVF 819
              SP EAL SH  N+ PAST          Q     +YPQ+HV HF N MPYRQF+SPV+
Sbjct: 592  LSSPQEALISHGTNNTPASTIAMVQQQQQQQPPVPQMYPQMHVSHFANLMPYRQFLSPVY 651

Query: 818  V-PPMAMPGYSSNPAAYPHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFG 642
            V PPMAMPGYSSNP  YPHP+N NSY+LMPGG SHL A  LKYG  QYKP+PAG P GFG
Sbjct: 652  VPPPMAMPGYSSNP-PYPHPTNGNSYVLMPGGGSHLNANNLKYGVQQYKPVPAGNPAGFG 710

Query: 641  NYTNQGGYAMNHQGTVGGATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQS 462
            N+ +  GYAM   G VGGAT LEDS+R+KYK+ N+YVPNPQAETSE+W+Q PR+LPGMQS
Sbjct: 711  NFASPAGYAMITPGVVGGATALEDSSRVKYKD-NLYVPNPQAETSEIWLQNPRDLPGMQS 769

Query: 461  APYYNIPGQAPHAASYMPSHTGHASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIX 282
            APYYN+PGQ PHAA YMPSHTGHASFN AAAAQ +H+ + G+YH  PQPAA+A+PHH   
Sbjct: 770  APYYNMPGQTPHAA-YMPSHTGHASFN-AAAAQSSHMQFPGMYHTPPQPAAMASPHHLGP 827

Query: 281  XXXXXXXXXXXXXXXXGTQVGAYQQPQVGHMNWSTNF 171
                            G QVGAYQQPQ+GH+NW+TNF
Sbjct: 828  PSIGNNVGVGVAAAAPGAQVGAYQQPQLGHINWTTNF 864


>ref|XP_003528451.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max]
          Length = 863

 Score =  686 bits (1770), Expect = 0.0
 Identities = 423/877 (48%), Positives = 509/877 (58%), Gaps = 39/877 (4%)
 Frame = -3

Query: 2684 MVLGSQVE--VETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQ 2511
            MV GS+ E    T +LSARVRKTIQSIKE+VGNHSD+DIYV LKETNMDPNET QKLLNQ
Sbjct: 1    MVPGSRTEGGTGTHLLSARVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQ 60

Query: 2510 DPFHEVXXXXXXXKE--NTAHMG--SAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNA 2343
            DPFHEV       KE  N  + G  SA+ R+  EN                       N 
Sbjct: 61   DPFHEVKRRRDRKKETQNVGNKGQPSADSRRSSENNSGQGMKFNAPSERNVRRTNYSRNT 120

Query: 2342 LPGISREFXXXXXXXXXXXXXREIKPASLQSVPV------VGVSPVAILSDQEHLGLRSS 2181
            LPGIS+EF              E+KP + Q          V        +   H   RSS
Sbjct: 121  LPGISKEFRVVRDNRVNHIYK-EVKPLTQQHSTSATEQLNVNTPDKGSSTSTNH---RSS 176

Query: 2180 EGQKSSEALNGPSDSSSGRGQGKEMFEDVGVKVPSSGSQSRGQ-----------KXXXXX 2034
              + SS A NGPSDS + R     +   +  K+ S     +G            K     
Sbjct: 177  GSRNSSLASNGPSDSHA-RYLKDAVPNIIDRKIASEDKDKQGMISNAAGRVQPIKPNNAH 235

Query: 2033 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXX 1854
                                             G VGAI+REVGVVGVRR          
Sbjct: 236  QNSASVASTSSAVGVYSSSTDPVHVPSPDSRSSGVVGAIRREVGVVGVRRQSSDNKAKQS 295

Query: 1853 XXXXXSHPVTVSGKGISPSPESFQAPTSVTKRDQ-GQTTATESVLPSISVNRSFLSNQYG 1677
                 S+ V   G     S +SFQ+  +V+K +Q  QT  TE  L  + V+R  L+NQY 
Sbjct: 296  FAPSISYVVGKDGT----SADSFQSVGAVSKTEQFSQTNVTEPSLSGMPVSRPSLNNQYN 351

Query: 1676 SKPHQQLMGHQKAPQSNMEWKPKASQKSSHIRPGIIG-----ATTPNSPPTDISTESKIE 1512
            ++PHQQL+GHQ+  Q N EWKPK+SQK +   PG+IG     A    SPP + S + +  
Sbjct: 352  NRPHQQLVGHQRVSQQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENSGDIESN 411

Query: 1511 ANLLQ-KFSQVNISENDHVIIPLHLRVPEAARTQLTFGSFGAEFDSTNSVA-YRFQANAE 1338
               LQ K SQVNI EN +VII  H+RVPE  R QLTFG+ G E DS+   + Y     +E
Sbjct: 412  TTELQDKLSQVNIYENQNVIIAQHIRVPETDRCQLTFGTIGTELDSSRLQSKYHIIGASE 471

Query: 1337 QSNGERSAS--VSAPSTPREDVN---QVDLLDGQXXXXXXXXXXXXXXSEHQLPYKNDSS 1173
            +SN E +AS  V AP    +DV+   QVDL D                SE QLP   DSS
Sbjct: 472  KSNEELTASLTVPAPELSTDDVSGSKQVDLRDEHIRSSRSDSPVSGAASEQQLPDNKDSS 531

Query: 1172 -PENLENYADIGLVRNDSPSYSPAEPQHQQDAAGLPSFPAYDHQTGYDMPFFRPMIDESV 996
              +NL+NYA+IGLVR+ SPSY+P+EPQ QQD+  +P F AYD   GYD+P+FRP IDE+V
Sbjct: 532  NTQNLDNYANIGLVRDSSPSYAPSEPQ-QQDSHDMPGFAAYDPPAGYDIPYFRPTIDETV 590

Query: 995  HVQGAPSPAEALSSHAINSIPASTXXXXXXXXXXQ--LYPQVHVPHFPNFMPYRQFVSPV 822
              QG  SP EAL SHA N+ PAST             +YPQVHV HF N MPYRQF+SPV
Sbjct: 591  RGQGLSSPQEALISHATNNPPASTIAMVQQQQPPVPQMYPQVHVSHFANLMPYRQFLSPV 650

Query: 821  FVPPMAMPGYSSNPAAYPHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFG 642
            +VPPMAMPGYSSNP  YPHP+N +SY+LMPGG SHL A  LKYG  Q+KP+PAG+PTGFG
Sbjct: 651  YVPPMAMPGYSSNPP-YPHPTNGSSYLLMPGGGSHLNANNLKYGVQQFKPVPAGSPTGFG 709

Query: 641  NYTNQGGYAMNHQGTVGGATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQS 462
            N+ N  GYAM   G VGGAT LEDS+R+KYK+ N+YVPNPQAETSE+W+Q PR+LPGMQS
Sbjct: 710  NFANPTGYAMITPGVVGGATALEDSSRVKYKD-NLYVPNPQAETSEIWLQNPRDLPGMQS 768

Query: 461  APYYNIPGQAPHAASYMPSHTGHASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIX 282
             PYYN+PGQ PHAA YMPSHTGHASFN AAAAQ +H+ + G+YH  PQPAA+A+PHH   
Sbjct: 769  TPYYNMPGQTPHAA-YMPSHTGHASFN-AAAAQSSHMQFPGMYHTPPQPAAMASPHHLGP 826

Query: 281  XXXXXXXXXXXXXXXXGTQVGAYQQPQVGHMNWSTNF 171
                            G QVGAYQQPQ+GH+NW+TNF
Sbjct: 827  PAIGNNVGVGVAAAAPGAQVGAYQQPQLGHINWTTNF 863


>ref|XP_006583148.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Glycine max]
          Length = 855

 Score =  685 bits (1768), Expect = 0.0
 Identities = 423/876 (48%), Positives = 509/876 (58%), Gaps = 38/876 (4%)
 Frame = -3

Query: 2684 MVLGSQVE--VETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQ 2511
            MV GS+ E    T +LSARVRKTIQSIKE+VGNHSD+DIYV LKETNMDPNET QKLLNQ
Sbjct: 1    MVPGSRTEGGTGTHLLSARVRKTIQSIKEIVGNHSDADIYVALKETNMDPNETTQKLLNQ 60

Query: 2510 DPFHEVXXXXXXXKE--NTAHMG--SAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNA 2343
            DPFHEV       KE  N  + G  SA+ R+  EN                       N 
Sbjct: 61   DPFHEVKRRRDRKKETQNVGNKGQPSADSRRSSENNSGQGMKFNAPSERNVRRTNYSRNT 120

Query: 2342 LPGISREFXXXXXXXXXXXXXREIKPASLQSVPVVGVSPVAILSDQEHLGLR-----SSE 2178
            LPGIS+EF              E+KP + Q             S  E L +      SS 
Sbjct: 121  LPGISKEFRVVRDNRVNHIYK-EVKPLTQQHST----------SATEQLNVNTPDKGSSG 169

Query: 2177 GQKSSEALNGPSDSSSGRGQGKEMFEDVGVKVPSSGSQSRGQ-----------KXXXXXX 2031
             + SS A NGPSDS + R     +   +  K+ S     +G            K      
Sbjct: 170  SRNSSLASNGPSDSHA-RYLKDAVPNIIDRKIASEDKDKQGMISNAAGRVQPIKPNNAHQ 228

Query: 2030 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXX 1851
                                            G VGAI+REVGVVGVRR           
Sbjct: 229  NSASVASTSSAVGVYSSSTDPVHVPSPDSRSSGVVGAIRREVGVVGVRRQSSDNKAKQSF 288

Query: 1850 XXXXSHPVTVSGKGISPSPESFQAPTSVTKRDQ-GQTTATESVLPSISVNRSFLSNQYGS 1674
                S+ V   G     S +SFQ+  +V+K +Q  QT  TE  L  + V+R  L+NQY +
Sbjct: 289  APSISYVVGKDGT----SADSFQSVGAVSKTEQFSQTNVTEPSLSGMPVSRPSLNNQYNN 344

Query: 1673 KPHQQLMGHQKAPQSNMEWKPKASQKSSHIRPGIIG-----ATTPNSPPTDISTESKIEA 1509
            +PHQQL+GHQ+  Q N EWKPK+SQK +   PG+IG     A    SPP + S + +   
Sbjct: 345  RPHQQLVGHQRVSQQNKEWKPKSSQKPNSNSPGVIGTPKKAAVAAASPPAENSGDIESNT 404

Query: 1508 NLLQ-KFSQVNISENDHVIIPLHLRVPEAARTQLTFGSFGAEFDSTNSVA-YRFQANAEQ 1335
              LQ K SQVNI EN +VII  H+RVPE  R QLTFG+ G E DS+   + Y     +E+
Sbjct: 405  TELQDKLSQVNIYENQNVIIAQHIRVPETDRCQLTFGTIGTELDSSRLQSKYHIIGASEK 464

Query: 1334 SNGERSAS--VSAPSTPREDVN---QVDLLDGQXXXXXXXXXXXXXXSEHQLPYKNDSS- 1173
            SN E +AS  V AP    +DV+   QVDL D                SE QLP   DSS 
Sbjct: 465  SNEELTASLTVPAPELSTDDVSGSKQVDLRDEHIRSSRSDSPVSGAASEQQLPDNKDSSN 524

Query: 1172 PENLENYADIGLVRNDSPSYSPAEPQHQQDAAGLPSFPAYDHQTGYDMPFFRPMIDESVH 993
             +NL+NYA+IGLVR+ SPSY+P+EPQ QQD+  +P F AYD   GYD+P+FRP IDE+V 
Sbjct: 525  TQNLDNYANIGLVRDSSPSYAPSEPQ-QQDSHDMPGFAAYDPPAGYDIPYFRPTIDETVR 583

Query: 992  VQGAPSPAEALSSHAINSIPASTXXXXXXXXXXQ--LYPQVHVPHFPNFMPYRQFVSPVF 819
             QG  SP EAL SHA N+ PAST             +YPQVHV HF N MPYRQF+SPV+
Sbjct: 584  GQGLSSPQEALISHATNNPPASTIAMVQQQQPPVPQMYPQVHVSHFANLMPYRQFLSPVY 643

Query: 818  VPPMAMPGYSSNPAAYPHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGN 639
            VPPMAMPGYSSNP  YPHP+N +SY+LMPGG SHL A  LKYG  Q+KP+PAG+PTGFGN
Sbjct: 644  VPPMAMPGYSSNPP-YPHPTNGSSYLLMPGGGSHLNANNLKYGVQQFKPVPAGSPTGFGN 702

Query: 638  YTNQGGYAMNHQGTVGGATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSA 459
            + N  GYAM   G VGGAT LEDS+R+KYK+ N+YVPNPQAETSE+W+Q PR+LPGMQS 
Sbjct: 703  FANPTGYAMITPGVVGGATALEDSSRVKYKD-NLYVPNPQAETSEIWLQNPRDLPGMQST 761

Query: 458  PYYNIPGQAPHAASYMPSHTGHASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXX 279
            PYYN+PGQ PHAA YMPSHTGHASFN AAAAQ +H+ + G+YH  PQPAA+A+PHH    
Sbjct: 762  PYYNMPGQTPHAA-YMPSHTGHASFN-AAAAQSSHMQFPGMYHTPPQPAAMASPHHLGPP 819

Query: 278  XXXXXXXXXXXXXXXGTQVGAYQQPQVGHMNWSTNF 171
                           G QVGAYQQPQ+GH+NW+TNF
Sbjct: 820  AIGNNVGVGVAAAAPGAQVGAYQQPQLGHINWTTNF 855


>ref|XP_004163891.1| PREDICTED: uncharacterized protein LOC101226902 [Cucumis sativus]
          Length = 846

 Score =  685 bits (1767), Expect = 0.0
 Identities = 412/859 (47%), Positives = 513/859 (59%), Gaps = 21/859 (2%)
 Frame = -3

Query: 2684 MVLGSQVEVETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDP 2505
            MV G +V+  T +L ARVRKTIQSIKE+VGNHSD+DIY  LKETNMDPNETAQKLLNQDP
Sbjct: 1    MVSGLRVDGGTHVLPARVRKTIQSIKEIVGNHSDADIYTTLKETNMDPNETAQKLLNQDP 60

Query: 2504 FHEVXXXXXXXKENTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNALPGISR 2325
            F EV       KEN  + GS + ++  E+VRQ TK +T SD            + PGIS+
Sbjct: 61   FREVKRRRDKKKENVGYKGSLDAQRNSEDVRQGTKVYTLSDRNVRRGAYAKS-SWPGISK 119

Query: 2324 EFXXXXXXXXXXXXXREIKPAS----LQSVPV-VGVSPVAILSDQEHLG-----LRSSEG 2175
            EF             RE+KPAS    L +  V   VS   I     H G     +     
Sbjct: 120  EFRVVRDNRVNRNSNREVKPASSHLALSTNEVSTNVSKSVITPRGAHGGSFGGRISQVSF 179

Query: 2174 QKSSEALNGPSDSSSGRGQGKEMFEDVGVKVPSSGSQSRGQKXXXXXXXXXXXXXXXXXX 1995
            +K+    + P D  S     KE+ +DVGV + SS                          
Sbjct: 180  RKTDSHPSNPRDGHSTGMAQKELRDDVGVSMLSSIPDMHIGNPNDSEPHSPVLASNGAAV 239

Query: 1994 XXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXXXSHPVTVSG 1815
                                  VGAIKREVG VGVRR               S   +VS 
Sbjct: 240  GLYSSSTDPVHVPSPDSRSSAPVGAIKREVGAVGVRRQLKDSSINQSSGPSVSLANSVSE 299

Query: 1814 KGISPSPESFQAPTSVTKRDQGQTTATESVLPSISVNRSFLSNQYGSKPHQQLMGHQKAP 1635
            +    S +SFQ P S T + +  +  TESV+P +  +R+ L+NQ+ S+ HQ  MGHQKA 
Sbjct: 300  R--DGSSDSFQ-PMSSTSKGEQLSQITESVIPGLVGSRTSLNNQHSSRQHQPTMGHQKAS 356

Query: 1634 QSNMEWKPKASQKSSHIRPGIIGATTPNSPPTDISTESKIEA-NLLQKFSQVNISENDHV 1458
            Q N EWKPK+SQK S   PG+IG  + +  P D S E   EA N+ +K ++V++ EN HV
Sbjct: 357  QPNKEWKPKSSQKLSTGNPGVIGTPSKSKAPADESKELHSEAANVQEKLARVDLHENQHV 416

Query: 1457 IIPLHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQA--NAEQSNGERSAS--VSAPSTP 1290
            II  H+RVP+  + +L FGSFG E DS+  +    QA    E+ NGE SAS  VSA    
Sbjct: 417  IIAEHIRVPDNDQYRLVFGSFGTESDSSGCLVSGLQAIRGPEELNGESSASQSVSALEIS 476

Query: 1289 REDVN---QVDLLDGQXXXXXXXXXXXXXXSEHQLPYKNDSS-PENLENYADIGLVRNDS 1122
             +D +   QVDLLD Q              +E Q   K +SS P+ L+ YA+IGLVR+ +
Sbjct: 477  TDDASGSRQVDLLDDQVRNSESNSPDSGTATELQSADKRESSSPQPLDTYAEIGLVRDRN 536

Query: 1121 PSYSPAEPQHQQDAAGLPSFPAYDHQTGYDMPFFRPMIDESVHVQGAPSPAEALSSHAIN 942
              Y+PA PQHQ D + L  F AYD QTGYD+P+FRP +DE+V VQG PS  +A++SH  N
Sbjct: 537  LKYTPA-PQHQ-DPSELLGFSAYDPQTGYDLPYFRPTMDETVRVQGLPSQ-DAVNSHTAN 593

Query: 941  SIPASTXXXXXXXXXXQ--LYPQVHVPHFPNFMPYRQFVSPVFVPPMAMPGYSSNPAAYP 768
             IPAST             +YPQVHV HF N MPYRQF+SPV+VPPMAMPGYSS+PA YP
Sbjct: 594  GIPASTMPMVQQQQTPVAQMYPQVHVSHFANLMPYRQFLSPVYVPPMAMPGYSSSPA-YP 652

Query: 767  HPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQGGYAMNHQGTVGG 588
            HPSN NS++LMPGGS+H+ A  LKYG  Q+KP+PAG+P GFGN+ +  G+A+N  G VG 
Sbjct: 653  HPSNGNSFLLMPGGSTHMNANNLKYGIQQFKPLPAGSPAGFGNFNSPAGFAVNAPGVVGS 712

Query: 587  ATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNIPGQAPHAASYMP 408
            ATGLEDS+RIKYK+GN+YVPN QAETSE+WIQ PR+LPG+QSAPYYN+PGQ PH A Y+P
Sbjct: 713  ATGLEDSSRIKYKDGNLYVPNAQAETSEIWIQNPRDLPGLQSAPYYNMPGQTPHGA-YLP 771

Query: 407  SHTGHASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXXXXXXXXXXXXGT 228
            SHTGHASF +AA AQ TH+ + GLYHP+PQPAAI NPHH                   G 
Sbjct: 772  SHTGHASF-SAAVAQSTHMQFPGLYHPTPQPAAIGNPHH---MGPGMGGNVGVAAATPGP 827

Query: 227  QVGAYQQPQVGHMNWSTNF 171
            QVG +QQPQ+GH+NW+TNF
Sbjct: 828  QVGTFQQPQLGHLNWTTNF 846


>ref|XP_006598817.1| PREDICTED: putative uncharacterized protein DDB_G0277255-like
            [Glycine max]
          Length = 852

 Score =  677 bits (1748), Expect = 0.0
 Identities = 423/871 (48%), Positives = 514/871 (59%), Gaps = 33/871 (3%)
 Frame = -3

Query: 2684 MVLGSQVEV----ETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLL 2517
            MV GS+ E      T +LSARVRKTIQSIKE+VGNHSD+DIYV LKE NMDPNET QKLL
Sbjct: 1    MVPGSKTEGGGTGTTHLLSARVRKTIQSIKEIVGNHSDADIYVALKEANMDPNETTQKLL 60

Query: 2516 NQDPFHEVXXXXXXXKE--NTAHMG--SAEPRKPIENVR-QWTKSHTFSDXXXXXXXXXX 2352
            NQDPFHEV       KE  N  + G  SA+ R+P EN   Q  K HT S+          
Sbjct: 61   NQDPFHEVKRRRDRKKETQNVGNRGQPSADSRRPSENNSGQGMKFHTHSERNVRRTNYSR 120

Query: 2351 XNALPGISREFXXXXXXXXXXXXXREIKPASLQSVPVVGVSPVAILSDQEHLGLRSSEGQ 2172
                PGISREF              E+ P S Q    V       +SD+   G R+S   
Sbjct: 121  S-TFPGISREFRVVRDNRVNHIYK-EVTPLSQQHSTSVTEQLNVNISDKGSSGSRNS--- 175

Query: 2171 KSSEALNGPSDSSSGRG----QGKEMFEDVGVKVPSSGSQSRGQ--KXXXXXXXXXXXXX 2010
              S+A NGPSDS +         K ++ED   +   S +  R Q  K             
Sbjct: 176  --SQASNGPSDSHARYAPKTIDRKIVYEDKDKQGMISNAAGRVQPIKPNSVHQNSALVAS 233

Query: 2009 XXXXXXXXXXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXXXSHP 1830
                                    PG VGAI+REVG VGVRR                H 
Sbjct: 234  TSSAVGVYSSSTDPVHVPSPDSRSPGVVGAIRREVGFVGVRRQSSDNKAKQSFAPSSPHV 293

Query: 1829 VTVSGKGISPSPESFQAPTSVTKRDQ-GQTTATESVLPSISVNRSFLSNQYGSKPHQQLM 1653
            V   G     S +SFQ+  +V+K +Q  QT  TE  L  + V+R  L+NQ+ ++PHQQL+
Sbjct: 294  VGKDGT----SADSFQSVGAVSKTEQFSQTNVTEPSLSGMPVSRPSLNNQHNNRPHQQLV 349

Query: 1652 GHQKAPQSNMEWKPKASQKSS-HIRPGIIG----ATTPNSPPTDISTESKIEANLLQ--- 1497
            GHQ+  Q N EWKPK+SQK + +  PG+IG    A    SPP + S +  IE+N ++   
Sbjct: 350  GHQRVSQQNKEWKPKSSQKPNCNNSPGVIGTPKKAAAAASPPAENSGD--IESNTVELQD 407

Query: 1496 KFSQVNISENDHVIIPLHLRVPEAARTQLTFGSFGAEFDSTNSVA-YRFQANAEQSNGER 1320
            K SQVNI EN +VII  H+RVPE  R +LTFG+ G E DS+   + Y     +E+SN E 
Sbjct: 408  KLSQVNIYENQNVIIAQHIRVPETDRCRLTFGTIGTELDSSRPQSKYHIIGASEKSNEEL 467

Query: 1319 SAS--VSAPSTPREDVN---QVDLLDGQXXXXXXXXXXXXXXSEHQLPYKNDSS-PENLE 1158
            +AS  V AP    +DV+   QVDL D                SE QLP   DSS  +NL+
Sbjct: 468  TASLTVPAPELSTDDVSGSKQVDLRDEHIRSLGSDSPVSGATSEQQLPDNKDSSNTKNLD 527

Query: 1157 NYADIGLVRNDSPSYSPAEPQHQQDAAGLPSFPAYDHQTGYDMPFFRPMIDESVHVQGAP 978
            NYA+IGLVR+ SPSY+P+E Q QQD+  +P F AYD   GYD+P+FRP IDE+V  QG  
Sbjct: 528  NYANIGLVRDSSPSYAPSE-QQQQDSHDMPGFAAYDSPAGYDIPYFRPTIDETVRGQGLS 586

Query: 977  SPAEALSSHAINSIPASTXXXXXXXXXXQ--LYPQVHVPHFPNFMPYRQFVSPVFVPPMA 804
            SP EAL SH  N+ PAST             +YPQVHV HF N MPYRQF+SPV+VPPMA
Sbjct: 587  SPQEALISHPTNT-PASTIAMVQQQQPPVPQMYPQVHVSHFANLMPYRQFLSPVYVPPMA 645

Query: 803  MPGYSSNPAAYPHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQG 624
            MPGYSSNP  YPHP+N +SY+LMPGG SHL A  LKYG  Q+KP+PAG+PTGFGN+ N  
Sbjct: 646  MPGYSSNPP-YPHPTNGSSYLLMPGGGSHLNANNLKYGVQQFKPVPAGSPTGFGNFANPT 704

Query: 623  GYAMNHQGTVGGATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNI 444
            GYAM   G VGGAT LEDS+R+KYK+ N+YVPNPQAETSE+W+Q PR+ PGMQS PYYN+
Sbjct: 705  GYAMITPGVVGGATALEDSSRVKYKD-NLYVPNPQAETSEIWLQNPRDHPGMQSTPYYNM 763

Query: 443  PGQAPHAASYMPSHTGHASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXX 264
            PGQ PHAA YMPSHTGHASFN  A AQ +H+ + G+YH  PQPAA+A+PHH         
Sbjct: 764  PGQTPHAA-YMPSHTGHASFN-GATAQSSHMQFPGMYHTPPQPAAMASPHHLGPPAIGNN 821

Query: 263  XXXXXXXXXXGTQVGAYQQPQVGHMNWSTNF 171
                      G QVGAYQQPQ+GH+NW+TNF
Sbjct: 822  VGVGVAAAAPGAQVGAYQQPQLGHINWTTNF 852


>ref|XP_006426626.1| hypothetical protein CICLE_v10024871mg [Citrus clementina]
            gi|557528616|gb|ESR39866.1| hypothetical protein
            CICLE_v10024871mg [Citrus clementina]
          Length = 866

 Score =  669 bits (1726), Expect = 0.0
 Identities = 401/860 (46%), Positives = 504/860 (58%), Gaps = 26/860 (3%)
 Frame = -3

Query: 2672 SQVEVETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDPFHEV 2493
            +++E  T+ILSA +R TIQ+IKE+VGNHSD+DIY  LK++NMDPNETAQKLLNQDPF EV
Sbjct: 17   TRIEGGTQILSAGMRNTIQTIKEIVGNHSDADIYFTLKDSNMDPNETAQKLLNQDPFLEV 76

Query: 2492 XXXXXXXKENTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNALP--GISREF 2319
                   KEN ++    EPRK  E   +  +  T++D           NALP  GI+REF
Sbjct: 77   KRRRDKKKENMSYKSLEEPRKNSEIFGKTMRIRTYAD-RNARRRGYNRNALPDAGINREF 135

Query: 2318 XXXXXXXXXXXXXREIKPASLQSVPVVGVSPVAILSDQEHLGLRSSE----GQKSSEALN 2151
                         +E K    QS          +       G   SE    G+  S+A N
Sbjct: 136  RVVRDNRVNPEANQETKSPLPQSSISTNEKVTNVKEKGSPTGTTGSEKPSGGRSFSQASN 195

Query: 2150 G-----PSDSSSGRGQGKEMFEDVGVKVPSSGSQSRGQKXXXXXXXXXXXXXXXXXXXXX 1986
            G     P  +      G +  E    K  +S                             
Sbjct: 196  GSTNLHPRHAYDHNITGTDRIEPSAEKFTTSAVNFIQHNITEGYSATLASSNSVGGYFSS 255

Query: 1985 XXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXXXSHPVTVSGKGI 1806
                             G   AIKREVGVVG  R               S   ++ G+  
Sbjct: 256  KDPVHVPSPDSRASSAVG---AIKREVGVVGGGRQCSDNAVKDSTAPCSSFSNSILGRDN 312

Query: 1805 SPSPESFQAPTSVTKRDQ-GQTTATESVLPSISVNRSFLSNQYGSKPHQQLMGHQKAPQS 1629
            S   +SF+   S++K DQ  Q  AT+S +  +  NR+  +NQY  + HQQ +GHQKA Q 
Sbjct: 313  S---DSFRPFPSISKADQINQIAATDSGVAGMPANRALFTNQYTGRSHQQSVGHQKASQH 369

Query: 1628 NMEWKPKASQKSSHIRPGIIGATTPN-SPPTDISTESKIE-ANLLQKFSQVNISENDHVI 1455
            N EWKPK+SQKS+ I PG+IG  T + SPP D S + + + A L  + S+VNI EN +VI
Sbjct: 370  NKEWKPKSSQKSNVIGPGVIGTPTKSPSPPVDDSKDLESDVAKLQDELSRVNIHENQNVI 429

Query: 1454 IPLHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQA--NAEQSNGERSASV--SAPSTPR 1287
            I  H+RVPE  R +LTFGSFG +F+S+ ++   F A  +AE+SNGE +AS+  +A  T  
Sbjct: 430  IAQHIRVPETDRCRLTFGSFGVDFESSRNLGSGFLAAGSAEESNGESAASLTGAASKTSG 489

Query: 1286 EDVN---QVDLLDGQXXXXXXXXXXXXXXSEHQLP--YKNDSSPENLENYADIGLVRNDS 1122
             DV+    VD+LD                SEHQLP   K+ SSP++L+ YADIGLVR+  
Sbjct: 490  NDVSGRKPVDILDDLVRNSGSNSPASGEASEHQLPDDIKDASSPQDLDGYADIGLVRDTD 549

Query: 1121 PSYSPAEPQHQQDAAGLPSFPAYDHQTGYDMPFFRPMIDESVHVQGAPSPAEALSSHAIN 942
            PSY  +E Q QQD++ L SFPAYD QTGYDM +FRP +DESV  QG PSP EAL+SH+ N
Sbjct: 550  PSYPLSESQQQQDSSELASFPAYDSQTGYDMSYFRPTMDESVRGQGLPSPQEALASHSAN 609

Query: 941  SIPAST---XXXXXXXXXXQLYPQVHVPHFPNFMPYRQFVSPVFVPPMAMPGYSSNPAAY 771
            SIPAS+             Q+YPQVHV HFPN MPYRQ +SPV+VP MAMPGYSSNP AY
Sbjct: 610  SIPASSIAMLQHQQQPQMAQMYPQVHVSHFPNMMPYRQIISPVYVPQMAMPGYSSNP-AY 668

Query: 770  PHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQGGYAMNHQGTVG 591
            PHPSN +SY+LMPGGSSHL   GLKYG  Q+KP+P  +PTGFGN+T+  GYA+N    VG
Sbjct: 669  PHPSNGSSYLLMPGGSSHLSTNGLKYGIQQFKPVPTASPTGFGNFTSPAGYAINAPSVVG 728

Query: 590  GATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNIPGQAPHAASYM 411
              TGLEDS+R+KYK+GN+YV N QA+TSELWI  PRELPGMQS PYYN+P Q PHAA+Y+
Sbjct: 729  SVTGLEDSSRMKYKDGNLYVSNQQADTSELWIHNPRELPGMQSGPYYNMPAQTPHAAAYL 788

Query: 410  PSHTGHASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXXXXXXXXXXXXG 231
            PSH GHASFN AA  Q +H+ + G+YHP+ QP A+ANP HH+                 G
Sbjct: 789  PSHAGHASFN-AAVPQSSHMQFPGMYHPTAQPPAMANP-HHMGPAMGGNVGVGVPPAAPG 846

Query: 230  TQVGAYQQPQVGHMNWSTNF 171
             QVGAYQQPQ+G+ NWS NF
Sbjct: 847  AQVGAYQQPQLGNFNWSPNF 866


>ref|XP_006465941.1| PREDICTED: dentin sialophosphoprotein-like [Citrus sinensis]
          Length = 862

 Score =  664 bits (1714), Expect = 0.0
 Identities = 401/860 (46%), Positives = 504/860 (58%), Gaps = 26/860 (3%)
 Frame = -3

Query: 2672 SQVEVETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDPFHEV 2493
            +++E  T+ILSA +R TIQ+IKE+VGNHSD+DIY  LK++NMDPNETAQKLLNQDPF EV
Sbjct: 17   TRIEGGTQILSAGMRNTIQTIKEIVGNHSDADIYFTLKDSNMDPNETAQKLLNQDPFLEV 76

Query: 2492 XXXXXXXKENTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNALP--GISREF 2319
                   KEN ++    EPRK  E   +  +  T++D           NALP  GI+REF
Sbjct: 77   KRRRDKKKENMSYKSLEEPRKNSEIFGKTMRIRTYAD-RNARRRGYNRNALPDAGINREF 135

Query: 2318 XXXXXXXXXXXXXREIKPASLQSVPVVGVSPVAILSDQEHLGLRSSE----GQKSSEALN 2151
                         +E K    QS          +       G   SE    G+  S+A N
Sbjct: 136  RVVRDNRVNPEANQETKSPLPQSSISTNEKVTNVKEKGSPTGTTGSERPSGGRSFSQASN 195

Query: 2150 G-----PSDSSSGRGQGKEMFEDVGVKVPSSGSQSRGQKXXXXXXXXXXXXXXXXXXXXX 1986
            G     P  +      G +  E    K  +S                             
Sbjct: 196  GSTNLHPRHAYDHNITGTDRIEPSAEKFTTSAVNFIQHNITEGHSATLASSNSVGGYFSS 255

Query: 1985 XXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXXXSHPVTVSGKGI 1806
                             G   AIKREVGVVG  R               S   ++ G+  
Sbjct: 256  KDPVHVPSPDSRASSAVG---AIKREVGVVGGGRQCSDNAVRDSTAPRSSFSNSILGRDN 312

Query: 1805 SPSPESFQAPTSVTKRDQ-GQTTATESVLPSISVNRSFLSNQYGSKPHQQLMGHQKAPQS 1629
            S   +SF+   S++K DQ  Q  AT+S +     NR+  +NQY  + HQQ +GHQKA Q 
Sbjct: 313  S---DSFRPFPSISKADQINQIAATDSGV----ANRALFTNQYTGRSHQQSVGHQKASQH 365

Query: 1628 NMEWKPKASQKSSHIRPGIIGATTPN-SPPTDISTESKIE-ANLLQKFSQVNISENDHVI 1455
            N EWKPK+SQKS+ I PG+IG  T + SPP D S + + + A L  + S+VNI+EN +VI
Sbjct: 366  NKEWKPKSSQKSNVIGPGVIGTPTKSPSPPVDDSKDLESDVAKLQDELSRVNINENQNVI 425

Query: 1454 IPLHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQA--NAEQSNGERSASV--SAPSTPR 1287
            I  H+RVPE  R +LTFGSFG +F+S+ ++   F A  +AE+SNGE +AS+  +A  T  
Sbjct: 426  IAQHIRVPETDRCRLTFGSFGVDFESSRNLGSGFLAAGSAEESNGESAASLTGAASKTSG 485

Query: 1286 EDVN---QVDLLDGQXXXXXXXXXXXXXXSEHQLP--YKNDSSPENLENYADIGLVRNDS 1122
             DV+    VD+LD                SEHQLP   K+ SSP++L+ YADIGLVR+  
Sbjct: 486  NDVSGRKPVDILDDLVRNSGSNSPASGEASEHQLPDDIKDASSPQDLDGYADIGLVRDTD 545

Query: 1121 PSYSPAEPQHQQDAAGLPSFPAYDHQTGYDMPFFRPMIDESVHVQGAPSPAEALSSHAIN 942
            PSY  +E Q QQD++ L SFPAYD QTGYDM +FRP +DESV  QG PSP EAL+SH+ N
Sbjct: 546  PSYPLSESQQQQDSSELASFPAYDSQTGYDMSYFRPTMDESVRGQGLPSPQEALASHSAN 605

Query: 941  SIPAST---XXXXXXXXXXQLYPQVHVPHFPNFMPYRQFVSPVFVPPMAMPGYSSNPAAY 771
            SIPAS+             Q+YPQVHV HFPN MPYRQ +SPV+VP MAMPGYSSNP AY
Sbjct: 606  SIPASSIAMLQHQQQPQMAQMYPQVHVSHFPNMMPYRQIISPVYVPQMAMPGYSSNP-AY 664

Query: 770  PHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQGGYAMNHQGTVG 591
            PHPSN +SY+LMPGGSSHL   GLKYG  Q+KP+P  +PTGFGN+T+  GYA+N    VG
Sbjct: 665  PHPSNGSSYLLMPGGSSHLSTNGLKYGIQQFKPVPTASPTGFGNFTSPAGYAINAPSVVG 724

Query: 590  GATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNIPGQAPHAASYM 411
              TGLEDS+R+KYK+GN+YV N QA+TSELWI  PRELPGMQS PYYN+P Q PHAA+Y+
Sbjct: 725  SVTGLEDSSRMKYKDGNLYVSNQQADTSELWIHNPRELPGMQSGPYYNMPAQTPHAAAYL 784

Query: 410  PSHTGHASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXXXXXXXXXXXXG 231
            PSH GHASFN AA  Q +H+ + G+YHP+ QP A+ANP HH+                 G
Sbjct: 785  PSHAGHASFN-AAVPQSSHMQFPGMYHPTAQPPAMANP-HHMGPAMGGNVGVGVPPAAPG 842

Query: 230  TQVGAYQQPQVGHMNWSTNF 171
             QVGAYQQPQ+G+ NWS NF
Sbjct: 843  AQVGAYQQPQLGNFNWSPNF 862


>ref|XP_006426627.1| hypothetical protein CICLE_v10024871mg [Citrus clementina]
            gi|557528617|gb|ESR39867.1| hypothetical protein
            CICLE_v10024871mg [Citrus clementina]
          Length = 867

 Score =  664 bits (1714), Expect = 0.0
 Identities = 401/861 (46%), Positives = 504/861 (58%), Gaps = 27/861 (3%)
 Frame = -3

Query: 2672 SQVEVETKILSARVRKTIQSIKEVVGNHSDSDIYVMLKETNMDPNETAQKLLNQDPFHEV 2493
            +++E  T+ILSA +R TIQ+IKE+VGNHSD+DIY  LK++NMDPNETAQKLLNQDPF EV
Sbjct: 17   TRIEGGTQILSAGMRNTIQTIKEIVGNHSDADIYFTLKDSNMDPNETAQKLLNQDPFLEV 76

Query: 2492 XXXXXXXKENTAHMGSAEPRKPIENVRQWTKSHTFSDXXXXXXXXXXXNALP--GISREF 2319
                   KEN ++    EPRK  E   +  +  T++D           NALP  GI+REF
Sbjct: 77   KRRRDKKKENMSYKSLEEPRKNSEIFGKTMRIRTYAD-RNARRRGYNRNALPDAGINREF 135

Query: 2318 XXXXXXXXXXXXXREIKPASLQSVPVVGVSPVAILSDQEHLGLRSSE----GQKSSEALN 2151
                         +E K    QS          +       G   SE    G+  S+A N
Sbjct: 136  RVVRDNRVNPEANQETKSPLPQSSISTNEKVTNVKEKGSPTGTTGSEKPSGGRSFSQASN 195

Query: 2150 G-----PSDSSSGRGQGKEMFEDVGVKVPSSGSQSRGQKXXXXXXXXXXXXXXXXXXXXX 1986
            G     P  +      G +  E    K  +S                             
Sbjct: 196  GSTNLHPRHAYDHNITGTDRIEPSAEKFTTSAVNFIQHNITEGYSATLASSNSVGGYFSS 255

Query: 1985 XXXXXXXXXXXXXXXXPGFVGAIKREVGVVGVRRLXXXXXXXXXXXXXXSHPVTVSGKGI 1806
                             G   AIKREVGVVG  R               S   ++ G+  
Sbjct: 256  KDPVHVPSPDSRASSAVG---AIKREVGVVGGGRQCSDNAVKDSTAPCSSFSNSILGRDN 312

Query: 1805 SPSPESFQAPTSVTKRDQ-GQTTATESVLPSISVNRSFLSNQYGSKPHQQLMGHQKAPQS 1629
            S   +SF+   S++K DQ  Q  AT+S +  +  NR+  +NQY  + HQQ +GHQKA Q 
Sbjct: 313  S---DSFRPFPSISKADQINQIAATDSGVAGMPANRALFTNQYTGRSHQQSVGHQKASQH 369

Query: 1628 NMEWKPKASQKSSHIRPGIIGATTPN-SPPTDISTESKIE-ANLLQKFSQVNISENDHVI 1455
            N EWKPK+SQKS+ I PG+IG  T + SPP D S + + + A L  + S+VNI EN +VI
Sbjct: 370  NKEWKPKSSQKSNVIGPGVIGTPTKSPSPPVDDSKDLESDVAKLQDELSRVNIHENQNVI 429

Query: 1454 IPLHLRVPEAARTQLTFGSFGAEFDSTNSVAYRFQA--NAEQSNGERSASV--SAPSTPR 1287
            I  H+RVPE  R +LTFGSFG +F+S+ ++   F A  +AE+SNGE +AS+  +A  T  
Sbjct: 430  IAQHIRVPETDRCRLTFGSFGVDFESSRNLGSGFLAAGSAEESNGESAASLTGAASKTSG 489

Query: 1286 EDVN---QVDLLDGQXXXXXXXXXXXXXXSEHQLP--YKNDSSPENLENYADIGLVRNDS 1122
             DV+    VD+LD                SEHQLP   K+ SSP++L+ YADIGLVR+  
Sbjct: 490  NDVSGRKPVDILDDLVRNSGSNSPASGEASEHQLPDDIKDASSPQDLDGYADIGLVRDTD 549

Query: 1121 PSYSPAEPQHQQDAAGLPSFP-AYDHQTGYDMPFFRPMIDESVHVQGAPSPAEALSSHAI 945
            PSY  +E Q QQD++ L SFP AYD QTGYDM +FRP +DESV  QG PSP EAL+SH+ 
Sbjct: 550  PSYPLSESQQQQDSSELASFPQAYDSQTGYDMSYFRPTMDESVRGQGLPSPQEALASHSA 609

Query: 944  NSIPAST---XXXXXXXXXXQLYPQVHVPHFPNFMPYRQFVSPVFVPPMAMPGYSSNPAA 774
            NSIPAS+             Q+YPQVHV HFPN MPYRQ +SPV+VP MAMPGYSSNP A
Sbjct: 610  NSIPASSIAMLQHQQQPQMAQMYPQVHVSHFPNMMPYRQIISPVYVPQMAMPGYSSNP-A 668

Query: 773  YPHPSNANSYMLMPGGSSHLPAGGLKYGASQYKPIPAGTPTGFGNYTNQGGYAMNHQGTV 594
            YPHPSN +SY+LMPGGSSHL   GLKYG  Q+KP+P  +PTGFGN+T+  GYA+N    V
Sbjct: 669  YPHPSNGSSYLLMPGGSSHLSTNGLKYGIQQFKPVPTASPTGFGNFTSPAGYAINAPSVV 728

Query: 593  GGATGLEDSNRIKYKEGNIYVPNPQAETSELWIQTPRELPGMQSAPYYNIPGQAPHAASY 414
            G  TGLEDS+R+KYK+GN+YV N QA+TSELWI  PRELPGMQS PYYN+P Q PHAA+Y
Sbjct: 729  GSVTGLEDSSRMKYKDGNLYVSNQQADTSELWIHNPRELPGMQSGPYYNMPAQTPHAAAY 788

Query: 413  MPSHTGHASFNAAAAAQPTHLHYSGLYHPSPQPAAIANPHHHIXXXXXXXXXXXXXXXXX 234
            +PSH GHASFN AA  Q +H+ + G+YHP+ QP A+ANP HH+                 
Sbjct: 789  LPSHAGHASFN-AAVPQSSHMQFPGMYHPTAQPPAMANP-HHMGPAMGGNVGVGVPPAAP 846

Query: 233  GTQVGAYQQPQVGHMNWSTNF 171
            G QVGAYQQPQ+G+ NWS NF
Sbjct: 847  GAQVGAYQQPQLGNFNWSPNF 867


Top