BLASTX nr result

ID: Akebia23_contig00009123 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00009123
         (1469 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002319244.2| hypothetical protein POPTR_0013s07550g [Popu...   283   1e-73
gb|EXB29133.1| DnAJ-like protein [Morus notabilis]                    280   2e-72
ref|XP_004154159.1| PREDICTED: uncharacterized protein LOC101211...   279   2e-72
ref|XP_004145410.1| PREDICTED: uncharacterized protein LOC101208...   279   2e-72
ref|XP_006494936.1| PREDICTED: uncharacterized protein LOC102623...   272   3e-70
ref|XP_006494937.1| PREDICTED: uncharacterized protein LOC102623...   260   1e-66
ref|XP_007029689.1| RING/FYVE/PHD zinc finger superfamily protei...   257   8e-66
ref|XP_002325874.2| PHD finger family protein [Populus trichocar...   256   1e-65
emb|CBI33889.3| unnamed protein product [Vitis vinifera]              255   3e-65
emb|CAN64336.1| hypothetical protein VITISV_001809 [Vitis vinifera]   246   2e-62
ref|XP_006494938.1| PREDICTED: uncharacterized protein LOC102623...   235   3e-59
ref|XP_007039510.1| Uncharacterized protein isoform 6, partial [...   228   7e-57
ref|XP_007039509.1| Uncharacterized protein isoform 5 [Theobroma...   228   7e-57
ref|XP_007039508.1| Uncharacterized protein isoform 4, partial [...   228   7e-57
ref|XP_007039507.1| Uncharacterized protein isoform 3 [Theobroma...   228   7e-57
ref|XP_007039506.1| Uncharacterized protein isoform 2, partial [...   228   7e-57
ref|XP_007039505.1| Uncharacterized protein isoform 1 [Theobroma...   228   7e-57
ref|XP_003631477.1| PREDICTED: uncharacterized protein LOC100243...   225   3e-56
ref|XP_004511407.1| PREDICTED: serine-rich adhesin for platelets...   224   8e-56
ref|XP_006385644.1| hypothetical protein POPTR_0003s08970g [Popu...   223   1e-55

>ref|XP_002319244.2| hypothetical protein POPTR_0013s07550g [Populus trichocarpa]
            gi|550325198|gb|EEE95167.2| hypothetical protein
            POPTR_0013s07550g [Populus trichocarpa]
          Length = 1586

 Score =  283 bits (724), Expect = 1e-73
 Identities = 195/496 (39%), Positives = 254/496 (51%), Gaps = 23/496 (4%)
 Frame = -3

Query: 1458 EPEMTPVLKGSYRIQGPTDEVDHDTPKNTESSRAENRFRRHSMSDEVHTGAESGTCNVCA 1279
            EP++T VL+  +R++GP D+      K  E S+AE    + SM  +V   AESGTCNVC+
Sbjct: 22   EPKITSVLREGHRMEGPLDKTQK---KYMEPSQAEKGLGKPSMRRKVRMRAESGTCNVCS 78

Query: 1278 APCSPCMHFNQAGSCVELDVKDEFSDATSTGKVGSQCSFNDGNVLPTFKKRLCGDRHNAT 1099
            APCS CMH   A  C+     DEFSD T      SQ S NDG+ + +FK R      + T
Sbjct: 79   APCSSCMHLKLA--CMG-SKGDEFSDETCRVTASSQYSNNDGDGIVSFKSRARDSLQHTT 135

Query: 1098 SETSNLLSVCSSHDSLSENAESKASLRTFDSSENVE--MLPNVSLVGIASKHQLLSKPQT 925
            SE SNLLSV SSHDSLSENAESKA++R+ D+  + E  MLP +S     ++     KPQ 
Sbjct: 136  SEASNLLSVSSSHDSLSENAESKANIRSTDADASAESQMLPKLSSGRAVAEDHFSPKPQC 195

Query: 924  VTRQNVFISSSIQSEQRMGLECAGDNISCVSANMPVGDLTVDVDKKDVSCSSASIGSFLP 745
            ++ Q          +   G +   D ISCVS       + V   KK++   +    S L 
Sbjct: 196  LSDQKTLSKKHGDPKSEEGQD---DTISCVSRASDASKV-VSYPKKNLDRDNLLRSSALE 251

Query: 744  EATAG---LLDKSGQSNVPSLKDFSVGDTSKKVWSPYPHSQSGKSNFHNSDAKDLEANSC 574
               +G   +   SG    PS  D   G +S KV +        K    N++ K L+ +  
Sbjct: 252  VEGSGKALVSHNSGSLETPS-NDADAGSSSPKVQT--------KCLSLNANGKCLDEHPS 302

Query: 573  SHQQDEPSECPTNHVESSFAKLATPDGGSAEKSTTHN-CNNILPKFENSKASSFGASVKI 397
             H   +P ECP   V  S +K A  +         HN  +N         A S   S KI
Sbjct: 303  LHDHGKPFECPMEQVNLSLSKEAASNIDCGGNLAAHNNADNHANGKSTINAESSKVSCKI 362

Query: 396  YPCLEA-----GSDMHIESYSASP------------EDADNREPPLESQINDNRDESDTV 268
            Y  LE        D   E +  S             E  D +E  L+S   D  DES+ +
Sbjct: 363  YSKLELEADKDSGDQSNEGFKGSEQVGREEKLNDLEELTDMQEIHLQSASMDESDESEIL 422

Query: 267  EADVKVCDICGDAGREYLLAICSRCSDGAEHTYCMRIMMDKVPEGEWICEGCTLKEESEN 88
            E DVKVCDICGDAGRE LLAICSRC+DGAEHTYCMR M+ KVPEG+W+CE C L EE+EN
Sbjct: 423  EHDVKVCDICGDAGREDLLAICSRCTDGAEHTYCMRDMLQKVPEGDWLCEECKLAEETEN 482

Query: 87   QKQDKFETELGTSKAS 40
            QK D  E  + ++++S
Sbjct: 483  QKPDAEEKRMNSTQSS 498


>gb|EXB29133.1| DnAJ-like protein [Morus notabilis]
          Length = 1795

 Score =  280 bits (715), Expect = 2e-72
 Identities = 194/491 (39%), Positives = 260/491 (52%), Gaps = 16/491 (3%)
 Frame = -3

Query: 1464 ILEPEMTPVLKGSYRIQGPTDEVDHD---TPKNTESSRAENRFRRHSMSDEVHTGAESGT 1294
            I   ++TPVL+GSY +QGP D+ DHD   +  NT SSR+EN+F ++ M+ +V    ESG 
Sbjct: 83   IAPSDITPVLRGSYSMQGPFDDTDHDDHHSHNNTVSSRSENKFSKYYMNHKVRMRGESGA 142

Query: 1293 C-NVCAAPCSPCMHFNQAGSCVELDVKDEFSDATSTGKVGSQCSFNDG-NVLPTFKKRLC 1120
            C NVCAAPCS CMH N     +     DEFSD T      SQ S N   +   +FK +  
Sbjct: 143  CCNVCAAPCSSCMHLNHD---LMASKTDEFSDETCRVNAASQYSVNGARDTSSSFKSKRR 199

Query: 1119 GDRHNATSETSNLLSVCSSHDSLSENAESKASLRTFDSSENVEMLPNVSLVGIASKHQLL 940
                N  SETSN++SV S+HDSLSENA+SKASLR+ + + ++++LP +S  G   +    
Sbjct: 200  ESLQNTASETSNIMSVSSNHDSLSENADSKASLRSSNDALDMQLLP-LSSGGTTGEVGPS 258

Query: 939  SKPQTVTRQNVFISSSIQSEQRMGLECAGDNISCVS----ANMPVGDLTVDVDKKDVSCS 772
             KP     Q     S  + E    LE   D+ISCVS    AN+ VG+ + ++D+ ++SCS
Sbjct: 259  PKPLCNLYQG---GSPNKHEDSKVLEVHDDDISCVSRANDANVAVGNSSRNIDRTNMSCS 315

Query: 771  SASIGSFLPEATAGLLDKSGQSNVPSLKDFSVGDTSKKVWSPYPHSQSGKSNFHNSDAKD 592
            SAS+ S  PE +     + G  ++                              +  +KD
Sbjct: 316  SASVSSLGPEES-----RKGHESIA----------------------------RDMPSKD 342

Query: 591  LEANSCSHQQDEPSECPTNHVESSFAKLATPDGGSAEKS---TTHNCNNILPKFE----N 433
             +A+S S  +++  E     + +S  ++A  DG S +KS   T+       PK E    N
Sbjct: 343  ADASSSS-PKEKLFESSPEQIGASSKEVAAVDGASCQKSIACTSDVPMKFSPKLEAEVNN 401

Query: 432  SKASSFGASVKIYPCLEAGSDMHIESYSASPEDADNREPPLESQINDNRDESDTVEADVK 253
                S G + K +   +A  D     +       D REPP +S   D  DESD VE DVK
Sbjct: 402  DGQGSTGGTPKCFG--QAEQDEKSSKF-------DVREPPSQSMSGDESDESDIVEHDVK 452

Query: 252  VCDICGDAGREYLLAICSRCSDGAEHTYCMRIMMDKVPEGEWICEGCTLKEESENQKQDK 73
            VCDICGDAGRE +LA CSRCSDGAEHTYCMR M+ KVP   W+CE C   EE   QKQ+K
Sbjct: 453  VCDICGDAGREDMLATCSRCSDGAEHTYCMRKMLRKVPGRNWMCEECKFAEEINTQKQEK 512

Query: 72   FETELGTSKAS 40
                  TSKAS
Sbjct: 513  --EGKSTSKAS 521


>ref|XP_004154159.1| PREDICTED: uncharacterized protein LOC101211560, partial [Cucumis
            sativus]
          Length = 1116

 Score =  279 bits (714), Expect = 2e-72
 Identities = 190/484 (39%), Positives = 262/484 (54%), Gaps = 21/484 (4%)
 Frame = -3

Query: 1464 ILEPEMTPVLKG-SYRIQGPTDEVDHDTPKNTESSRAENRFRRHSMSDEVHTGAESGTCN 1288
            + E ++TPVL G S+R QG   E D+DT  N  S ++  +F  +SM+  VH   ESGTCN
Sbjct: 20   VSEMKITPVLGGGSHRTQGSIGETDNDTQWNMVSPQSSKKFT-NSMNQTVHMRGESGTCN 78

Query: 1287 VCAAPCSPCMHFNQAGSCVELDVKDEFSDATSTGKVGSQCSFNDGNVLPTFKKRLCGDRH 1108
            VC+APCS CMH  +A   + +   +EFSD TS     SQ S ND + + + K R+C    
Sbjct: 79   VCSAPCSSCMHLKRA---LTVSKTEEFSDETSHVNATSQYSANDADAISSIKSRVCESSL 135

Query: 1107 NATSETSNLLSVCSSHDSLSENAESKASLRTFDS---SENVEMLPNVSLVGIASKHQLLS 937
            +A SETSNLLSV SSHDS SENA+S A++R+FD+   S +++ +      GI  +  + +
Sbjct: 136  HANSETSNLLSVNSSHDSFSENADSMATIRSFDAANFSVDIDDMHKKLFSGIVPEGHIAT 195

Query: 936  KPQTVTRQNVFISSSIQSEQRMGLECAGDNISCVS----ANMPVGDLTVDVDKKDVSCSS 769
            +P   T       +S +     G E   DNISCVS    AN+ V      +D K+VS  S
Sbjct: 196  EPTVQT-------TSEKHRSIKGAEGHDDNISCVSGSSDANIAVVSHEKIMDNKNVSSGS 248

Query: 768  ASIGSFLPEATAGLL--DKSGQSNVPSLKDFSVGDTSKKVWSPYPHSQSGKSNFHNSDAK 595
            AS+ S   E +  ++   K   S++P+ K+  V ++SK+  +    S S K         
Sbjct: 249  ASVDSLCREGSDKVVFSSKLAISDIPASKE--VHNSSKEAHTVDSFSPSDKP----LSEI 302

Query: 594  DLEANSCSHQQDEPSECPTNHVESSFAKLAT--PDGGSAEKSTTHNCNNILPKFENSKAS 421
              E N  +  + EP E    H +S   ++ T  P G   EK  T+ CN +   F+ S   
Sbjct: 303  GYEQNPSTCVKGEPLESSLVHSDSLTREVVTAPPHG---EKFVTNICNEVGDDFKVSSQI 359

Query: 420  SFGASVKIYPCLEAG---------SDMHIESYSASPEDADNREPPLESQINDNRDESDTV 268
               +  + +                D H E++      +D +E   +S      DESD V
Sbjct: 360  LLKSEEENHVDRSEPPDGDMKIQYEDEHCENFKDLSGSSDVKEHHSQSASGSESDESDIV 419

Query: 267  EADVKVCDICGDAGREYLLAICSRCSDGAEHTYCMRIMMDKVPEGEWICEGCTLKEESEN 88
            E DVKVCDICGDAGRE LLAICSRC+DGAEHTYCMR  +D+VPEG+W+CE C   EE+EN
Sbjct: 420  EHDVKVCDICGDAGREDLLAICSRCTDGAEHTYCMRERLDEVPEGDWLCEECKSAEENEN 479

Query: 87   QKQD 76
            QKQD
Sbjct: 480  QKQD 483


>ref|XP_004145410.1| PREDICTED: uncharacterized protein LOC101208726 [Cucumis sativus]
            gi|449515520|ref|XP_004164797.1| PREDICTED:
            uncharacterized LOC101211560 [Cucumis sativus]
          Length = 1567

 Score =  279 bits (714), Expect = 2e-72
 Identities = 190/484 (39%), Positives = 262/484 (54%), Gaps = 21/484 (4%)
 Frame = -3

Query: 1464 ILEPEMTPVLKG-SYRIQGPTDEVDHDTPKNTESSRAENRFRRHSMSDEVHTGAESGTCN 1288
            + E ++TPVL G S+R QG   E D+DT  N  S ++  +F  +SM+  VH   ESGTCN
Sbjct: 20   VSEMKITPVLGGGSHRTQGSIGETDNDTQWNMVSPQSSKKFT-NSMNQTVHMRGESGTCN 78

Query: 1287 VCAAPCSPCMHFNQAGSCVELDVKDEFSDATSTGKVGSQCSFNDGNVLPTFKKRLCGDRH 1108
            VC+APCS CMH  +A   + +   +EFSD TS     SQ S ND + + + K R+C    
Sbjct: 79   VCSAPCSSCMHLKRA---LTVSKTEEFSDETSHVNATSQYSANDADAISSIKSRVCESSL 135

Query: 1107 NATSETSNLLSVCSSHDSLSENAESKASLRTFDS---SENVEMLPNVSLVGIASKHQLLS 937
            +A SETSNLLSV SSHDS SENA+S A++R+FD+   S +++ +      GI  +  + +
Sbjct: 136  HANSETSNLLSVNSSHDSFSENADSMATIRSFDAANFSVDIDDMHKKLFSGIVPEGHIAT 195

Query: 936  KPQTVTRQNVFISSSIQSEQRMGLECAGDNISCVS----ANMPVGDLTVDVDKKDVSCSS 769
            +P   T       +S +     G E   DNISCVS    AN+ V      +D K+VS  S
Sbjct: 196  EPTVQT-------TSEKHRSIKGAEGHDDNISCVSGSSDANIAVVSHEKIMDNKNVSSGS 248

Query: 768  ASIGSFLPEATAGLL--DKSGQSNVPSLKDFSVGDTSKKVWSPYPHSQSGKSNFHNSDAK 595
            AS+ S   E +  ++   K   S++P+ K+  V ++SK+  +    S S K         
Sbjct: 249  ASVDSLCREGSDKVVFSSKLAISDIPASKE--VHNSSKEAHTVDSFSPSDKP----LSEI 302

Query: 594  DLEANSCSHQQDEPSECPTNHVESSFAKLAT--PDGGSAEKSTTHNCNNILPKFENSKAS 421
              E N  +  + EP E    H +S   ++ T  P G   EK  T+ CN +   F+ S   
Sbjct: 303  GYEQNPSTCVKGEPLESSLVHSDSLTREVVTAPPHG---EKFVTNICNEVGDDFKVSSQI 359

Query: 420  SFGASVKIYPCLEAG---------SDMHIESYSASPEDADNREPPLESQINDNRDESDTV 268
               +  + +                D H E++      +D +E   +S      DESD V
Sbjct: 360  LLKSEEENHVDRSEPPDGDMKIQYEDEHCENFKDLSGSSDVKEHHSQSASGSESDESDIV 419

Query: 267  EADVKVCDICGDAGREYLLAICSRCSDGAEHTYCMRIMMDKVPEGEWICEGCTLKEESEN 88
            E DVKVCDICGDAGRE LLAICSRC+DGAEHTYCMR  +D+VPEG+W+CE C   EE+EN
Sbjct: 420  EHDVKVCDICGDAGREDLLAICSRCTDGAEHTYCMRERLDEVPEGDWLCEECKSAEENEN 479

Query: 87   QKQD 76
            QKQD
Sbjct: 480  QKQD 483


>ref|XP_006494936.1| PREDICTED: uncharacterized protein LOC102623421 isoform X1 [Citrus
            sinensis]
          Length = 1658

 Score =  272 bits (695), Expect = 3e-70
 Identities = 190/471 (40%), Positives = 244/471 (51%), Gaps = 11/471 (2%)
 Frame = -3

Query: 1458 EPEMTPVLKGSYRIQGPTDEVDHDTPKNTESSRAENRFRRHSMSDEVHTGAESGTCNVCA 1279
            E E+T VL GS  +QGP +E + DT KN  +S++E RF + SMS +    AESGTCNVC 
Sbjct: 30   EAEITSVLSGSCHMQGPAEERNLDTRKNMVTSQSERRFGKRSMSRKNRMRAESGTCNVCF 89

Query: 1278 APCSPCMHFNQA--GSCVELDVKDEFSDATSTGKVGSQCSFNDGNVLPTFKKRLCGDRHN 1105
            APCS CMH N A  GS  E     EFSD T     GSQ S N+ + L +FK+  C     
Sbjct: 90   APCSSCMHLNLALMGSKTE-----EFSDETCRETTGSQYSINEADDLRSFKRGPCNKLQQ 144

Query: 1104 ATSETSNLLSVCSSHDSLSENAESKASLRT---FDSSENVEMLPNVSLVGIASKHQLLSK 934
              SE SN LSV SSHDS S NAESK +LR+    D+SE+ E+ P  S  G  ++ Q+  K
Sbjct: 145  TASEASNPLSVNSSHDSFSVNAESKVTLRSSEISDASEDFEIHPKFSSRGGTAEGQISPK 204

Query: 933  PQTVTRQNVFISSSIQSEQRMGLECAGDNISCVS----ANMPVGDLTVDVDKKDVSCSSA 766
             +    Q + ++   + +   G E   DNISCVS     +  + +   ++D K++S SSA
Sbjct: 205  LEIGLDQRISLN---KYDDPKGAEGLDDNISCVSRANDTSTALSENNRNMDIKNLSHSSA 261

Query: 765  SIGSFLPEA--TAGLLDKSGQSNVPSLKDFSVGDTSKKVWSPYPHSQSGKSNFHNSDAKD 592
            S+ S  PE    A   +K   S +PS++       S KV SP P SQS K    +S    
Sbjct: 262  SVCSLGPEGLEKAQSSEKLELSEIPSVEKVGASCGSPKVRSPVPDSQSDKRLVESSS--- 318

Query: 591  LEANSCSHQQDEPSECPTNHVESSFAKLATPDGGSAEKSTTHNCNNILPKFENSKASSFG 412
             +  +  HQ+ E                A  DG + E           P  E  K     
Sbjct: 319  -DVLTKVHQKSE----------------AETDGDNGE-----------PPDEALK----- 345

Query: 411  ASVKIYPCLEAGSDMHIESYSASPEDADNREPPLESQINDNRDESDTVEADVKVCDICGD 232
                   CL+   +    +  A   D         +   D  DESD +E DVKVCDICGD
Sbjct: 346  -------CLDKDKEELTSTQLAELPDVQR----FPAASGDETDESDIMEQDVKVCDICGD 394

Query: 231  AGREYLLAICSRCSDGAEHTYCMRIMMDKVPEGEWICEGCTLKEESENQKQ 79
            AGRE LLAICSRCSDGAEHTYCM+ M+ KVPEG+W+CE C   EE+E QKQ
Sbjct: 395  AGREDLLAICSRCSDGAEHTYCMKEMLQKVPEGDWLCEECKFAEETEKQKQ 445


>ref|XP_006494937.1| PREDICTED: uncharacterized protein LOC102623421 isoform X2 [Citrus
            sinensis]
          Length = 1616

 Score =  260 bits (665), Expect = 1e-66
 Identities = 183/458 (39%), Positives = 236/458 (51%), Gaps = 11/458 (2%)
 Frame = -3

Query: 1419 IQGPTDEVDHDTPKNTESSRAENRFRRHSMSDEVHTGAESGTCNVCAAPCSPCMHFNQA- 1243
            +QGP +E + DT KN  +S++E RF + SMS +    AESGTCNVC APCS CMH N A 
Sbjct: 1    MQGPAEERNLDTRKNMVTSQSERRFGKRSMSRKNRMRAESGTCNVCFAPCSSCMHLNLAL 60

Query: 1242 -GSCVELDVKDEFSDATSTGKVGSQCSFNDGNVLPTFKKRLCGDRHNATSETSNLLSVCS 1066
             GS  E     EFSD T     GSQ S N+ + L +FK+  C       SE SN LSV S
Sbjct: 61   MGSKTE-----EFSDETCRETTGSQYSINEADDLRSFKRGPCNKLQQTASEASNPLSVNS 115

Query: 1065 SHDSLSENAESKASLRT---FDSSENVEMLPNVSLVGIASKHQLLSKPQTVTRQNVFISS 895
            SHDS S NAESK +LR+    D+SE+ E+ P  S  G  ++ Q+  K +    Q + ++ 
Sbjct: 116  SHDSFSVNAESKVTLRSSEISDASEDFEIHPKFSSRGGTAEGQISPKLEIGLDQRISLN- 174

Query: 894  SIQSEQRMGLECAGDNISCVS----ANMPVGDLTVDVDKKDVSCSSASIGSFLPEA--TA 733
              + +   G E   DNISCVS     +  + +   ++D K++S SSAS+ S  PE    A
Sbjct: 175  --KYDDPKGAEGLDDNISCVSRANDTSTALSENNRNMDIKNLSHSSASVCSLGPEGLEKA 232

Query: 732  GLLDKSGQSNVPSLKDFSVGDTSKKVWSPYPHSQSGKSNFHNSDAKDLEANSCSHQQDEP 553
               +K   S +PS++       S KV SP P SQS K    +S     +  +  HQ+ E 
Sbjct: 233  QSSEKLELSEIPSVEKVGASCGSPKVRSPVPDSQSDKRLVESSS----DVLTKVHQKSE- 287

Query: 552  SECPTNHVESSFAKLATPDGGSAEKSTTHNCNNILPKFENSKASSFGASVKIYPCLEAGS 373
                           A  DG + E           P  E  K            CL+   
Sbjct: 288  ---------------AETDGDNGE-----------PPDEALK------------CLDKDK 309

Query: 372  DMHIESYSASPEDADNREPPLESQINDNRDESDTVEADVKVCDICGDAGREYLLAICSRC 193
            +    +  A   D         +   D  DESD +E DVKVCDICGDAGRE LLAICSRC
Sbjct: 310  EELTSTQLAELPDVQR----FPAASGDETDESDIMEQDVKVCDICGDAGREDLLAICSRC 365

Query: 192  SDGAEHTYCMRIMMDKVPEGEWICEGCTLKEESENQKQ 79
            SDGAEHTYCM+ M+ KVPEG+W+CE C   EE+E QKQ
Sbjct: 366  SDGAEHTYCMKEMLQKVPEGDWLCEECKFAEETEKQKQ 403


>ref|XP_007029689.1| RING/FYVE/PHD zinc finger superfamily protein, putative isoform 1
            [Theobroma cacao] gi|508718294|gb|EOY10191.1|
            RING/FYVE/PHD zinc finger superfamily protein, putative
            isoform 1 [Theobroma cacao]
          Length = 1474

 Score =  257 bits (657), Expect = 8e-66
 Identities = 185/472 (39%), Positives = 233/472 (49%), Gaps = 10/472 (2%)
 Frame = -3

Query: 1464 ILEPEMTPVLKGSYRIQGPTDEVDHDTPKNTESSRAENRFRRHSMSDEVHTGAESGTCNV 1285
            I EPE+TP+L+G Y +QGP DE++    KN    +   +  R  MS +V+T AESGTCNV
Sbjct: 28   IYEPEITPILRGIYCMQGPADEIEQSIQKNMAPPKTVRKLVRRYMSQKVYTKAESGTCNV 87

Query: 1284 CAAPCSPCMHFNQAGSCVELDVK-DEFSDATSTGKVGSQCSFNDGNVLPTFKKRLCGDRH 1108
            C+APCS CMH     S  +++ K +EFSD T    V SQ S N+            GD  
Sbjct: 88   CSAPCSSCMHL----STPQMESKSEEFSDDTDRVAVASQYSINEDK---------AGDSL 134

Query: 1107 NAT-SETSNLLSVCSSHDSLSENAESKASLR---TFDSSENVEMLPNVSLVGIASKHQLL 940
              T SE SNLLSV SSHDS SEN ESKA++R     D+SE+VE+    S     SK    
Sbjct: 135  QPTPSEASNLLSVNSSHDSYSENIESKATIRPSNVSDASEDVEIQRTFSNAYDGSK---- 190

Query: 939  SKPQTVTRQNVFISSSIQSEQRMGLECAGDNISCVS----ANMPVGDLTVDVDKKDVSCS 772
                                   G+E   DNISC S     N        D+D K+ S S
Sbjct: 191  -----------------------GVEGHDDNISCASRASDENAASSYCNKDLDSKNSSRS 227

Query: 771  SASIGSFLPEATAGLLDKSGQSNVPSLK-DFSVGDTSKKVWSPYPHSQSGKSNFHNSDAK 595
            SAS+ S L         K   S +PS+K +   G TS ++ SP+ HSQSGKS    S   
Sbjct: 228  SASVSS-LGSGKVLSSQKLELSELPSIKEEVDAGSTSLRMQSPHSHSQSGKSAVGGS--- 283

Query: 594  DLEANSCSHQQDEPSECPTNHVESSFAKLATPDGGSAEKSTTHNCNNILPKFENSKASSF 415
                          SE  T       A + +  G  A+K+     +  L + E  K +  
Sbjct: 284  --------------SEISTKIHSKLEADIDSNSGDPADKT-----DKSLNEDEQDKLNEL 324

Query: 414  GASVKIYPCLEAGSDMHIESYSASPEDADNREPPLESQINDNRDESDTVEADVKVCDICG 235
                                     E  D +E P ++   D   ESD  E DVKVCDICG
Sbjct: 325  ------------------------VELPDKQESPSQAVSGDESYESDATEHDVKVCDICG 360

Query: 234  DAGREYLLAICSRCSDGAEHTYCMRIMMDKVPEGEWICEGCTLKEESENQKQ 79
            DAGRE LLAICS+C+DGAEHTYCMR M+ KVPEG+W+CE C L EE+E+QKQ
Sbjct: 361  DAGREDLLAICSKCADGAEHTYCMREMLQKVPEGDWLCEECKLAEETESQKQ 412


>ref|XP_002325874.2| PHD finger family protein [Populus trichocarpa]
            gi|550316893|gb|EEF00256.2| PHD finger family protein
            [Populus trichocarpa]
          Length = 1539

 Score =  256 bits (655), Expect = 1e-65
 Identities = 185/486 (38%), Positives = 237/486 (48%), Gaps = 44/486 (9%)
 Frame = -3

Query: 1374 TESSRAENRFRRHSMSDEVHTGAESGTCNVCAAPCSPCMHFNQAGSCVELDVKDEFSDAT 1195
            T S + E    + SM  +V T  ESGTCNVC+APCS CMH   A  C+     DEFSD T
Sbjct: 9    TGSMQVEKGLGKPSMRRKVRTSTESGTCNVCSAPCSSCMHLKLA--CMG-SKGDEFSDET 65

Query: 1194 STGKVGSQCSFNDGNVLPTFKKRLCGDRHNATSETSNLLSVCSSHDSLSENAESKASLRT 1015
                  SQ S NDG+ L +FK R      + TSE SNLLSV SSHDSLSENAESK + ++
Sbjct: 66   CRVTASSQYSNNDGDGLVSFKSRARDSLQHTTSEASNLLSVSSSHDSLSENAESKVNRKS 125

Query: 1014 FDSSENVE--MLPNVSLVGIASKHQLLSKPQTVTRQNVFISSSIQSEQRMGLECAGDNIS 841
             D+  + E  M P +S     ++ Q   K ++   Q  F  +++ S+   G +   DN+S
Sbjct: 126  SDADASAESQMRPKMSSGRAVAEDQFSPKAESFPDQKTFSKNNVDSKSEEGHD---DNMS 182

Query: 840  CVS----ANMPVGDLTVDVDKKDVSCSSASIGSFLPEATAGLLDKSGQSNVPSLKDFSVG 673
            CVS    A+  V     ++D K+  C  +S         A    KSG    PS  D    
Sbjct: 183  CVSRANDASKVVSYYNKNLDMKN--CLPSSALEVEGSGKAPFSHKSGSFETPS-NDVDAC 239

Query: 672  DTSKKVWSPYPHSQSGKSNFHNSDAKDLEANSCSHQQDEPSECPTNHVESSFAKLATPDG 493
             +S KV +        K    NS+ K L+ +   H   +  ECPT  V  S +K A+ + 
Sbjct: 240  SSSPKVQT--------KCLSSNSNGKHLDEDPALHDHGKRFECPTEQVNLSLSKEASANI 291

Query: 492  GSAEKSTTHNC--NNILPKFENSKASSFGASVKIYPCLEAGSDMHI-------------- 361
                    HN   NN   K     A S   S KI   LE  +D                 
Sbjct: 292  DCVGNLAAHNIADNNANGK-STLNADSSKVSCKINSKLELEADEDSGDQADEGFKCSDQV 350

Query: 360  ---ESYSASPEDADNREPPLESQINDNRDESDTVEAD-------------------VKVC 247
               E  + S E AD +EP L+S   D  DES+ +E D                   VKVC
Sbjct: 351  ERKEKLNESDELADMQEPMLQSASGDESDESEILEHDNLFLHSLFNLLILHSGGLKVKVC 410

Query: 246  DICGDAGREYLLAICSRCSDGAEHTYCMRIMMDKVPEGEWICEGCTLKEESENQKQDKFE 67
            DICGDAGRE  LAICSRC+DGAEH YCMR M+ K+PEG+W+CE C L EE+ENQKQD  E
Sbjct: 411  DICGDAGREDFLAICSRCADGAEHIYCMREMLQKLPEGDWLCEECKLAEEAENQKQDAEE 470

Query: 66   TELGTS 49
              +  +
Sbjct: 471  KRMNVA 476


>emb|CBI33889.3| unnamed protein product [Vitis vinifera]
          Length = 1457

 Score =  255 bits (652), Expect = 3e-65
 Identities = 177/464 (38%), Positives = 240/464 (51%), Gaps = 16/464 (3%)
 Frame = -3

Query: 1467 KILEPEMTPVLKGSYRIQGPTDEVDHDTPKNTESSRAENRFRRHSMSDEVHTGAESGTCN 1288
            ++ +P++TPVLKG YRIQGP D+ +        S   E  F  H  S +++T AES  CN
Sbjct: 58   EVSQPKITPVLKGGYRIQGPADDAESVIQLTMGSCGTEKGFSGHFSSGKLYTRAESEICN 117

Query: 1287 VCAAPCSPCMHFNQAGSCVELDVKDEFSDATSTGKVGSQCSFNDGNVLPTFKKRLCGDRH 1108
            VCA  CS CMHF++  S V      EFSD     K+ S+C FND  +L   K     D+ 
Sbjct: 118  VCATLCSSCMHFDRVASLV--GKMTEFSDEGCQEKIASRCFFNDAELLSPCKSNASDDQQ 175

Query: 1107 NATSETSNLLSVCSSHDSLSENAESKASLRTFDSSENVEMLPNVSLVGIASKHQLLSKPQ 928
            + +SETSNLLS CSSH+S SENAESK  LR   +SE++EM   ++      +   L  P 
Sbjct: 176  HTSSETSNLLSGCSSHESFSENAESKVILRASHTSEDIEMGQPLA------EDSGLPNPS 229

Query: 927  TVTRQNVFISSSIQSEQRMGLECAGDNISCVS-ANMPVGDLTVDVDKKDVSCSSASIGSF 751
            T     VF   S Q + +  LEC GD+ISC+S A+ PVGD   + D+K+VS SSAS+ S 
Sbjct: 230  TFHGNIVF---SNQHKNQNDLECPGDDISCISRADGPVGDHNGEGDRKNVSYSSASVNSS 286

Query: 750  LPEATAGLLDKSGQSNVPSLKDFSVGDTSKKVWSPYPHSQSGKSNFHNSDAKDLEA---- 583
                    ++ +    V S     +   S+        +    +    S+   L      
Sbjct: 287  PIAVATVNVEPTSHCLVSSHCGEELEHKSEFTKESMRKTAGLSNKLDPSEISYLRGVYAG 346

Query: 582  NSCSHQQDEPSECPTNHVESSFAKLATPDGGSAEKSTTHNCNNILPKFENSKASSFGASV 403
             S + ++ EPSEC    VESS A++A           T +    +P   N        SV
Sbjct: 347  PSPTSRKGEPSECSGKQVESSSARVAV---------ATSSFGGQMPGIPNC-----ARSV 392

Query: 402  KIYPCLEAG---------SDM--HIESYSASPEDADNREPPLESQINDNRDESDTVEADV 256
            K    L+ G         SD   H E   A  E +  ++ PL+SQ+ D+  +SD +E +V
Sbjct: 393  KSDIDLDDGHQETEAVHFSDKKEHSEKSCALLETSSAQKGPLQSQLVDDNVKSDVLEYEV 452

Query: 255  KVCDICGDAGREYLLAICSRCSDGAEHTYCMRIMMDKVPEGEWI 124
            KVCDICGDAG E LLA C++CSDGAEH YCMRI ++KVP   WI
Sbjct: 453  KVCDICGDAGLEELLATCTKCSDGAEHIYCMRIKLEKVPGRGWI 496


>emb|CAN64336.1| hypothetical protein VITISV_001809 [Vitis vinifera]
          Length = 1953

 Score =  246 bits (628), Expect = 2e-62
 Identities = 176/473 (37%), Positives = 242/473 (51%), Gaps = 16/473 (3%)
 Frame = -3

Query: 1377 NTESSRAENRFRRHSMSDEVHTGAESGTCNVCAAPCSPCMHFNQAGSCVELDVKDEFSDA 1198
            N  S   E  F  H  S ++ T AES  CNVCA  CS CMHF++  S V      EFSD 
Sbjct: 573  NKGSCGTEKGFSGHFSSGKLXTXAESXICNVCATLCSSCMHFDRVASLV--GKMTEFSDE 630

Query: 1197 TSTGKVGSQCSFNDGNVLPTFKKRLCGDRHNATSETSNLLSVCSSHDSLSENAESKASLR 1018
                K+ S+C FND  +L   K     D+ + +SETSNLLS CSSH+S SENAESK  LR
Sbjct: 631  GCQEKIASRCFFNDAELLSPCKSNASDDQQHTSSETSNLLSGCSSHESFSENAESKVILR 690

Query: 1017 TFDSSENVEMLPNVSLVGIASKHQLLSKPQTVTRQNVFISSSIQSEQRMGLECAGDNISC 838
               +SE++EM   ++      +   L  P T     +F   S Q + +  LEC GD+ISC
Sbjct: 691  ASHTSEDIEMGQPLA------EDSGLPNPSTFHGNIIF---SNQHKNQNDLECPGDDISC 741

Query: 837  VS-ANMPVGDLTVDVDKKDVSCSSASIGSFLPEATAGLLDKSGQSNVPSLKDFSVGDTSK 661
            +S A+ PVGD   + D+K+VS SSAS+ S         ++ +    V S +   +   S+
Sbjct: 742  ISRADGPVGDHNGEGDRKNVSYSSASVNSSPIAVATVNVEPTSHCLVSSHRGEELEHKSE 801

Query: 660  KVWSPYPHSQSGKSNFHNSDAKDLEA----NSCSHQQDEPSECPTNHVESSFAKLATPDG 493
                    +    +    S+   L       S + ++ EPSEC    VESS A++A    
Sbjct: 802  FTKESMRKTAGLSNKLDPSEISYLRGVYAGPSPTSRKGEPSECSGKQVESSSARVAV--- 858

Query: 492  GSAEKSTTHNCNNILPKFENSKASSFGASVKIYPCLEAG---------SDM--HIESYSA 346
                   T +    +P   N        SVK    L+ G         SD   H E   A
Sbjct: 859  ------ATSSFGGQMPGIPNC-----ARSVKSDIDLDDGHQETEAVHFSDKKEHSEKSCA 907

Query: 345  SPEDADNREPPLESQINDNRDESDTVEADVKVCDICGDAGREYLLAICSRCSDGAEHTYC 166
              E +  ++ PL+SQ+ D+  +SD +E +VKVCDICGDAG E LLA C++CSDGAEH YC
Sbjct: 908  LLETSSAQKGPLQSQLVDDNVKSDVLEYEVKVCDICGDAGLEELLATCTKCSDGAEHIYC 967

Query: 165  MRIMMDKVPEGEWICEGCTLKEESENQKQDKFETELGTSKASCVNGESQTNSG 7
            MRI ++KVP   W+CE C  KEE+    Q + +  +G  K S +N +++ NSG
Sbjct: 968  MRIKLEKVPGRGWMCEECMAKEET----QKEMKCTIGFLKGSSLN-QTRKNSG 1015


>ref|XP_006494938.1| PREDICTED: uncharacterized protein LOC102623421 isoform X3 [Citrus
            sinensis]
          Length = 1587

 Score =  235 bits (600), Expect = 3e-59
 Identities = 170/429 (39%), Positives = 216/429 (50%), Gaps = 11/429 (2%)
 Frame = -3

Query: 1332 MSDEVHTGAESGTCNVCAAPCSPCMHFNQA--GSCVELDVKDEFSDATSTGKVGSQCSFN 1159
            MS +    AESGTCNVC APCS CMH N A  GS  E     EFSD T     GSQ S N
Sbjct: 1    MSRKNRMRAESGTCNVCFAPCSSCMHLNLALMGSKTE-----EFSDETCRETTGSQYSIN 55

Query: 1158 DGNVLPTFKKRLCGDRHNATSETSNLLSVCSSHDSLSENAESKASLRT---FDSSENVEM 988
            + + L +FK+  C       SE SN LSV SSHDS S NAESK +LR+    D+SE+ E+
Sbjct: 56   EADDLRSFKRGPCNKLQQTASEASNPLSVNSSHDSFSVNAESKVTLRSSEISDASEDFEI 115

Query: 987  LPNVSLVGIASKHQLLSKPQTVTRQNVFISSSIQSEQRMGLECAGDNISCVS----ANMP 820
             P  S  G  ++ Q+  K +    Q + ++   + +   G E   DNISCVS     +  
Sbjct: 116  HPKFSSRGGTAEGQISPKLEIGLDQRISLN---KYDDPKGAEGLDDNISCVSRANDTSTA 172

Query: 819  VGDLTVDVDKKDVSCSSASIGSFLPEA--TAGLLDKSGQSNVPSLKDFSVGDTSKKVWSP 646
            + +   ++D K++S SSAS+ S  PE    A   +K   S +PS++       S KV SP
Sbjct: 173  LSENNRNMDIKNLSHSSASVCSLGPEGLEKAQSSEKLELSEIPSVEKVGASCGSPKVRSP 232

Query: 645  YPHSQSGKSNFHNSDAKDLEANSCSHQQDEPSECPTNHVESSFAKLATPDGGSAEKSTTH 466
             P SQS K    +S     +  +  HQ+ E                A  DG + E     
Sbjct: 233  VPDSQSDKRLVESSS----DVLTKVHQKSE----------------AETDGDNGE----- 267

Query: 465  NCNNILPKFENSKASSFGASVKIYPCLEAGSDMHIESYSASPEDADNREPPLESQINDNR 286
                  P  E  K            CL+   +    +  A   D         +   D  
Sbjct: 268  ------PPDEALK------------CLDKDKEELTSTQLAELPDVQR----FPAASGDET 305

Query: 285  DESDTVEADVKVCDICGDAGREYLLAICSRCSDGAEHTYCMRIMMDKVPEGEWICEGCTL 106
            DESD +E DVKVCDICGDAGRE LLAICSRCSDGAEHTYCM+ M+ KVPEG+W+CE C  
Sbjct: 306  DESDIMEQDVKVCDICGDAGREDLLAICSRCSDGAEHTYCMKEMLQKVPEGDWLCEECKF 365

Query: 105  KEESENQKQ 79
             EE+E QKQ
Sbjct: 366  AEETEKQKQ 374


>ref|XP_007039510.1| Uncharacterized protein isoform 6, partial [Theobroma cacao]
            gi|590675664|ref|XP_007039511.1| Uncharacterized protein
            isoform 6, partial [Theobroma cacao]
            gi|508776755|gb|EOY24011.1| Uncharacterized protein
            isoform 6, partial [Theobroma cacao]
            gi|508776756|gb|EOY24012.1| Uncharacterized protein
            isoform 6, partial [Theobroma cacao]
          Length = 996

 Score =  228 bits (580), Expect = 7e-57
 Identities = 161/462 (34%), Positives = 224/462 (48%), Gaps = 19/462 (4%)
 Frame = -3

Query: 1368 SSRAENRFRR-HSMSDEVHTGAESGTCNVCAAPCSPCMHFNQAGSCVELDVKDEFSDATS 1192
            SS AE  F   HS S ++    ESGTCN CA  CSPC+H  Q  S       + FS    
Sbjct: 3    SSNAEKGFSGGHSSSSKLGLKEESGTCNTCAPSCSPCLHSEQVTSMATKT--NGFSGEAC 60

Query: 1191 TGKVGSQCSFNDGNVLPTFKKRLCGDRHNATSETSNLLSVCSSHDSLSENAESKASLRTF 1012
              K  + CSFND ++        C DRH+ +SETS  LS C S +S SENAES+ +LR  
Sbjct: 61   KKKDSNCCSFNDADLSSPRVNSACNDRHHTSSETSQPLSACLSRESFSENAESEETLRDC 120

Query: 1011 DSSENVEMLPNVSLVGIASKHQLLSKPQTVTRQNVFISSSIQSEQRMGLECAGDNISCVS 832
            ++SE ++M+   +L    S     S   ++    V    S Q E++  LEC GDNI+ + 
Sbjct: 121  NTSEGIKMIRKPNLCQ-NSADNCGSLKSSIFHDKVV---SNQLEKQKELECHGDNIAFIC 176

Query: 831  ANMPV----GDLTVDVDKKDVSCSSASIGSFLPEATAGLLDKSGQSNVPSLKDFSVGDTS 664
             +  V    G    D DKK++S  SAS+ SF     A         N        VG   
Sbjct: 177  GSDYVKTRGGGHNSDADKKNLSYRSASVDSFSETEKA--------VNAQPASSCLVGSPC 228

Query: 663  KKVWSPYPHSQSGKSNF---------HNSDAKDLEA--NSC---SHQQDEPSECPTNHVE 526
             +V + +P   +  +N          + SD  ++ +  +SC   S  + E SEC    V+
Sbjct: 229  DEVDNNHPRRSNRSTNVSSQEILCCSNKSDLSEISSLRDSCAGASSAKGERSECSEEQVQ 288

Query: 525  SSFAKLATPDGGSAEKSTTHNCNNILPKFENSKASSFGASVKIYPCLEAGSDMHIESYSA 346
            SSF +      GS      ++  +I P+                  +  G        + 
Sbjct: 289  SSFVRADALRIGSQIGDEHNSAESIQPETG----------------INGGEQTAEVKSTT 332

Query: 345  SPEDADNREPPLESQINDNRDESDTVEADVKVCDICGDAGREYLLAICSRCSDGAEHTYC 166
              +D +  E  + S+     D SD++E +VKVCDICGD GRE LLAICS+C+DGAEH YC
Sbjct: 333  VVKDVNMEESTIVSRPYACSDGSDSLELEVKVCDICGDIGREELLAICSKCNDGAEHIYC 392

Query: 165  MRIMMDKVPEGEWICEGCTLKEESENQKQDKFETELGTSKAS 40
            MR+ MD VP+ +W+CE C L +E+E QKQDK E  +G  K S
Sbjct: 393  MRVKMDNVPKSDWMCEECMLGKETEKQKQDKIEEGVGIFKKS 434


>ref|XP_007039509.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508776754|gb|EOY24010.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 1197

 Score =  228 bits (580), Expect = 7e-57
 Identities = 161/462 (34%), Positives = 224/462 (48%), Gaps = 19/462 (4%)
 Frame = -3

Query: 1368 SSRAENRFRR-HSMSDEVHTGAESGTCNVCAAPCSPCMHFNQAGSCVELDVKDEFSDATS 1192
            SS AE  F   HS S ++    ESGTCN CA  CSPC+H  Q  S       + FS    
Sbjct: 3    SSNAEKGFSGGHSSSSKLGLKEESGTCNTCAPSCSPCLHSEQVTSMATKT--NGFSGEAC 60

Query: 1191 TGKVGSQCSFNDGNVLPTFKKRLCGDRHNATSETSNLLSVCSSHDSLSENAESKASLRTF 1012
              K  + CSFND ++        C DRH+ +SETS  LS C S +S SENAES+ +LR  
Sbjct: 61   KKKDSNCCSFNDADLSSPRVNSACNDRHHTSSETSQPLSACLSRESFSENAESEETLRDC 120

Query: 1011 DSSENVEMLPNVSLVGIASKHQLLSKPQTVTRQNVFISSSIQSEQRMGLECAGDNISCVS 832
            ++SE ++M+   +L    S     S   ++    V    S Q E++  LEC GDNI+ + 
Sbjct: 121  NTSEGIKMIRKPNLCQ-NSADNCGSLKSSIFHDKVV---SNQLEKQKELECHGDNIAFIC 176

Query: 831  ANMPV----GDLTVDVDKKDVSCSSASIGSFLPEATAGLLDKSGQSNVPSLKDFSVGDTS 664
             +  V    G    D DKK++S  SAS+ SF     A         N        VG   
Sbjct: 177  GSDYVKTRGGGHNSDADKKNLSYRSASVDSFSETEKA--------VNAQPASSCLVGSPC 228

Query: 663  KKVWSPYPHSQSGKSNF---------HNSDAKDLEA--NSC---SHQQDEPSECPTNHVE 526
             +V + +P   +  +N          + SD  ++ +  +SC   S  + E SEC    V+
Sbjct: 229  DEVDNNHPRRSNRSTNVSSQEILCCSNKSDLSEISSLRDSCAGASSAKGERSECSEEQVQ 288

Query: 525  SSFAKLATPDGGSAEKSTTHNCNNILPKFENSKASSFGASVKIYPCLEAGSDMHIESYSA 346
            SSF +      GS      ++  +I P+                  +  G        + 
Sbjct: 289  SSFVRADALRIGSQIGDEHNSAESIQPETG----------------INGGEQTAEVKSTT 332

Query: 345  SPEDADNREPPLESQINDNRDESDTVEADVKVCDICGDAGREYLLAICSRCSDGAEHTYC 166
              +D +  E  + S+     D SD++E +VKVCDICGD GRE LLAICS+C+DGAEH YC
Sbjct: 333  VVKDVNMEESTIVSRPYACSDGSDSLELEVKVCDICGDIGREELLAICSKCNDGAEHIYC 392

Query: 165  MRIMMDKVPEGEWICEGCTLKEESENQKQDKFETELGTSKAS 40
            MR+ MD VP+ +W+CE C L +E+E QKQDK E  +G  K S
Sbjct: 393  MRVKMDNVPKSDWMCEECMLGKETEKQKQDKIEEGVGIFKKS 434


>ref|XP_007039508.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
            gi|508776753|gb|EOY24009.1| Uncharacterized protein
            isoform 4, partial [Theobroma cacao]
          Length = 1044

 Score =  228 bits (580), Expect = 7e-57
 Identities = 161/462 (34%), Positives = 224/462 (48%), Gaps = 19/462 (4%)
 Frame = -3

Query: 1368 SSRAENRFRR-HSMSDEVHTGAESGTCNVCAAPCSPCMHFNQAGSCVELDVKDEFSDATS 1192
            SS AE  F   HS S ++    ESGTCN CA  CSPC+H  Q  S       + FS    
Sbjct: 3    SSNAEKGFSGGHSSSSKLGLKEESGTCNTCAPSCSPCLHSEQVTSMATKT--NGFSGEAC 60

Query: 1191 TGKVGSQCSFNDGNVLPTFKKRLCGDRHNATSETSNLLSVCSSHDSLSENAESKASLRTF 1012
              K  + CSFND ++        C DRH+ +SETS  LS C S +S SENAES+ +LR  
Sbjct: 61   KKKDSNCCSFNDADLSSPRVNSACNDRHHTSSETSQPLSACLSRESFSENAESEETLRDC 120

Query: 1011 DSSENVEMLPNVSLVGIASKHQLLSKPQTVTRQNVFISSSIQSEQRMGLECAGDNISCVS 832
            ++SE ++M+   +L    S     S   ++    V    S Q E++  LEC GDNI+ + 
Sbjct: 121  NTSEGIKMIRKPNLCQ-NSADNCGSLKSSIFHDKVV---SNQLEKQKELECHGDNIAFIC 176

Query: 831  ANMPV----GDLTVDVDKKDVSCSSASIGSFLPEATAGLLDKSGQSNVPSLKDFSVGDTS 664
             +  V    G    D DKK++S  SAS+ SF     A         N        VG   
Sbjct: 177  GSDYVKTRGGGHNSDADKKNLSYRSASVDSFSETEKA--------VNAQPASSCLVGSPC 228

Query: 663  KKVWSPYPHSQSGKSNF---------HNSDAKDLEA--NSC---SHQQDEPSECPTNHVE 526
             +V + +P   +  +N          + SD  ++ +  +SC   S  + E SEC    V+
Sbjct: 229  DEVDNNHPRRSNRSTNVSSQEILCCSNKSDLSEISSLRDSCAGASSAKGERSECSEEQVQ 288

Query: 525  SSFAKLATPDGGSAEKSTTHNCNNILPKFENSKASSFGASVKIYPCLEAGSDMHIESYSA 346
            SSF +      GS      ++  +I P+                  +  G        + 
Sbjct: 289  SSFVRADALRIGSQIGDEHNSAESIQPETG----------------INGGEQTAEVKSTT 332

Query: 345  SPEDADNREPPLESQINDNRDESDTVEADVKVCDICGDAGREYLLAICSRCSDGAEHTYC 166
              +D +  E  + S+     D SD++E +VKVCDICGD GRE LLAICS+C+DGAEH YC
Sbjct: 333  VVKDVNMEESTIVSRPYACSDGSDSLELEVKVCDICGDIGREELLAICSKCNDGAEHIYC 392

Query: 165  MRIMMDKVPEGEWICEGCTLKEESENQKQDKFETELGTSKAS 40
            MR+ MD VP+ +W+CE C L +E+E QKQDK E  +G  K S
Sbjct: 393  MRVKMDNVPKSDWMCEECMLGKETEKQKQDKIEEGVGIFKKS 434


>ref|XP_007039507.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508776752|gb|EOY24008.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 1161

 Score =  228 bits (580), Expect = 7e-57
 Identities = 161/462 (34%), Positives = 224/462 (48%), Gaps = 19/462 (4%)
 Frame = -3

Query: 1368 SSRAENRFRR-HSMSDEVHTGAESGTCNVCAAPCSPCMHFNQAGSCVELDVKDEFSDATS 1192
            SS AE  F   HS S ++    ESGTCN CA  CSPC+H  Q  S       + FS    
Sbjct: 3    SSNAEKGFSGGHSSSSKLGLKEESGTCNTCAPSCSPCLHSEQVTSMATKT--NGFSGEAC 60

Query: 1191 TGKVGSQCSFNDGNVLPTFKKRLCGDRHNATSETSNLLSVCSSHDSLSENAESKASLRTF 1012
              K  + CSFND ++        C DRH+ +SETS  LS C S +S SENAES+ +LR  
Sbjct: 61   KKKDSNCCSFNDADLSSPRVNSACNDRHHTSSETSQPLSACLSRESFSENAESEETLRDC 120

Query: 1011 DSSENVEMLPNVSLVGIASKHQLLSKPQTVTRQNVFISSSIQSEQRMGLECAGDNISCVS 832
            ++SE ++M+   +L    S     S   ++    V    S Q E++  LEC GDNI+ + 
Sbjct: 121  NTSEGIKMIRKPNLCQ-NSADNCGSLKSSIFHDKVV---SNQLEKQKELECHGDNIAFIC 176

Query: 831  ANMPV----GDLTVDVDKKDVSCSSASIGSFLPEATAGLLDKSGQSNVPSLKDFSVGDTS 664
             +  V    G    D DKK++S  SAS+ SF     A         N        VG   
Sbjct: 177  GSDYVKTRGGGHNSDADKKNLSYRSASVDSFSETEKA--------VNAQPASSCLVGSPC 228

Query: 663  KKVWSPYPHSQSGKSNF---------HNSDAKDLEA--NSC---SHQQDEPSECPTNHVE 526
             +V + +P   +  +N          + SD  ++ +  +SC   S  + E SEC    V+
Sbjct: 229  DEVDNNHPRRSNRSTNVSSQEILCCSNKSDLSEISSLRDSCAGASSAKGERSECSEEQVQ 288

Query: 525  SSFAKLATPDGGSAEKSTTHNCNNILPKFENSKASSFGASVKIYPCLEAGSDMHIESYSA 346
            SSF +      GS      ++  +I P+                  +  G        + 
Sbjct: 289  SSFVRADALRIGSQIGDEHNSAESIQPETG----------------INGGEQTAEVKSTT 332

Query: 345  SPEDADNREPPLESQINDNRDESDTVEADVKVCDICGDAGREYLLAICSRCSDGAEHTYC 166
              +D +  E  + S+     D SD++E +VKVCDICGD GRE LLAICS+C+DGAEH YC
Sbjct: 333  VVKDVNMEESTIVSRPYACSDGSDSLELEVKVCDICGDIGREELLAICSKCNDGAEHIYC 392

Query: 165  MRIMMDKVPEGEWICEGCTLKEESENQKQDKFETELGTSKAS 40
            MR+ MD VP+ +W+CE C L +E+E QKQDK E  +G  K S
Sbjct: 393  MRVKMDNVPKSDWMCEECMLGKETEKQKQDKIEEGVGIFKKS 434


>ref|XP_007039506.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508776751|gb|EOY24007.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 1048

 Score =  228 bits (580), Expect = 7e-57
 Identities = 161/462 (34%), Positives = 224/462 (48%), Gaps = 19/462 (4%)
 Frame = -3

Query: 1368 SSRAENRFRR-HSMSDEVHTGAESGTCNVCAAPCSPCMHFNQAGSCVELDVKDEFSDATS 1192
            SS AE  F   HS S ++    ESGTCN CA  CSPC+H  Q  S       + FS    
Sbjct: 3    SSNAEKGFSGGHSSSSKLGLKEESGTCNTCAPSCSPCLHSEQVTSMATKT--NGFSGEAC 60

Query: 1191 TGKVGSQCSFNDGNVLPTFKKRLCGDRHNATSETSNLLSVCSSHDSLSENAESKASLRTF 1012
              K  + CSFND ++        C DRH+ +SETS  LS C S +S SENAES+ +LR  
Sbjct: 61   KKKDSNCCSFNDADLSSPRVNSACNDRHHTSSETSQPLSACLSRESFSENAESEETLRDC 120

Query: 1011 DSSENVEMLPNVSLVGIASKHQLLSKPQTVTRQNVFISSSIQSEQRMGLECAGDNISCVS 832
            ++SE ++M+   +L    S     S   ++    V    S Q E++  LEC GDNI+ + 
Sbjct: 121  NTSEGIKMIRKPNLCQ-NSADNCGSLKSSIFHDKVV---SNQLEKQKELECHGDNIAFIC 176

Query: 831  ANMPV----GDLTVDVDKKDVSCSSASIGSFLPEATAGLLDKSGQSNVPSLKDFSVGDTS 664
             +  V    G    D DKK++S  SAS+ SF     A         N        VG   
Sbjct: 177  GSDYVKTRGGGHNSDADKKNLSYRSASVDSFSETEKA--------VNAQPASSCLVGSPC 228

Query: 663  KKVWSPYPHSQSGKSNF---------HNSDAKDLEA--NSC---SHQQDEPSECPTNHVE 526
             +V + +P   +  +N          + SD  ++ +  +SC   S  + E SEC    V+
Sbjct: 229  DEVDNNHPRRSNRSTNVSSQEILCCSNKSDLSEISSLRDSCAGASSAKGERSECSEEQVQ 288

Query: 525  SSFAKLATPDGGSAEKSTTHNCNNILPKFENSKASSFGASVKIYPCLEAGSDMHIESYSA 346
            SSF +      GS      ++  +I P+                  +  G        + 
Sbjct: 289  SSFVRADALRIGSQIGDEHNSAESIQPETG----------------INGGEQTAEVKSTT 332

Query: 345  SPEDADNREPPLESQINDNRDESDTVEADVKVCDICGDAGREYLLAICSRCSDGAEHTYC 166
              +D +  E  + S+     D SD++E +VKVCDICGD GRE LLAICS+C+DGAEH YC
Sbjct: 333  VVKDVNMEESTIVSRPYACSDGSDSLELEVKVCDICGDIGREELLAICSKCNDGAEHIYC 392

Query: 165  MRIMMDKVPEGEWICEGCTLKEESENQKQDKFETELGTSKAS 40
            MR+ MD VP+ +W+CE C L +E+E QKQDK E  +G  K S
Sbjct: 393  MRVKMDNVPKSDWMCEECMLGKETEKQKQDKIEEGVGIFKKS 434


>ref|XP_007039505.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508776750|gb|EOY24006.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1201

 Score =  228 bits (580), Expect = 7e-57
 Identities = 161/462 (34%), Positives = 224/462 (48%), Gaps = 19/462 (4%)
 Frame = -3

Query: 1368 SSRAENRFRR-HSMSDEVHTGAESGTCNVCAAPCSPCMHFNQAGSCVELDVKDEFSDATS 1192
            SS AE  F   HS S ++    ESGTCN CA  CSPC+H  Q  S       + FS    
Sbjct: 3    SSNAEKGFSGGHSSSSKLGLKEESGTCNTCAPSCSPCLHSEQVTSMATKT--NGFSGEAC 60

Query: 1191 TGKVGSQCSFNDGNVLPTFKKRLCGDRHNATSETSNLLSVCSSHDSLSENAESKASLRTF 1012
              K  + CSFND ++        C DRH+ +SETS  LS C S +S SENAES+ +LR  
Sbjct: 61   KKKDSNCCSFNDADLSSPRVNSACNDRHHTSSETSQPLSACLSRESFSENAESEETLRDC 120

Query: 1011 DSSENVEMLPNVSLVGIASKHQLLSKPQTVTRQNVFISSSIQSEQRMGLECAGDNISCVS 832
            ++SE ++M+   +L    S     S   ++    V    S Q E++  LEC GDNI+ + 
Sbjct: 121  NTSEGIKMIRKPNLCQ-NSADNCGSLKSSIFHDKVV---SNQLEKQKELECHGDNIAFIC 176

Query: 831  ANMPV----GDLTVDVDKKDVSCSSASIGSFLPEATAGLLDKSGQSNVPSLKDFSVGDTS 664
             +  V    G    D DKK++S  SAS+ SF     A         N        VG   
Sbjct: 177  GSDYVKTRGGGHNSDADKKNLSYRSASVDSFSETEKA--------VNAQPASSCLVGSPC 228

Query: 663  KKVWSPYPHSQSGKSNF---------HNSDAKDLEA--NSC---SHQQDEPSECPTNHVE 526
             +V + +P   +  +N          + SD  ++ +  +SC   S  + E SEC    V+
Sbjct: 229  DEVDNNHPRRSNRSTNVSSQEILCCSNKSDLSEISSLRDSCAGASSAKGERSECSEEQVQ 288

Query: 525  SSFAKLATPDGGSAEKSTTHNCNNILPKFENSKASSFGASVKIYPCLEAGSDMHIESYSA 346
            SSF +      GS      ++  +I P+                  +  G        + 
Sbjct: 289  SSFVRADALRIGSQIGDEHNSAESIQPETG----------------INGGEQTAEVKSTT 332

Query: 345  SPEDADNREPPLESQINDNRDESDTVEADVKVCDICGDAGREYLLAICSRCSDGAEHTYC 166
              +D +  E  + S+     D SD++E +VKVCDICGD GRE LLAICS+C+DGAEH YC
Sbjct: 333  VVKDVNMEESTIVSRPYACSDGSDSLELEVKVCDICGDIGREELLAICSKCNDGAEHIYC 392

Query: 165  MRIMMDKVPEGEWICEGCTLKEESENQKQDKFETELGTSKAS 40
            MR+ MD VP+ +W+CE C L +E+E QKQDK E  +G  K S
Sbjct: 393  MRVKMDNVPKSDWMCEECMLGKETEKQKQDKIEEGVGIFKKS 434


>ref|XP_003631477.1| PREDICTED: uncharacterized protein LOC100243800 [Vitis vinifera]
          Length = 1528

 Score =  225 bits (574), Expect = 3e-56
 Identities = 169/498 (33%), Positives = 237/498 (47%), Gaps = 44/498 (8%)
 Frame = -3

Query: 1368 SSRAENRFRRHSMSDEVHTGAESGTCNVCAAPCSPCMHFNQAGSCVELDVKDEFSDATST 1189
            S   E  F  H  S +++T AES  CNVCA  CS CMHF++  S V      EFSD    
Sbjct: 101  SCGTEKGFSGHFSSGKLYTRAESEICNVCATLCSSCMHFDRVASLV--GKMTEFSDEGCQ 158

Query: 1188 GKVGSQCSFNDGNVLPTFKKRLCGDRHNATSETSNLLSVCSSHDSLSENAESKASLRTFD 1009
             K+ S+C FND  +L   K     D+ + +SETSNLLS CSSH+S SENAESK  LR   
Sbjct: 159  EKIASRCFFNDAELLSPCKSNASDDQQHTSSETSNLLSGCSSHESFSENAESKVILRASH 218

Query: 1008 SSENVEMLPNVSLVGIASKHQLLSKPQTVTRQNVFISSSIQSEQRMGLECAGDNISCVS- 832
            +SE++EM   ++      +   L  P T     VF   S Q + +  LEC GD+ISC+S 
Sbjct: 219  TSEDIEMGQPLA------EDSGLPNPSTFHGNIVF---SNQHKNQNDLECPGDDISCISR 269

Query: 831  ANMPVGDLTVDVDKKDVSCSSASIGSFLPEATAGLLDKSGQSNVPSLKDFSVGDTSKKVW 652
            A+ PVGD   + D+K+VS SSAS+ S         ++ +    V S     +   S+   
Sbjct: 270  ADGPVGDHNGEGDRKNVSYSSASVNSSPIAVATVNVEPTSHCLVSSHCGEELEHKSEFTK 329

Query: 651  SPYPHSQSGKSNFHNSDAKDLEA----NSCSHQQDEPSECPTNHVESSFAK--LATPDGG 490
                 +    +    S+   L       S + ++ EPSEC    VESS A+  +AT   G
Sbjct: 330  ESMRKTAGLSNKLDPSEISYLRGVYAGPSPTSRKGEPSECSGKQVESSSARVAVATSSFG 389

Query: 489  SAEKSTTHNCNNILPKFENSKASSFGASVKIYPCLEAGSDMHIESYSASPEDADNREPPL 310
                   +   ++    +         +V       +    H E   A  E +  ++ PL
Sbjct: 390  GQMPGIPNCARSVKSDIDLDDGHQETEAVHF-----SDKKEHSEKSCALLETSSAQKGPL 444

Query: 309  ESQINDNRDESDTVEAD-------------------------------------VKVCDI 241
            +SQ+ D+  +SD +E +                                     VKVCDI
Sbjct: 445  QSQLVDDNVKSDVLEYESRHPHAKGTYIAYPVVYIFSNYEAFYGHLGDMVSGTGVKVCDI 504

Query: 240  CGDAGREYLLAICSRCSDGAEHTYCMRIMMDKVPEGEWICEGCTLKEESENQKQDKFETE 61
            CGDAG E LLA C++CSDGAEH YCMRI ++KVP   W+CE C  KEE+    Q + +  
Sbjct: 505  CGDAGLEELLATCTKCSDGAEHIYCMRIKLEKVPGRGWMCEECMAKEET----QKEMKCT 560

Query: 60   LGTSKASCVNGESQTNSG 7
            +G  K S +N +++ NSG
Sbjct: 561  IGFLKGSSLN-QTRKNSG 577


>ref|XP_004511407.1| PREDICTED: serine-rich adhesin for platelets-like isoform X4 [Cicer
            arietinum]
          Length = 1529

 Score =  224 bits (571), Expect = 8e-56
 Identities = 166/439 (37%), Positives = 217/439 (49%), Gaps = 17/439 (3%)
 Frame = -3

Query: 1305 ESGTCNVCAAPCSPCMHFNQAGSCVELDVKDEFSDATS-TGKVGSQCSFNDGNVLPTFKK 1129
            ESGTCNVC+APCS CMH N A   +      EFSD    +G+  SQ S N+ NV  +   
Sbjct: 4    ESGTCNVCSAPCSSCMHLNHA---LTGSKAVEFSDDNCRSGEANSQNSMNESNV-HSLTS 59

Query: 1128 RLCGDRHNATSETSNLLSVCSSHDSLSENAESKASLRTFDSSENVEMLPNVSLVGIASKH 949
            R C +  +A SE SN+LSV S HDSLSENAES+  L                     +K+
Sbjct: 60   RACENTQHAVSEASNMLSVNSCHDSLSENAESRQILM--------------------NKY 99

Query: 948  QLLSKPQTVTRQNVFISSSIQSEQRMGLECAGDNISCVSANMPVGDLTV-DVDKKDVSCS 772
            Q                          LE   DN SC+S      D  + + D  ++ CS
Sbjct: 100  Q----------------------DPKHLEGHDDNTSCISR---ASDANLRNADGINIPCS 134

Query: 771  SASIGSFLPEATAGLLDKSGQS--NVPSLKDFSVGDTSKKVWSPYPHSQSGKSNFHNSDA 598
            SAS+ S +    +G+      S   +PS KD     +S KV   +  S++GKS   N   
Sbjct: 135  SASV-SHIGAERSGIAPSVDMSCLEIPSSKDADTDHSSPKVQRLHGQSETGKSLSDNQSL 193

Query: 597  KDLEANSCSHQQDEPSECPTNHVESSFAKLATPDGGSAEKSTTH-------NCNNILPKF 439
              +E  S SH  ++ SE    +  SS +K + P   S EK+T         N N +L   
Sbjct: 194  MHMERGSNSHIPEKVSEGSIENCSSSLSKESVPIVISGEKNTASKDNIVDDNSNALLKVC 253

Query: 438  ENSKASSFG--ASVKIYPCLEAGSDMHIESYSASPEDADNREPPLESQINDNRDESDTVE 265
              S+A +       K+  C  +G D H+E      E+        ESQ  +  DESD VE
Sbjct: 254  PKSQADTDNDVCDAKVEDCKCSGHDGHLEK----AEELVKSPGKQESQSENESDESDVVE 309

Query: 264  ADVKVCDICGDAGREYLLAICSRCSDGAEHTYCMRIMMDKVPEGEWICEGCTLKEESENQ 85
             DVKVCDICGDAGRE LLAICSRC+DGAEHTYCMR M++KVPE +W CE C    E+EN+
Sbjct: 310  HDVKVCDICGDAGREDLLAICSRCTDGAEHTYCMREMLEKVPEEDWFCEECQDALETENK 369

Query: 84   KQDKFETEL----GTSKAS 40
            + D  E ++     TS+AS
Sbjct: 370  RLDVEEKKIIKTASTSQAS 388


>ref|XP_006385644.1| hypothetical protein POPTR_0003s08970g [Populus trichocarpa]
            gi|550342775|gb|ERP63441.1| hypothetical protein
            POPTR_0003s08970g [Populus trichocarpa]
          Length = 1231

 Score =  223 bits (569), Expect = 1e-55
 Identities = 179/510 (35%), Positives = 246/510 (48%), Gaps = 28/510 (5%)
 Frame = -3

Query: 1449 MTPVLKGSYRIQGPTDEVDHDTPKNTESSRAENRF-RRHSMSDEVHTGAESGTCNVCAAP 1273
            + P  K  Y ++GP+D  +H    N  SS  EN F  +   SD+ H   ESGTCN C   
Sbjct: 12   IAPTFKVGYPVEGPSDGKNHTVGLNMGSSVTENMFGSKQYSSDKFHIKEESGTCNECTGS 71

Query: 1272 CSPCMHFNQAGSCVELDVKDEFSDATSTGKVGSQCSFNDGNVLPTFKKRLCGDRHNATSE 1093
            CS CM    A S + +     FS   S GKV +Q S +  ++L       C  R+ +TSE
Sbjct: 72   CSCCM----AASLLRMKADVGFSYEISKGKVDAQYSRSGADMLSPVDSS-CNSRNRSTSE 126

Query: 1092 TSNLLSVCSSHDSLSENAESKASLRTFDSSENVEMLPNVSLVGIASKHQLLSKPQTVTRQ 913
             SNLLS CSSHDS SEN ESK +LR   +SE+ EML   +    A K+  LS+       
Sbjct: 127  ISNLLSACSSHDSFSENEESKDTLRASGTSEHSEMLVEENDQQTARKNPGLSRTILFHDS 186

Query: 912  NVFISSSIQSEQRMGLECAGDNISCVSAN----MPVGDLTVDVDKKDVSCSSASIGSF-- 751
            N+   +  + ++   LEC GD+ SC+S +       GD     D+K+VS SS SI SF  
Sbjct: 187  NILFKNHQKPKE---LECIGDDASCISGSEYTDKIAGDHHCYTDRKNVSSSSTSIDSFPA 243

Query: 750  ----------LPEATAGLLDKSGQSNVPSLKDFSVGDTSKKVWSPYPHSQSGKSN-FHNS 604
                      L     G  D    +   +L  F+      K  SP     S KSN    S
Sbjct: 244  IENAANVRPTLCSLAKGQFDTIDNNQPRTLIKFT------KESSPTIAVFSNKSNQIDIS 297

Query: 603  DAKDLEANSCSHQQDEPSECPTNHVESSFAKLAT--PDGGSAEKSTTHNCNNILP-KFEN 433
             A+D    + S  + +PSEC    +ES   + AT   D    E+      N+  P K E 
Sbjct: 298  SARDFYIGANS-SKGKPSECSEEQIESPLMRAATFWVDAQIHEEE-----NHTEPVKSEI 351

Query: 432  SKASSFGASVKIYPCLEAGSDMHIE---SYSASPEDADNREPPLESQI-NDNRDESDTVE 265
             +     A  K   C +   D   +   +  A P   D     ++ ++  D+R+      
Sbjct: 352  GRKDGEAAVAK---CSDQKGDEPAKWQPTPKAQPMVHDGELDHIQDEVCKDDRE------ 402

Query: 264  ADVKVCDICGDAGREYLLAICSRCSDGAEHTYCMRIMMDKVPEGEWICEGCTLKEESENQ 85
             +VKVCDICGD G+E  LA CS+CSDGAEH YCMR  ++KVPEG W+CE C L +E++ Q
Sbjct: 403  -NVKVCDICGDVGQEEKLATCSKCSDGAEHIYCMREKLEKVPEGNWMCEDCMLGDENKRQ 461

Query: 84   KQDKFETELGTS-KASCVNG--ESQTNSGA 4
            K++ FE E     + S +N   ++  NSGA
Sbjct: 462  KKNNFEKEEAVQLEKSSLNEIIKNSKNSGA 491

