BLASTX nr result

ID: Cheilocostus21_contig00056370 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cheilocostus21_contig00056370
         (1189 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_009410492.1| PREDICTED: uncharacterized protein LOC103992...   495   e-168
ref|XP_009395974.1| PREDICTED: uncharacterized protein LOC103981...   491   e-166
ref|XP_008788040.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   429   e-142
ref|XP_010931415.1| PREDICTED: uncharacterized protein LOC105052...   425   e-139
ref|XP_020101154.1| uncharacterized protein LOC109719057 [Ananas...   401   e-131
gb|OAY66392.1| hypothetical protein ACMD2_15379 [Ananas comosus]      347   e-110
gb|OUZ99638.1| Protein of unknown function DUF936 [Macleaya cord...   321   e-99 
ref|XP_007013268.2| PREDICTED: uncharacterized protein LOC185886...   320   1e-99
gb|EOY30887.1| Serine/arginine repetitive matrix protein 1 [Theo...   318   9e-99
ref|XP_018822876.1| PREDICTED: uncharacterized protein LOC108992...   316   6e-98
ref|XP_020591643.1| uncharacterized protein LOC110032368 [Phalae...   314   3e-97
emb|CBI28115.3| unnamed protein product, partial [Vitis vinifera]     312   5e-97
ref|XP_002281524.1| PREDICTED: uncharacterized protein LOC100245...   312   2e-96
gb|PNS93155.1| hypothetical protein POPTR_018G072500v3 [Populus ...   304   4e-96
gb|PIA38521.1| hypothetical protein AQUCO_02700017v1 [Aquilegia ...   311   7e-96
gb|OMO57982.1| hypothetical protein COLO4_34937 [Corchorus olito...   307   2e-95
ref|XP_011009205.1| PREDICTED: uncharacterized protein LOC105114...   307   1e-94
gb|PNS93156.1| hypothetical protein POPTR_018G072500v3 [Populus ...   304   3e-94
gb|OMO80794.1| hypothetical protein CCACVL1_12741 [Corchorus cap...   305   8e-94
ref|XP_002324965.2| hypothetical protein POPTR_0018s06270g [Popu...   304   1e-93

>ref|XP_009410492.1| PREDICTED: uncharacterized protein LOC103992498 [Musa acuminata
            subsp. malaccensis]
          Length = 688

 Score =  495 bits (1275), Expect = e-168
 Identities = 275/400 (68%), Positives = 299/400 (74%), Gaps = 9/400 (2%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            GEHRS+VLQV GIVPA+SAS  +DLWPSHGFYLQLSDS NSTYV LSDADAD +LSSRPQ
Sbjct: 24   GEHRSSVLQVTGIVPAISASAEEDLWPSHGFYLQLSDSANSTYVALSDADADVVLSSRPQ 83

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQLVHVDR +FA PVPRA+GLRPVPG+R  PF GSP+PLVARSSPDHRGFVIQPAS TD
Sbjct: 84   LGQLVHVDRLQFAYPVPRAIGLRPVPGSRPYPFLGSPDPLVARSSPDHRGFVIQPASPTD 143

Query: 362  AGPPL-VSSSLRSDHLRQEKEKRTVFAAKENVVSTG--------KPRRFSSPNASKLSAK 514
            AG P   SSSL S+  R E+EKRTVFAAKENVVS+G        KPRRF SP A+KL+A+
Sbjct: 144  AGHPFHFSSSLGSNPSRLEEEKRTVFAAKENVVSSGKNHGDAAQKPRRFPSPAAAKLAAR 203

Query: 515  KSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXXXXXKCEVPSLVAAKEEN 694
            KS  G  N +G+ PRDPSP  K                        KCEVPSLVAAKEEN
Sbjct: 204  KSGPGSGNGSGEQPRDPSPVLKMSSRPSSPASMGRAASRASSPIPSKCEVPSLVAAKEEN 263

Query: 695  RRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRLSGGLKVASPAAGESXXXX 874
            RRVAREPAIIVPSRYRQPSPV RK AASPMGRRGS SPARR SGGLKV+SPAAGE     
Sbjct: 264  RRVAREPAIIVPSRYRQPSPV-RKAAASPMGRRGSMSPARRPSGGLKVSSPAAGEGGGKK 322

Query: 875  XXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAESKEKNGSKNKVDIESILRTQV 1054
                    I+R SDAL  SVKSIRKSWDD S   VG +ESKEK GSK+KVD ESILRTQV
Sbjct: 323  KVGLVVAGISRASDALVSSVKSIRKSWDDPSASAVG-SESKEKGGSKSKVDKESILRTQV 381

Query: 1055 AISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLLESEKP 1174
            AISRRLSD GG+  SNE  +S +  RTS K+E   ESEKP
Sbjct: 382  AISRRLSDAGGLPNSNEEAASNDMPRTSRKMESFSESEKP 421


>ref|XP_009395974.1| PREDICTED: uncharacterized protein LOC103981097 [Musa acuminata
            subsp. malaccensis]
          Length = 729

 Score =  491 bits (1265), Expect = e-166
 Identities = 269/404 (66%), Positives = 299/404 (74%), Gaps = 10/404 (2%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            GEHRSA+LQV+GIVPALSAST DDLWPSHGFYLQLSDSVNSTYV+LSDADADA+LSSR Q
Sbjct: 24   GEHRSAILQVVGIVPALSASTGDDLWPSHGFYLQLSDSVNSTYVSLSDADADAVLSSRAQ 83

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQLVHVDR +FA PVPRAVGLRPVPGAR  PF GSP+PLVA S+PDHRGF+IQ AS  +
Sbjct: 84   LGQLVHVDRLRFAHPVPRAVGLRPVPGARPHPFVGSPDPLVALSAPDHRGFIIQAASPAE 143

Query: 362  AGPPLV-SSSLRSDHLRQEKEKRTVFAAKENVV---------STGKPRRFSSPNASKLSA 511
            +GPPL+ S+S RS+    E+EKRTVFAAKENVV         + GKPRRFSS   SKL+A
Sbjct: 144  SGPPLLPSASHRSNLPHLEEEKRTVFAAKENVVVGSGKNQSDAAGKPRRFSSTATSKLTA 203

Query: 512  KKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXXXXXKCEVPSLVAAKEE 691
            +K+  G  N TG+  RDPSP  KT                       KCEVPSLV AKE+
Sbjct: 204  RKNGPGSGNGTGEQLRDPSPALKTSSRPSSPALGGRASSRPSSPVPSKCEVPSLVGAKED 263

Query: 692  NRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRLSGGLKVASPAAGESXXX 871
            NRRVAREPAIIVPSRYRQPSPVGRK AASPMGRRGS SPARRLSGGLKVASPA G+    
Sbjct: 264  NRRVAREPAIIVPSRYRQPSPVGRKAAASPMGRRGSMSPARRLSGGLKVASPATGDGGGK 323

Query: 872  XXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAESKEKNGSKNKVDIESILRTQ 1051
                     I+R SD+L  SVKSIRKSWDDSS   V A+E KEK GSK+K+D ES LRTQ
Sbjct: 324  KKIGLVVAGISRGSDSLVASVKSIRKSWDDSSPSSVVASEPKEKEGSKSKLDKESFLRTQ 383

Query: 1052 VAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLLESEKPIMA 1183
             AISRRLSD  G+Q ++   SS EKRRTS K E   ES+K  MA
Sbjct: 384  AAISRRLSDAEGVQANSAEASSDEKRRTSRKTESFSESDKNYMA 427


>ref|XP_008788040.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103705912
            [Phoenix dactylifera]
          Length = 700

 Score =  429 bits (1103), Expect = e-142
 Identities = 259/428 (60%), Positives = 288/428 (67%), Gaps = 34/428 (7%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            GEHRSAVLQVIGIVPALSAST DDLWPSHGFYLQLSDS  STYV+LSDADAD+IL +RPQ
Sbjct: 24   GEHRSAVLQVIGIVPALSASTADDLWPSHGFYLQLSDSAYSTYVSLSDADADSILCNRPQ 83

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQLVHVDR  FA PVPRA G+RP+PG R   F GSPEPLVARS P  RGFVIQPAS  D
Sbjct: 84   LGQLVHVDRLHFAHPVPRASGVRPLPG-RPNXFLGSPEPLVARSDPSRRGFVIQPASHAD 142

Query: 362  AGPPLV---SSSLRSDHLRQEK-----------------EKRTVFAAKENVV-------- 457
            AGPPL+   SSSLRS+   +                   EKRTVFAAKENVV        
Sbjct: 143  AGPPLIPSSSSSLRSNPFPEGNTFGEEGMVFASKENTGGEKRTVFAAKENVVGATAKTAG 202

Query: 458  STGKPRRFSSPNASKLSAKKSDTGGVNCTGDLPRDPSP-----RAKTIXXXXXXXXXXXX 622
             +   RRFSSP  +KL+A+KS  GG   TG+  RDPSP     +A +             
Sbjct: 203  ESAAKRRFSSPAGAKLAARKS--GGGGGTGE-QRDPSPAVGQGKAGSRSSSPALGGAARG 259

Query: 623  XXXXXXXXXXKCEVPSLVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGST 802
                      KC VPSLVAAKEENRRVAREPAIIVPSRYRQPSPV RK A SPMGRRGS 
Sbjct: 260  GSRSSSPVPSKCIVPSLVAAKEENRRVAREPAIIVPSRYRQPSPVARKAAVSPMGRRGSM 319

Query: 803  SPARRLSGGLKVASPAAGE-SXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVV 979
            SP RRLS GLK ASPAAGE              I++VSDAL GSVKS+RKSWDDS+   V
Sbjct: 320  SPGRRLSAGLK-ASPAAGEGGGGKKKVGIVVAGISKVSDALMGSVKSVRKSWDDSAANTV 378

Query: 980  GAAESKEKNGSKNKVDIESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLL 1159
             ++E KEK GSK+KVD E+ILRTQ A+SRR+SD  G Q +NE +S  EK +   +IE   
Sbjct: 379  VSSELKEKGGSKSKVDKEAILRTQAAMSRRISDATGAQSNNEESSPNEKPKPIKRIELTS 438

Query: 1160 ESEKPIMA 1183
            ESEKP  A
Sbjct: 439  ESEKPSCA 446


>ref|XP_010931415.1| PREDICTED: uncharacterized protein LOC105052338 [Elaeis guineensis]
          Length = 749

 Score =  425 bits (1092), Expect = e-139
 Identities = 255/422 (60%), Positives = 285/422 (67%), Gaps = 32/422 (7%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            GEHRSAVLQVI IVPALSAST  DLWPSHGFYLQLSDS  STYV+LSDADADAILS+RPQ
Sbjct: 24   GEHRSAVLQVIAIVPALSASTAHDLWPSHGFYLQLSDSAYSTYVSLSDADADAILSNRPQ 83

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQLVHVDR  FA PVPRA G+RP+PG R QPF GSPEPLVARS P  RGFVIQPAS  D
Sbjct: 84   LGQLVHVDRLHFAHPVPRASGVRPLPG-RPQPFLGSPEPLVARSDPSRRGFVIQPASPAD 142

Query: 362  AGPPLV-SSSLRSDHLRQEK-----------------EKRTVFAAKENVV--------ST 463
            AGPPL+ SSSLRS+   Q+                  EKRTVFA KENVV         +
Sbjct: 143  AGPPLIPSSSLRSNPFPQDNPFGEEETVSATKENTGGEKRTVFAPKENVVGAATKTFGES 202

Query: 464  GKPRRFSSPNASKLSAKKSDTGGVNCTGDLPRDPSP-----RAKTIXXXXXXXXXXXXXX 628
               RRFSSP  +KL+A+KS  GG   TG+  RDPSP     +A +               
Sbjct: 203  AAKRRFSSPAGAKLAARKSSGGG--GTGE-QRDPSPAVGQGKAGSRSSSPALGGAARGGS 259

Query: 629  XXXXXXXXKCEVPSLVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSP 808
                    KC VPSLVAAKEENRRVAREP+IIVPSRYRQPSPV RK A SPMGRR S SP
Sbjct: 260  RSSSPVPSKCIVPSLVAAKEENRRVAREPSIIVPSRYRQPSPVARKAAGSPMGRRSSMSP 319

Query: 809  ARRLSGGLKVASPAAGE-SXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGA 985
             RRLS GLK ASPA GE              I++VSDAL GSVKS+RKSWDDS+   + +
Sbjct: 320  GRRLSSGLK-ASPAVGEGGGGKKKVGIMVAGISKVSDALMGSVKSVRKSWDDSAVNTLVS 378

Query: 986  AESKEKNGSKNKVDIESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLLES 1165
            +E KEK GSK+KVD  +ILRTQ AISRR+SD  G Q ++E +S  EK + + KIE   ES
Sbjct: 379  SELKEKGGSKSKVDKGAILRTQAAISRRISDASGAQSNDEESSPNEKPKLTKKIELTSES 438

Query: 1166 EK 1171
            EK
Sbjct: 439  EK 440


>ref|XP_020101154.1| uncharacterized protein LOC109719057 [Ananas comosus]
          Length = 690

 Score =  401 bits (1031), Expect = e-131
 Identities = 234/389 (60%), Positives = 272/389 (69%), Gaps = 6/389 (1%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            GEHRSAVLQV GIVPALSAST DDLWP+HGFYLQLSDS++STYV LSD DADA+LS+RPQ
Sbjct: 24   GEHRSAVLQVTGIVPALSASTADDLWPAHGFYLQLSDSLHSTYVALSDRDADAVLSARPQ 83

Query: 182  LGQLVHVDRFKFAQ-PVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASAT 358
            LGQLVHVDR +F + PVPRA+GLRPVP +R  PFAGSPEPLVARS+P HRGFVIQPAS  
Sbjct: 84   LGQLVHVDRLRFDRPPVPRALGLRPVPASRPLPFAGSPEPLVARSAPSHRGFVIQPASPA 143

Query: 359  DAGPPLV---SSSLRSDHLRQEKEKRTVFAAKENVVSTGK--PRRFSSPNASKLSAKKSD 523
             AGPPL+   SSS +S +   E +     + KEN  +      RRFSSP  +KL+A+KS+
Sbjct: 144  HAGPPLLPSSSSSRKSPNFPLEPDNS--ISDKENAAAAAPNGKRRFSSPAGAKLAARKSN 201

Query: 524  TGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXXXXXKCEVPSLVAAKEENRRV 703
                   G+  R  SP   +                       KC VPSLVAAKEENRR 
Sbjct: 202  -------GESSRPSSPSVGS---------------RPSSPAPSKCVVPSLVAAKEENRRA 239

Query: 704  AREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRLSGGLKVASPAAGESXXXXXXX 883
            AREPAI+VPSRYR PSP GRK A+SPMGRRGS SPARRLSGGLK+ SPA GE        
Sbjct: 240  AREPAIVVPSRYRNPSPAGRKSASSPMGRRGSMSPARRLSGGLKM-SPATGEK---KKGG 295

Query: 884  XXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAESKEKNGSKNKVDIESILRTQVAIS 1063
                 I++VSDAL GSVKS+RKSWDDSS     ++E K+K GSK+KVD E+ILRTQVA+S
Sbjct: 296  VVVAGISKVSDALLGSVKSMRKSWDDSSV----SSELKDKGGSKSKVDKEAILRTQVAMS 351

Query: 1064 RRLSDFGGIQISNEVTSSAEKRRTSSKIE 1150
            RRLSD    Q +NE  S+ EK + S K+E
Sbjct: 352  RRLSDASWEQSNNEEASTNEKPKPSKKVE 380


>gb|OAY66392.1| hypothetical protein ACMD2_15379 [Ananas comosus]
          Length = 682

 Score =  347 bits (890), Expect = e-110
 Identities = 213/384 (55%), Positives = 251/384 (65%), Gaps = 1/384 (0%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            GEHRSAVLQV GIVPALSAST DDLWP+HGFYLQLSDS++STYV LSD DADA+LS+RPQ
Sbjct: 24   GEHRSAVLQVTGIVPALSASTADDLWPAHGFYLQLSDSLHSTYVALSDRDADAVLSARPQ 83

Query: 182  LGQLVHVDRFKFAQ-PVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASAT 358
            LGQLVHVDR +F + PVPRA+GLRPVP +R  P   SP P      P  R   + P+S++
Sbjct: 84   LGQLVHVDRLRFDRPPVPRALGLRPVPASRPSP---SPAPRAPPRLPRPRRPPLLPSSSS 140

Query: 359  DAGPPLVSSSLRSDHLRQEKEKRTVFAAKENVVSTGKPRRFSSPNASKLSAKKSDTGGVN 538
                P  +  L  D+   +KE     A        GK RRFSSP  +KL+A+KS+     
Sbjct: 141  SRKSP--NFPLEPDNSISDKENAAAAA------PNGK-RRFSSPAGAKLAARKSN----- 186

Query: 539  CTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXXXXXKCEVPSLVAAKEENRRVAREPA 718
              G+  R  SP   +                       KC VPSLVAAKEENRR AREPA
Sbjct: 187  --GESSRPSSPSVGS---------------RPSSPAPSKCVVPSLVAAKEENRRAAREPA 229

Query: 719  IIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRLSGGLKVASPAAGESXXXXXXXXXXXX 898
            I+VPSRYR PSP GRK A+SPMGRRGS SPARRLSGGLK+ SPA GE             
Sbjct: 230  IVVPSRYRNPSPAGRKSASSPMGRRGSMSPARRLSGGLKM-SPATGEK---KKGGVVVAG 285

Query: 899  IARVSDALAGSVKSIRKSWDDSSTCVVGAAESKEKNGSKNKVDIESILRTQVAISRRLSD 1078
            I++VSDAL GSVKS+RKSWDDSS     ++E K+K GSK+KVD E+ILRTQVA+SRRLSD
Sbjct: 286  ISKVSDALLGSVKSMRKSWDDSSV----SSELKDKGGSKSKVDKEAILRTQVAMSRRLSD 341

Query: 1079 FGGIQISNEVTSSAEKRRTSSKIE 1150
                Q +NE  S+ EK + S K+E
Sbjct: 342  ASWEQSNNEEASTNEKPKPSKKVE 365


>gb|OUZ99638.1| Protein of unknown function DUF936 [Macleaya cordata]
          Length = 716

 Score =  321 bits (822), Expect = e-99
 Identities = 202/402 (50%), Positives = 252/402 (62%), Gaps = 17/402 (4%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            GEHRS +LQVIGIVPAL+ +   DLWP+HGF++QLSDS NSTYV+LS+ D D IL++R Q
Sbjct: 20   GEHRSVLLQVIGIVPALAGA---DLWPNHGFFVQLSDSTNSTYVSLSERDNDLILTNRLQ 76

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQ  +VDR  F  PVPR  GLRP+ G    PF GS EPL+A+ SP  RGFVIQP S +D
Sbjct: 77   LGQFAYVDRLLFDSPVPRVSGLRPISG--RHPFVGSLEPLIAKFSPSKRGFVIQPVSDSD 134

Query: 362  AGPPLVS---SSLRSDHLRQEKE--KRTVFAAKEN--VVSTG----------KPRRFSSP 490
            A    +S   S+ +S+ ++ +K+   R V AA++N  VV++G          KPRRFSSP
Sbjct: 135  ASLHPISAYISNKKSEEVKSDKDTTTRPVLAARDNVPVVNSGNNFEGSKQSEKPRRFSSP 194

Query: 491  NASKLSAKKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXXXXXKCEVPS 670
             ++K   + S  G  N    + R+PSP +K                        KC VPS
Sbjct: 195  ASTK--QRSSSVGKKNGGSVVEREPSPASKV-------------NSRSVSPAPSKCVVPS 239

Query: 671  LVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRLSGGLKVASPA 850
            LVAAK+ENR+ +REPAI+VPSRYRQPSP GRK  ASP  RR S SP RRLSGGLKV SP 
Sbjct: 240  LVAAKDENRKTSREPAIVVPSRYRQPSPNGRK-QASPNARRVSLSPGRRLSGGLKV-SPV 297

Query: 851  AGESXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAESKEKNGSKNKVDI 1030
             G+S            I++VS+AL GS KS+RKSWD+        AE KEK   K+K D+
Sbjct: 298  VGDSASKKKMATIVAGISKVSEALVGSGKSMRKSWDEQPE-FADLAEQKEKVVVKSKPDM 356

Query: 1031 ESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERL 1156
             +ILRTQVA+SRRLSD    Q + E  SS EK ++S K E L
Sbjct: 357  RAILRTQVAMSRRLSDANAGQPNQEDASSNEKPKSSCKTEGL 398


>ref|XP_007013268.2| PREDICTED: uncharacterized protein LOC18588649 [Theobroma cacao]
          Length = 708

 Score =  320 bits (821), Expect = 1e-99
 Identities = 206/422 (48%), Positives = 254/422 (60%), Gaps = 26/422 (6%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            G+HRSA+LQVIGIVPAL+ S   DLWP+HGFY+QLSDS+NSTYV+LS+ D + ILS+R Q
Sbjct: 24   GDHRSALLQVIGIVPALAGS---DLWPNHGFYVQLSDSLNSTYVSLSERDTELILSNRLQ 80

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQ V+VDRF F  PVPR  G+RP+ G    PF GSP+PL+AR S   R FVIQP S ++
Sbjct: 81   LGQFVYVDRFHFDSPVPRVSGIRPIAG--RHPFVGSPDPLIARISSSKRDFVIQPVSESE 138

Query: 362  AGPPLVSSSLRSDHLRQEK-------------EKRTVFAAKENV-----------VSTGK 469
                 ++  L +  L Q++             + R   A ++NV           V+   
Sbjct: 139  YSVDPIAVYLSNKKLEQQQTPTENKDSKIEKPKTRQPLAPRDNVRVNENFESESKVTEKP 198

Query: 470  PRRFSSPNASK--LSAKKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXX 643
            P+RFSSP  +K  +SA K     V     + RDPSP  K                     
Sbjct: 199  PQRFSSPATAKRSVSAVKKTNAAV-----VERDPSPAGK--------------GKRSASP 239

Query: 644  XXXKCEVPSLVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRLS 823
               KC VPSL+AAKEENR+VAREPAI+VPSRYRQPSP GRK  ASP  RRGS SP RRLS
Sbjct: 240  VPSKCVVPSLMAAKEENRKVAREPAIVVPSRYRQPSPNGRK-QASPSARRGSLSPGRRLS 298

Query: 824  GGLKVASPAAGESXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAESKEK 1003
            G LKV SPA G+S            I++VS+AL GS KS RKSWD+      G+ E KEK
Sbjct: 299  GVLKV-SPAVGDS--KKKMATIVAGISKVSEALVGSAKSSRKSWDEQPE--KGSGEQKEK 353

Query: 1004 NGSKNKVDIESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLLESEKPIMA 1183
              SK+K D+++ILRTQ AISRRLSD  G Q SN+  SS+ ++ T S  E  L +EK   A
Sbjct: 354  GSSKSKPDLQAILRTQAAISRRLSDVHG-QKSNDENSSSNEKTTDSPSEDSLATEKLTCA 412

Query: 1184 AG 1189
             G
Sbjct: 413  GG 414


>gb|EOY30887.1| Serine/arginine repetitive matrix protein 1 [Theobroma cacao]
          Length = 708

 Score =  318 bits (815), Expect = 9e-99
 Identities = 205/422 (48%), Positives = 253/422 (59%), Gaps = 26/422 (6%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            G+HRSA+LQVIGIVPAL+ S   DLWP+HGFY+QLSDS+NSTYV+LS+ D + ILS+R Q
Sbjct: 24   GDHRSALLQVIGIVPALAGS---DLWPNHGFYVQLSDSLNSTYVSLSERDTELILSNRLQ 80

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQ V+VDRF F  PVPR  G+RP+ G    PF GSP+PL+AR S   R FVIQP S ++
Sbjct: 81   LGQFVYVDRFHFDSPVPRVSGIRPIAG--RHPFVGSPDPLIARISSSKRDFVIQPVSESE 138

Query: 362  AGPPLVSSSLRSDHLRQEK-------------EKRTVFAAKENV-----------VSTGK 469
                 ++  L +  L Q++             + R   A ++NV           V+   
Sbjct: 139  YSVDPIAVYLSNKKLEQQQTPTENKDSKIEKPKTRQPLAPRDNVRVNENLESESKVTEKP 198

Query: 470  PRRFSSPNASK--LSAKKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXX 643
            P+RFSSP  +K  +SA K     V     + RDPSP  K                     
Sbjct: 199  PQRFSSPATAKRSVSAVKKTNAAV-----VERDPSPAGK--------------GKRSASP 239

Query: 644  XXXKCEVPSLVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRLS 823
               KC VPSL+AAKEENR+VAREPAI+VPSRYRQPSP GRK  ASP  RRGS SP RRLS
Sbjct: 240  VPSKCVVPSLMAAKEENRKVAREPAIVVPSRYRQPSPNGRK-QASPSARRGSLSPGRRLS 298

Query: 824  GGLKVASPAAGESXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAESKEK 1003
            G LKV SPA G+S            I++VS+AL GS KS RKSWD+      G+ E KEK
Sbjct: 299  GVLKV-SPAVGDS--KKKMATIVAGISKVSEALVGSAKSSRKSWDEQPE--KGSGEQKEK 353

Query: 1004 NGSKNKVDIESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLLESEKPIMA 1183
              SK+K D+++ILRTQ AISRRLSD  G Q SN+  SS+ ++ T S  E  L + K   A
Sbjct: 354  GSSKSKPDLQAILRTQAAISRRLSDVHG-QKSNDENSSSNEKTTDSPSEDSLATAKLTCA 412

Query: 1184 AG 1189
             G
Sbjct: 413  GG 414


>ref|XP_018822876.1| PREDICTED: uncharacterized protein LOC108992700 [Juglans regia]
          Length = 716

 Score =  316 bits (810), Expect = 6e-98
 Identities = 195/426 (45%), Positives = 256/426 (60%), Gaps = 30/426 (7%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            G+HRSA+LQVIGIVPAL+ S   DLWP+HGFY+QLSDS+NSTYV+LS+ D D IL++R Q
Sbjct: 24   GDHRSALLQVIGIVPALAGS---DLWPNHGFYVQLSDSLNSTYVSLSERDNDLILTNRLQ 80

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQ V+VDRF+F  PVPR  G+RP+ G    PF G+PEPL+AR S   R FVIQP + +D
Sbjct: 81   LGQFVYVDRFEFDSPVPRVCGIRPIAG--RHPFVGTPEPLIARISASKREFVIQPVADSD 138

Query: 362  AGPPLVSSSLRSDHLRQEKEK----------------RTVFAAKENVV-----------S 460
                 ++  L S  L + + +                R   A ++N++           +
Sbjct: 139  QSADPIAIYLSSKKLEEARSENKESLKLESKAEKGRSRQALAPRDNLLVGNVGNYDEPKT 198

Query: 461  TGKPRRFSSPNASKLSAKKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXX 640
            + +P+RFSSP     SAK+S + G      + RDPSP AK                    
Sbjct: 199  SDRPQRFSSP----ASAKRSVSRGKKNVAVVERDPSPAAK--------------GKRSAS 240

Query: 641  XXXXKCEVPSLVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRL 820
                KC VPSLV+A+EENR+ +REP+IIVPSRYRQPSP GRK  ASP  RR S SP RRL
Sbjct: 241  PVPSKCVVPSLVSAREENRKTSREPSIIVPSRYRQPSPNGRK-QASPSARRASLSPGRRL 299

Query: 821  SGGLKVASPAAG---ESXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAE 991
            SGG+K++   AG   +S            I++VS+AL GS K+IRKSWD+    +    E
Sbjct: 300  SGGVKLSPMVAGGPMDSTSKKKMATIVAGISKVSEALVGSAKTIRKSWDE-PPAMAAPVE 358

Query: 992  SKEKNGSKNKVDIESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLLESEK 1171
             KEK  SKNK ++++ILRTQ AISRRLSD  G ++S + +SS EK + SS    L+E + 
Sbjct: 359  QKEKAVSKNKQNLQAILRTQAAISRRLSDVNGRKLSTDDSSSDEKMKPSSPDSCLIEEKL 418

Query: 1172 PIMAAG 1189
             + A G
Sbjct: 419  NLGALG 424


>ref|XP_020591643.1| uncharacterized protein LOC110032368 [Phalaenopsis equestris]
          Length = 712

 Score =  314 bits (805), Expect = 3e-97
 Identities = 206/414 (49%), Positives = 248/414 (59%), Gaps = 22/414 (5%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            GEHRSAVLQV GIVPA S    DDLWPSHGFYLQLSDS++STYV+LSD DAD I+S+RPQ
Sbjct: 24   GEHRSAVLQVTGIVPAPS----DDLWPSHGFYLQLSDSLHSTYVSLSDQDADLIVSNRPQ 79

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHR-GFVIQPASAT 358
            LGQLVH+DR  F  PVPRA+GLRP+   R  PF GSPEPL+A S+P H  GFVIQPAS +
Sbjct: 80   LGQLVHLDRLHFDLPVPRALGLRPISSPRHHPFIGSPEPLIALSTPSHAPGFVIQPASPS 139

Query: 359  DAGPPLVSSSLRSDHLRQEKEKRTVFAAKEN-------------VVSTGKP-RRFSSP-- 490
             A   +              + R VFA KEN              +S  +P RRFSSP  
Sbjct: 140  SAANTV--------------KLRPVFAPKENFLLAASNPASSGSTLSVSRPKRRFSSPAS 185

Query: 491  -NASKLSAKKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXXXXXKCEVP 667
             N S L  K+    G        R PSP                           KC+VP
Sbjct: 186  RNPSPLPEKQGSRVG-------SRAPSP------------------------VPSKCDVP 214

Query: 668  SLVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGA-ASPMGRRGSTSP-ARRLSGGLKVA 841
            SLVAA+EENR+VAREPAI+VPSRYRQPSPV RK A  SPMGR+G+ SP +RRLSGG+K  
Sbjct: 215  SLVAAREENRKVAREPAIVVPSRYRQPSPVARKTAGVSPMGRKGAASPGSRRLSGGIKF- 273

Query: 842  SPAAGESXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDD-SSTCVVGAAESKEKNGSKN 1018
            SPAAGE+            I RV DAL GS +S+RKSWDD SS+    +AE+K K GSK 
Sbjct: 274  SPAAGEAGGKRKVGLVVAGITRVPDALVGSGRSVRKSWDDYSSSSENSSAETKGKPGSKG 333

Query: 1019 KVDIESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSK-IERLLESEKPI 1177
            K   ++  + QVA + +L D    +  NE   + EKRR S K +  ++E+  P+
Sbjct: 334  KAQEQATRKAQVATACQLIDTVKDKSHNEEELTEEKRRKSIKTVASVVENSMPM 387


>emb|CBI28115.3| unnamed protein product, partial [Vitis vinifera]
          Length = 662

 Score =  312 bits (800), Expect = 5e-97
 Identities = 200/419 (47%), Positives = 246/419 (58%), Gaps = 24/419 (5%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            GEHRSA+LQVIGIVPAL+ S   DLWP+HGFY+QLSDS+NSTYV+LSD D D IL++R Q
Sbjct: 24   GEHRSALLQVIGIVPALAGS---DLWPNHGFYVQLSDSLNSTYVSLSDRDTDLILTNRLQ 80

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQ V+VDRF F  PVPR  G+RP+ G    PF GSPEPL+AR SP  + FVIQP S  D
Sbjct: 81   LGQFVYVDRFDFDSPVPRVCGIRPIAG--RHPFVGSPEPLIARISPSKKDFVIQPVSDWD 138

Query: 362  AGPPLVSSSLRSDHLRQEK---------------EKRTVFAAKEN------VVSTGKPRR 478
                 +++ L +  +   K                 R V   ++N         + +P+R
Sbjct: 139  QSVDPIAAYLSNKKIDDVKNDGKESKIETKGEKGRTRQVLGTRDNNGDLDETKVSDRPQR 198

Query: 479  FSSPNASKLSAKKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXXXXXKC 658
            FSSP      AK+S + G        RDPSP  K                        KC
Sbjct: 199  FSSP----AGAKRSVSAGKKNVAVAERDPSPAGK--------------GKRSASPVPSKC 240

Query: 659  EVPSLVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRLSGGLKV 838
             VPSLV A+EENR+ +REPAIIVPSRYRQPSP GRK  ASP  RR S SP RRLSGGLK 
Sbjct: 241  MVPSLVVAREENRKTSREPAIIVPSRYRQPSPNGRK-QASPNARRASISPGRRLSGGLKF 299

Query: 839  ASPAAG---ESXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAESKEKNG 1009
             SPA G   +S            I++VS+AL GS K+ RKSWD+     VG+ E KEK+ 
Sbjct: 300  -SPAVGGAPDSTSKKKMATIVAGISKVSEALVGSAKAGRKSWDE-PPAAVGSGELKEKSL 357

Query: 1010 SKNKVDIESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLLESEKPIMAA 1186
            +K K D+++ILRTQ AISRRLSD  G Q + + +S+ EK + +S  E  L  EKP   A
Sbjct: 358  AKIKPDVQAILRTQAAISRRLSDVHGRQANQDDSSTNEKTKPNS-AEGCLVPEKPTCEA 415


>ref|XP_002281524.1| PREDICTED: uncharacterized protein LOC100245597 [Vitis vinifera]
          Length = 710

 Score =  312 bits (800), Expect = 2e-96
 Identities = 200/419 (47%), Positives = 246/419 (58%), Gaps = 24/419 (5%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            GEHRSA+LQVIGIVPAL+ S   DLWP+HGFY+QLSDS+NSTYV+LSD D D IL++R Q
Sbjct: 24   GEHRSALLQVIGIVPALAGS---DLWPNHGFYVQLSDSLNSTYVSLSDRDTDLILTNRLQ 80

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQ V+VDRF F  PVPR  G+RP+ G    PF GSPEPL+AR SP  + FVIQP S  D
Sbjct: 81   LGQFVYVDRFDFDSPVPRVCGIRPIAG--RHPFVGSPEPLIARISPSKKDFVIQPVSDWD 138

Query: 362  AGPPLVSSSLRSDHLRQEK---------------EKRTVFAAKEN------VVSTGKPRR 478
                 +++ L +  +   K                 R V   ++N         + +P+R
Sbjct: 139  QSVDPIAAYLSNKKIDDVKNDGKESKIETKGEKGRTRQVLGTRDNNGDLDETKVSDRPQR 198

Query: 479  FSSPNASKLSAKKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXXXXXKC 658
            FSSP      AK+S + G        RDPSP  K                        KC
Sbjct: 199  FSSP----AGAKRSVSAGKKNVAVAERDPSPAGK--------------GKRSASPVPSKC 240

Query: 659  EVPSLVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRLSGGLKV 838
             VPSLV A+EENR+ +REPAIIVPSRYRQPSP GRK  ASP  RR S SP RRLSGGLK 
Sbjct: 241  MVPSLVVAREENRKTSREPAIIVPSRYRQPSPNGRK-QASPNARRASISPGRRLSGGLKF 299

Query: 839  ASPAAG---ESXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAESKEKNG 1009
             SPA G   +S            I++VS+AL GS K+ RKSWD+     VG+ E KEK+ 
Sbjct: 300  -SPAVGGAPDSTSKKKMATIVAGISKVSEALVGSAKAGRKSWDE-PPAAVGSGELKEKSL 357

Query: 1010 SKNKVDIESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLLESEKPIMAA 1186
            +K K D+++ILRTQ AISRRLSD  G Q + + +S+ EK + +S  E  L  EKP   A
Sbjct: 358  AKIKPDVQAILRTQAAISRRLSDVHGRQANQDDSSTNEKTKPNS-AEGCLVPEKPTCEA 415


>gb|PNS93155.1| hypothetical protein POPTR_018G072500v3 [Populus trichocarpa]
          Length = 468

 Score =  304 bits (779), Expect = 4e-96
 Identities = 194/416 (46%), Positives = 252/416 (60%), Gaps = 21/416 (5%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            G+HRS +LQVIGIVPAL+ S   DLWP+ GFY+QLSDS+NSTYV+LS+ D D IL++R Q
Sbjct: 24   GDHRSPLLQVIGIVPALAGS---DLWPNQGFYVQLSDSLNSTYVSLSERDTDLILTNRLQ 80

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQ V++DRF F  PVPR  G+RP+ G  +  F G+PEPL+AR S   + FVIQP + ++
Sbjct: 81   LGQFVYIDRFDFDSPVPRVSGIRPIAGRHS--FVGTPEPLIARISASKKEFVIQPVADSE 138

Query: 362  AGPPLVSSSL----------RSDHLRQ----EKEKRTVFAAKENVV--STGKPRRFSSPN 493
                 ++  L          R+DH ++     K  R   A ++NV+   T   +RFSSP 
Sbjct: 139  YSVDPIAVYLSNNKKFDEFPRNDHNKKGEVTAKVTRQALAPRDNVMVDETATAKRFSSPA 198

Query: 494  ASK-----LSAKKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXXXXXKC 658
             +K      +AK+S + G      + RDPSP AK                        KC
Sbjct: 199  TAKRFSSPATAKRSVSVGKKNAALVERDPSPAAK--------------GKRSASPVPSKC 244

Query: 659  EVPSLVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRLSGGLKV 838
             VPSL+AAKEENR+VAREPAIIVPSRYRQPSP GRK   SP  RR S SP +RLS G+K+
Sbjct: 245  MVPSLLAAKEENRKVAREPAIIVPSRYRQPSPSGRK-QPSPNARRASISPGKRLS-GVKL 302

Query: 839  ASPAAGESXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAESKEKNGSKN 1018
             SPA  +S            I++VS+AL GS KS RK+WD+     VG+ E KEK  +K 
Sbjct: 303  -SPAVSDSVGKKKIANIVAGISKVSEALVGSAKSSRKNWDE-IPAAVGSGEMKEKGEAKK 360

Query: 1019 KVDIESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLLESEKPIMAA 1186
            K D+++ILRTQ A+SRRLSD    Q + + TSS EK + SS  E  L+++ P  AA
Sbjct: 361  KPDLQAILRTQAALSRRLSDANSRQSNQDETSSYEKTKPSSP-EGCLDNKNPTCAA 415


>gb|PIA38521.1| hypothetical protein AQUCO_02700017v1 [Aquilegia coerulea]
          Length = 714

 Score =  311 bits (796), Expect = 7e-96
 Identities = 193/398 (48%), Positives = 238/398 (59%), Gaps = 18/398 (4%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            GEHRSA+LQ+IGIVPAL+     +LWP+HGFY+QLSDS NSTYV+LSD D D IL++R Q
Sbjct: 24   GEHRSALLQIIGIVPALAGR---ELWPNHGFYVQLSDSSNSTYVSLSDRDNDLILTNRLQ 80

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQ  +VDR  F  PVPR  G+RPV G    PF GSPEPL+AR SP  RGFVIQP S +D
Sbjct: 81   LGQFAYVDRLDFDSPVPRVSGIRPVAG--RHPFVGSPEPLIARISPSKRGFVIQPVSDSD 138

Query: 362  AG----PPLVSSSLRSDHLRQEKEKRTVFAAKEN--------VVSTGKPRRFSSPNASK- 502
                     +SS  +S   + E+  R V A K+N           + KP+RFSSP ++K 
Sbjct: 139  PSLDPISAYISSGKKSQDTKVEERTRPVLATKDNNSFGNSVERKPSDKPKRFSSPASAKQ 198

Query: 503  ---LSAKKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXXXXXKCEVPSL 673
               +SA K +           RD SP  K                        KC VPSL
Sbjct: 199  QRSVSAGKKNVAMAVAVAVAERDASPATKA-------------SSRSASPVPSKCVVPSL 245

Query: 674  VAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRLSGGLKVASPAA 853
            V AKEENR+ +REPAI+VPSRYRQPSP GRK  ASP  RR S SP RRLS GLKV SP  
Sbjct: 246  VVAKEENRKTSREPAIVVPSRYRQPSPTGRK-QASPSTRRTSISPGRRLSVGLKV-SPIV 303

Query: 854  GESXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDD--SSTCVVGAAESKEKNGSKNKVD 1027
             +S            I++VS+AL GS K++RKSWDD  +      AAE KEK  SKN+ D
Sbjct: 304  TDSASKKKMATIAAGISKVSEALVGSAKTMRKSWDDQHAQEIATEAAEQKEKLISKNRTD 363

Query: 1028 IESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSS 1141
            +++ILRTQ  ++RRLSD    Q + E  S+ EK ++S+
Sbjct: 364  MQAILRTQTVLARRLSDAHTGQRNQEDASTNEKLKSST 401


>gb|OMO57982.1| hypothetical protein COLO4_34937 [Corchorus olitorius]
          Length = 637

 Score =  307 bits (787), Expect = 2e-95
 Identities = 202/425 (47%), Positives = 247/425 (58%), Gaps = 29/425 (6%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            G+HRS +LQVIGIVPAL+ S   DLWP+HGFY+QLSDSVNSTYV+LSD D + ILS+R Q
Sbjct: 9    GDHRSPLLQVIGIVPALAGS---DLWPNHGFYVQLSDSVNSTYVSLSDRDTELILSNRLQ 65

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQ V+VDRF F  PVPR  G+RP+ G     F GSP+PL+AR S   R FVIQP   ++
Sbjct: 66   LGQFVYVDRFHFDSPVPRVSGIRPIAGRHA--FVGSPDPLIARISSSKRDFVIQPVPESE 123

Query: 362  AGPPLVSSSLRSDHLRQEKEKRTVFAAKENVVSTGK------------------------ 469
                 ++  L +  L Q ++ +T   +K++ V   K                        
Sbjct: 124  YSVDPIAVYLSNKKLDQNQQLQTPNDSKDSKVEKSKTRQPLAPRDNVKVNENSESESKVP 183

Query: 470  ---PRRFSSPNASK--LSAKKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXX 634
               P+RFSSP  +K   SA K  T  V     + RDPSP  K                  
Sbjct: 184  EKPPQRFSSPATAKRSSSAVKKTTAAV-----VERDPSPAGK--------------GKRS 224

Query: 635  XXXXXXKCEVPSLVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPAR 814
                  KC VPSLVAAKEENR+VAREPAIIVPSRYRQPSP GR+  ASP  RR S SP R
Sbjct: 225  ASPVPSKCVVPSLVAAKEENRKVAREPAIIVPSRYRQPSPNGRR-QASPSARRASLSPGR 283

Query: 815  RLSGGLKVASPAAGESXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAES 994
            RLSG LKV SPA G+S            I++VS+AL GS KS RKSWD+      G+ E 
Sbjct: 284  RLSGVLKV-SPAVGDS--KKKMATIVAGISKVSEALVGSAKSSRKSWDEQPE--KGSVEP 338

Query: 995  KEKNGSKNKVDIESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLLESEKP 1174
            KEK   K++ D+++ILRTQ AISRRLSD    Q SN+  SS+ ++  +S  E  L SEK 
Sbjct: 339  KEKVSGKSRPDLQAILRTQAAISRRLSDVHS-QKSNDENSSSNEKPKASPPEDSLASEKS 397

Query: 1175 IMAAG 1189
              A G
Sbjct: 398  TCAGG 402


>ref|XP_011009205.1| PREDICTED: uncharacterized protein LOC105114378 [Populus euphratica]
          Length = 698

 Score =  307 bits (786), Expect = 1e-94
 Identities = 196/416 (47%), Positives = 252/416 (60%), Gaps = 21/416 (5%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            G+HRS +LQVIGIVPAL+ S   DLWP+ GFY+QLSDS+NSTYV+LS+ D D IL++R Q
Sbjct: 24   GDHRSPLLQVIGIVPALAGS---DLWPNQGFYVQLSDSLNSTYVSLSERDTDLILTNRLQ 80

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQ V++DRF F  PVPR  G+RP+ G  +  F G+PEPL+AR S   + FVIQP + ++
Sbjct: 81   LGQFVYIDRFDFDSPVPRVSGIRPIAGRHS--FVGTPEPLIARISASKKEFVIQPVADSE 138

Query: 362  AGPPLVSSSL----------RSDHLRQ----EKEKRTVFAAKENVV--STGKPRRFSSPN 493
                 ++  L          RSDH ++     K  R   A ++NV+   T   +RFSSP 
Sbjct: 139  YSVDPIAVYLSNNKKFDEFPRSDHNKKGEVTTKVTRQALAPRDNVMVDETATAKRFSSPA 198

Query: 494  ASK-----LSAKKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXXXXXKC 658
             +K      +AK+S + G      + RDPSP AK                        KC
Sbjct: 199  TAKRFSSPATAKRSVSVGKKNAALVERDPSPAAK--------------GKRSASPVPSKC 244

Query: 659  EVPSLVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRLSGGLKV 838
             VPSL+AAKEENR+VAREPAIIVPSRYRQPSP GRK   SP  RR S SP +RLS G+K+
Sbjct: 245  MVPSLLAAKEENRKVAREPAIIVPSRYRQPSPSGRK-QPSPNARRASLSPGKRLS-GVKL 302

Query: 839  ASPAAGESXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAESKEKNGSKN 1018
             SPA  +S            I++VS+AL GS KS RK+WDD     VG+ E KEK  +K 
Sbjct: 303  -SPAVSDSVGKKKIANIVAGISKVSEALVGSAKSSRKNWDD-IPAAVGSGEMKEKGETKK 360

Query: 1019 KVDIESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLLESEKPIMAA 1186
            K D+++ILRTQ A+SRRLSD    Q + + TSS EK + SS  E  L+++ P  AA
Sbjct: 361  KPDLQAILRTQAALSRRLSDANSRQSNQDETSSYEKTKPSSP-EGCLDNKNPACAA 415


>gb|PNS93156.1| hypothetical protein POPTR_018G072500v3 [Populus trichocarpa]
          Length = 637

 Score =  304 bits (779), Expect = 3e-94
 Identities = 194/416 (46%), Positives = 252/416 (60%), Gaps = 21/416 (5%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            G+HRS +LQVIGIVPAL+ S   DLWP+ GFY+QLSDS+NSTYV+LS+ D D IL++R Q
Sbjct: 24   GDHRSPLLQVIGIVPALAGS---DLWPNQGFYVQLSDSLNSTYVSLSERDTDLILTNRLQ 80

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQ V++DRF F  PVPR  G+RP+ G  +  F G+PEPL+AR S   + FVIQP + ++
Sbjct: 81   LGQFVYIDRFDFDSPVPRVSGIRPIAGRHS--FVGTPEPLIARISASKKEFVIQPVADSE 138

Query: 362  AGPPLVSSSL----------RSDHLRQ----EKEKRTVFAAKENVV--STGKPRRFSSPN 493
                 ++  L          R+DH ++     K  R   A ++NV+   T   +RFSSP 
Sbjct: 139  YSVDPIAVYLSNNKKFDEFPRNDHNKKGEVTAKVTRQALAPRDNVMVDETATAKRFSSPA 198

Query: 494  ASK-----LSAKKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXXXXXKC 658
             +K      +AK+S + G      + RDPSP AK                        KC
Sbjct: 199  TAKRFSSPATAKRSVSVGKKNAALVERDPSPAAK--------------GKRSASPVPSKC 244

Query: 659  EVPSLVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRLSGGLKV 838
             VPSL+AAKEENR+VAREPAIIVPSRYRQPSP GRK   SP  RR S SP +RLS G+K+
Sbjct: 245  MVPSLLAAKEENRKVAREPAIIVPSRYRQPSPSGRK-QPSPNARRASISPGKRLS-GVKL 302

Query: 839  ASPAAGESXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAESKEKNGSKN 1018
             SPA  +S            I++VS+AL GS KS RK+WD+     VG+ E KEK  +K 
Sbjct: 303  -SPAVSDSVGKKKIANIVAGISKVSEALVGSAKSSRKNWDE-IPAAVGSGEMKEKGEAKK 360

Query: 1019 KVDIESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLLESEKPIMAA 1186
            K D+++ILRTQ A+SRRLSD    Q + + TSS EK + SS  E  L+++ P  AA
Sbjct: 361  KPDLQAILRTQAALSRRLSDANSRQSNQDETSSYEKTKPSSP-EGCLDNKNPTCAA 415


>gb|OMO80794.1| hypothetical protein CCACVL1_12741 [Corchorus capsularis]
          Length = 715

 Score =  305 bits (782), Expect = 8e-94
 Identities = 206/426 (48%), Positives = 249/426 (58%), Gaps = 30/426 (7%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            G+HRS +LQVIGIVPAL+ S   DLWP+HGFY+QLSDSVNSTYV+LSD D + ILS+R Q
Sbjct: 24   GDHRSPLLQVIGIVPALAGS---DLWPNHGFYVQLSDSVNSTYVSLSDRDTELILSNRLQ 80

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQ V+VDRF F  PVPR  G+RP+ G     F GSP+PL+AR S   R FVIQP S ++
Sbjct: 81   LGQFVYVDRFHFDSPVPRVSGIRPIAGRHA--FVGSPDPLIARISSSKRDFVIQPVSESE 138

Query: 362  AGPPLVSSSLRSDHLRQ---------------EKEK-RTVFAAKENV-----------VS 460
                 ++  L +  L Q               EK K R   A ++NV           V 
Sbjct: 139  YSVDPIAVYLSNKKLDQNQQLQTPNDSKDSKFEKSKTRQPLAPRDNVKVNENSESESKVP 198

Query: 461  TGKPRRFSSPNASKLSA---KKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXX 631
               P+RFSSP  +K S+   KK++   V       RDPSP AK                 
Sbjct: 199  EKPPQRFSSPATAKRSSSAVKKTNAAVVE------RDPSPAAK--------------GKR 238

Query: 632  XXXXXXXKCEVPSLVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPA 811
                   KC VPSLVAAKEENR+VAREPAIIVPSRYRQPSP GR+  ASP  RR S SP 
Sbjct: 239  SASPVPSKCVVPSLVAAKEENRKVAREPAIIVPSRYRQPSPNGRR-QASPSARRTSLSPG 297

Query: 812  RRLSGGLKVASPAAGESXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAE 991
            RRLSG LKV SP  G+S            I++VS+AL GS KS RKSWD+      G+ E
Sbjct: 298  RRLSGVLKV-SPVVGDS--KKKMATIVAGISKVSEALVGSAKSSRKSWDEQPE--KGSGE 352

Query: 992  SKEKNGSKNKVDIESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLLESEK 1171
             K+K   K+K D+++ILRTQ AISRRLSD    Q SN+  SS+ ++  +S  E  L SEK
Sbjct: 353  KKDKVSVKSKPDLQAILRTQAAISRRLSDVHS-QKSNDENSSSNEKPKASPPEDSLASEK 411

Query: 1172 PIMAAG 1189
               A G
Sbjct: 412  STCAGG 417


>ref|XP_002324965.2| hypothetical protein POPTR_0018s06270g [Populus trichocarpa]
 gb|PNS93154.1| hypothetical protein POPTR_018G072500v3 [Populus trichocarpa]
          Length = 698

 Score =  304 bits (779), Expect = 1e-93
 Identities = 194/416 (46%), Positives = 252/416 (60%), Gaps = 21/416 (5%)
 Frame = +2

Query: 2    GEHRSAVLQVIGIVPALSASTTDDLWPSHGFYLQLSDSVNSTYVTLSDADADAILSSRPQ 181
            G+HRS +LQVIGIVPAL+ S   DLWP+ GFY+QLSDS+NSTYV+LS+ D D IL++R Q
Sbjct: 24   GDHRSPLLQVIGIVPALAGS---DLWPNQGFYVQLSDSLNSTYVSLSERDTDLILTNRLQ 80

Query: 182  LGQLVHVDRFKFAQPVPRAVGLRPVPGARTQPFAGSPEPLVARSSPDHRGFVIQPASATD 361
            LGQ V++DRF F  PVPR  G+RP+ G  +  F G+PEPL+AR S   + FVIQP + ++
Sbjct: 81   LGQFVYIDRFDFDSPVPRVSGIRPIAGRHS--FVGTPEPLIARISASKKEFVIQPVADSE 138

Query: 362  AGPPLVSSSL----------RSDHLRQ----EKEKRTVFAAKENVV--STGKPRRFSSPN 493
                 ++  L          R+DH ++     K  R   A ++NV+   T   +RFSSP 
Sbjct: 139  YSVDPIAVYLSNNKKFDEFPRNDHNKKGEVTAKVTRQALAPRDNVMVDETATAKRFSSPA 198

Query: 494  ASK-----LSAKKSDTGGVNCTGDLPRDPSPRAKTIXXXXXXXXXXXXXXXXXXXXXXKC 658
             +K      +AK+S + G      + RDPSP AK                        KC
Sbjct: 199  TAKRFSSPATAKRSVSVGKKNAALVERDPSPAAK--------------GKRSASPVPSKC 244

Query: 659  EVPSLVAAKEENRRVAREPAIIVPSRYRQPSPVGRKGAASPMGRRGSTSPARRLSGGLKV 838
             VPSL+AAKEENR+VAREPAIIVPSRYRQPSP GRK   SP  RR S SP +RLS G+K+
Sbjct: 245  MVPSLLAAKEENRKVAREPAIIVPSRYRQPSPSGRK-QPSPNARRASISPGKRLS-GVKL 302

Query: 839  ASPAAGESXXXXXXXXXXXXIARVSDALAGSVKSIRKSWDDSSTCVVGAAESKEKNGSKN 1018
             SPA  +S            I++VS+AL GS KS RK+WD+     VG+ E KEK  +K 
Sbjct: 303  -SPAVSDSVGKKKIANIVAGISKVSEALVGSAKSSRKNWDE-IPAAVGSGEMKEKGEAKK 360

Query: 1019 KVDIESILRTQVAISRRLSDFGGIQISNEVTSSAEKRRTSSKIERLLESEKPIMAA 1186
            K D+++ILRTQ A+SRRLSD    Q + + TSS EK + SS  E  L+++ P  AA
Sbjct: 361  KPDLQAILRTQAALSRRLSDANSRQSNQDETSSYEKTKPSSP-EGCLDNKNPTCAA 415

