BLASTX nr result

ID: Sinomenium22_contig00026077 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00026077
         (1398 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245...   410   e-112
ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620...   401   e-109
ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620...   399   e-108
ref|XP_007034267.1| Uncharacterized protein isoform 1 [Theobroma...   396   e-107
ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Popu...   392   e-106
ref|XP_007225696.1| hypothetical protein PRUPE_ppa006350mg [Prun...   374   e-101
ref|XP_002518043.1| conserved hypothetical protein [Ricinus comm...   373   e-101
ref|XP_007034270.1| Uncharacterized protein isoform 4, partial [...   357   5e-96
ref|XP_007034272.1| Uncharacterized protein isoform 6 [Theobroma...   357   7e-96
ref|XP_007034271.1| Uncharacterized protein isoform 5 [Theobroma...   356   1e-95
ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Caps...   343   1e-91
ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arab...   339   2e-90
ref|XP_004133985.1| PREDICTED: uncharacterized protein LOC101211...   330   9e-88
ref|XP_004247873.1| PREDICTED: uncharacterized protein LOC101244...   329   2e-87
gb|EXB89377.1| hypothetical protein L484_017343 [Morus notabilis]     320   7e-85
ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana] ...   319   2e-84
ref|XP_006360976.1| PREDICTED: uncharacterized protein LOC102592...   319   2e-84
ref|XP_007034268.1| Uncharacterized protein isoform 2 [Theobroma...   314   5e-83
gb|EXC33904.1| hypothetical protein L484_012794 [Morus notabilis]     313   9e-83
ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, part...   310   1e-81

>ref|XP_002263384.1| PREDICTED: uncharacterized protein LOC100245254 [Vitis vinifera]
            gi|298205214|emb|CBI17273.3| unnamed protein product
            [Vitis vinifera]
          Length = 425

 Score =  410 bits (1054), Expect = e-112
 Identities = 220/414 (53%), Positives = 292/414 (70%), Gaps = 2/414 (0%)
 Frame = -1

Query: 1362 PSSEQLDLETIRSRVQALSEVLRTSKEFSELSPSESDKLLKECVIGLENRIEECMSEFSD 1183
            P++  +DL+TIRSR+  L+ +       S+ +P +S  L +E    L++R+ + +S++SD
Sbjct: 5    PAAGTMDLDTIRSRMSELNRIHTNYSHISDSNPLDSRSLFQEFSHHLQSRVNQILSQYSD 64

Query: 1182 FSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLK 1003
              +L  +DLD YL + K+ELNL+E+EN KI NEIE L+ +++EDS +LE DLE L  S+ 
Sbjct: 65   VESLEADDLDAYLGHLKKELNLVESENAKISNEIEALTRTYVEDSNQLESDLEVLKHSVD 124

Query: 1002 FIESQNRHKLEMGTDVAHSVPTGGSETFDRTHEDN-FKVLELDNQIEKNKVTLNSLQDLC 826
            F+ SQ   + E G  V +S            H DN F++L+L+ Q +KNK+TL SLQDL 
Sbjct: 125  FVASQGLKRAEAGALVDYSSSVEDQLDSRTAHGDNNFEILDLNYQTQKNKITLKSLQDLD 184

Query: 825  DILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAIDPPAVDH 646
               KRFEA+ +IED LTGLKVI+ EGNCIRLSL TF+PNLE LL  +K+E   +P  ++H
Sbjct: 185  YTFKRFEAIEKIEDALTGLKVIDFEGNCIRLSLSTFIPNLEGLLCEEKIEAVNEPSELNH 244

Query: 645  ELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAK-SRQFCPSLAVLEMGSSLEWLVRKXXX 469
            ELLIEV D +MELKNVEIFPNDV++GEI+D+AK SR+    +++LE  SSLEW VRK   
Sbjct: 245  ELLIEVMDQSMELKNVEIFPNDVYLGEIIDAAKSSRKLFSHMSILETRSSLEWFVRKVQD 304

Query: 468  XXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTCALKLIS 289
                       V  ANKSRHS EY DRDEI++AHMVGG++A+IK+ Q WP+   ALKL S
Sbjct: 305  KIILCALRQSIVKGANKSRHSLEYLDRDEIIVAHMVGGVDAYIKVCQGWPVSNNALKLKS 364

Query: 288  LKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRSELH 127
            LK+SD  S+ ISLSFLCKVEE+ANSLDV IR+N+ SFVDAIE ILV+QM+S+LH
Sbjct: 365  LKSSDQQSKGISLSFLCKVEEMANSLDVSIRKNISSFVDAIEEILVQQMQSKLH 418


>ref|XP_006493067.1| PREDICTED: uncharacterized protein LOC102620884 isoform X2 [Citrus
            sinensis]
          Length = 444

 Score =  401 bits (1031), Expect = e-109
 Identities = 221/420 (52%), Positives = 287/420 (68%), Gaps = 9/420 (2%)
 Frame = -1

Query: 1359 SSEQLDLETIRSRVQALSEVLRTSKEFSELS-PSESDKLLKECVIGLENRIEECMSEFSD 1183
            SS  LDL ++RS V+ L E+ R+  E    +  S+S+ LLKE     E++++E ++E++D
Sbjct: 19   SSSPLDLHSLRSEVKELMEIHRSGIEDEPNTVSSDSENLLKEYAHDFESKVKEIITEYAD 78

Query: 1182 FSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLK 1003
             S LGIEDLD YLE+ KEEL  +EAE+ KI NEIE L+ + +EDS  LE DLE LNC++ 
Sbjct: 79   VSFLGIEDLDAYLEHLKEELKTVEAESSKISNEIETLTRTQVEDSDRLESDLEELNCAID 138

Query: 1002 FIESQNRHKLEM------GTDVAHSVPTGGSETFDRTHEDN-FKVLELDNQIEKNKVTLN 844
             I S+N  +         G D      T       + HED+ F++LEL++QIEKNK+ LN
Sbjct: 139  LIVSENAKEDRQAVCPARGEDQVCPTHTEDQSDLIKIHEDHRFEILELESQIEKNKIILN 198

Query: 843  SLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAID 664
            SLQDL  +LKRF+AV QIED+LTGLKVI+ +G C RLS++T++P LE      K+E  I+
Sbjct: 199  SLQDLDFVLKRFDAVEQIEDSLTGLKVIDFDGKCFRLSMQTYIPTLEESSFQHKIEDVIE 258

Query: 663  PPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSLEWL 487
            P  V+HELLIEV DGTME+KNVE+FPNDV I ++VD+AKS RQ    L  LE  SSL+W 
Sbjct: 259  PSEVNHELLIEVIDGTMEIKNVEMFPNDVHISDLVDAAKSFRQSGTQLDSLETSSSLQWF 318

Query: 486  VRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTC 307
            +R               V  ANKSRH FEY +RDE+++AH+VGG++AFIK  Q WP+   
Sbjct: 319  IRNVQDRIILSTLRRFVVKTANKSRHFFEYFERDEMIVAHLVGGVDAFIKPSQGWPLSNS 378

Query: 306  ALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRSELH 127
             LK+ISLKNSD+HS+ ISLSF C+VEE ANSLDV IRQNL SFVD +E IL+ QMR ELH
Sbjct: 379  PLKVISLKNSDHHSKGISLSFFCRVEEAANSLDVHIRQNLSSFVDGVEKILLEQMRVELH 438


>ref|XP_006493066.1| PREDICTED: uncharacterized protein LOC102620884 isoform X1 [Citrus
            sinensis]
          Length = 447

 Score =  399 bits (1025), Expect = e-108
 Identities = 220/423 (52%), Positives = 286/423 (67%), Gaps = 12/423 (2%)
 Frame = -1

Query: 1359 SSEQLDLETIRSRVQALSEVLRTSKEFSELS-PSESDKLLKECVIGLENRIEECMSEFSD 1183
            SS  LDL ++RS V+ L E+ R+  E    +  S+S+ LLKE     E++++E ++E++D
Sbjct: 19   SSSPLDLHSLRSEVKELMEIHRSGIEDEPNTVSSDSENLLKEYAHDFESKVKEIITEYAD 78

Query: 1182 FSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLK 1003
             S LGIEDLD YLE+ KEEL  +EAE+ KI NEIE L+ + +EDS  LE DLE LNC++ 
Sbjct: 79   VSFLGIEDLDAYLEHLKEELKTVEAESSKISNEIETLTRTQVEDSDRLESDLEELNCAID 138

Query: 1002 FIESQNRHKLE---------MGTDVAHSVPTGGSETFDRTHEDN-FKVLELDNQIEKNKV 853
             I S+     +          G D      T       + HED+ F++LEL++QIEKNK+
Sbjct: 139  LIVSEGSQNAKEDRQAVCPARGEDQVCPTHTEDQSDLIKIHEDHRFEILELESQIEKNKI 198

Query: 852  TLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEY 673
             LNSLQDL  +LKRF+AV QIED+LTGLKVI+ +G C RLS++T++P LE      K+E 
Sbjct: 199  ILNSLQDLDFVLKRFDAVEQIEDSLTGLKVIDFDGKCFRLSMQTYIPTLEESSFQHKIED 258

Query: 672  AIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSL 496
             I+P  V+HELLIEV DGTME+KNVE+FPNDV I ++VD+AKS RQ    L  LE  SSL
Sbjct: 259  VIEPSEVNHELLIEVIDGTMEIKNVEMFPNDVHISDLVDAAKSFRQSGTQLDSLETSSSL 318

Query: 495  EWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPM 316
            +W +R               V  ANKSRH FEY +RDE+++AH+VGG++AFIK  Q WP+
Sbjct: 319  QWFIRNVQDRIILSTLRRFVVKTANKSRHFFEYFERDEMIVAHLVGGVDAFIKPSQGWPL 378

Query: 315  LTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRS 136
                LK+ISLKNSD+HS+ ISLSF C+VEE ANSLDV IRQNL SFVD +E IL+ QMR 
Sbjct: 379  SNSPLKVISLKNSDHHSKGISLSFFCRVEEAANSLDVHIRQNLSSFVDGVEKILLEQMRV 438

Query: 135  ELH 127
            ELH
Sbjct: 439  ELH 441


>ref|XP_007034267.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508713296|gb|EOY05193.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 430

 Score =  396 bits (1018), Expect = e-107
 Identities = 216/425 (50%), Positives = 289/425 (68%), Gaps = 4/425 (0%)
 Frame = -1

Query: 1386 MEELLESVPSSEQLDLETIRSRVQALSEVLRT--SKEFSELSPSESDKLLKECVIGLENR 1213
            M E +E   SSE LDL +IRSR+  LSE+ R   +K+  E     S+KLLK+C +  E++
Sbjct: 1    MAEPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESK 60

Query: 1212 IEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELER 1033
            +++ + E+SD   LGIEDLD YL + KEELN +EAE+ KI NEIE LS + IE+S  LE 
Sbjct: 61   VKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEG 120

Query: 1032 DLEGLNCSLKFIESQNRHKLEMGTDVAHSV-PTGGSETFDRTHEDNFKVLELDNQIEKNK 856
            +LEGL  +L  I SQ    +E    +  S+     S       E  F+++EL++QIEKN 
Sbjct: 121  NLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNN 180

Query: 855  VTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKME 676
            + L SLQDL  + KR + + QIED LTGLKVI  +GNCIRLSL+T++P LE LL  + +E
Sbjct: 181  IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240

Query: 675  YAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSS 499
               +P  ++HELL+E+ DGTME+KNVE+FPNDV++G+I+D+AKS RQ   +L V +  SS
Sbjct: 241  DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 300

Query: 498  LEWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWP 319
            LEW V K              V   NKSRHSFEY +RDE ++AH+VGGI+AFIKL Q WP
Sbjct: 301  LEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWP 360

Query: 318  MLTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMR 139
            +    LKL+S+K+SD+HSR ISLS LCK EE+ANSLD+ IRQNL +FVDA+E +L+ QMR
Sbjct: 361  LSKSPLKLLSIKSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLEQMR 420

Query: 138  SELHT 124
             +L +
Sbjct: 421  LDLQS 425


>ref|XP_002300157.1| hypothetical protein POPTR_0001s32530g [Populus trichocarpa]
            gi|222847415|gb|EEE84962.1| hypothetical protein
            POPTR_0001s32530g [Populus trichocarpa]
          Length = 429

 Score =  392 bits (1006), Expect = e-106
 Identities = 211/424 (49%), Positives = 294/424 (69%), Gaps = 7/424 (1%)
 Frame = -1

Query: 1374 LESVPSS--EQLDLETIRSRVQALSEVLR--TSKEFSELSPSESDKLLKECVIGLENRIE 1207
            +E  PS+  E L+L TIRSR+  L E+ R   +  FSE++ S+SD+L+K+    L +++ 
Sbjct: 1    MEISPSTTQESLNLNTIRSRINELEEIYRDCNADSFSEINSSDSDELMKDSAQQLVSKVS 60

Query: 1206 ECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDL 1027
            + ++E+SDFS LGIEDLD YL + KEEL+  EAE+ KI NEIE+L+ + +EDS+ELE DL
Sbjct: 61   QTVTEYSDFSFLGIEDLDAYLAHLKEELDAAEAESAKISNEIELLNRTCMEDSSELENDL 120

Query: 1026 EGLNCSLKFIESQNRHKLEMGTDVAHSVPTGGSET--FDRTHEDNFKVLELDNQIEKNKV 853
            E + CSL  I SQ   + E G +      +G +++   +   E+ F++L+LDNQIE++  
Sbjct: 121  EWMKCSLDLISSQRDREKEKGDEQMEHFSSGENQSNLINTNEENKFEILKLDNQIEESTR 180

Query: 852  TLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEY 673
             L S+QDL  + K ++A+ QIED L+GLKVIE +G CIRLSL+T++P  + +L  QK+E 
Sbjct: 181  ILKSMQDLDSVCKWYDAIEQIEDVLSGLKVIEFDGTCIRLSLRTYIPK-QDVLFLQKIEE 239

Query: 672  AIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSL 496
               P  ++HE LIEV +G+ME+K VE+FPND++IG+IVD+AKS RQ    LA++E  SSL
Sbjct: 240  TNVPYEINHEFLIEVTNGSMEIKKVEMFPNDIYIGDIVDAAKSFRQMFLHLALMETSSSL 299

Query: 495  EWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPM 316
            EW VRK                 A+ SR S EY DRDEI++AHMVGG++AF+++ Q WP+
Sbjct: 300  EWFVRKAQDRIIQSTLRRLVARSASTSRQSIEYLDRDEIIVAHMVGGVDAFMEVSQGWPI 359

Query: 315  LTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRS 136
                LKL+SLKNS++H++ ISL FLCKVEE ANSLDV  RQNL SFVD++E ILV QM  
Sbjct: 360  TNSPLKLVSLKNSNHHAKEISLGFLCKVEEAANSLDVHTRQNLSSFVDSVEKILVEQMHL 419

Query: 135  ELHT 124
            ELH+
Sbjct: 420  ELHS 423


>ref|XP_007225696.1| hypothetical protein PRUPE_ppa006350mg [Prunus persica]
            gi|462422632|gb|EMJ26895.1| hypothetical protein
            PRUPE_ppa006350mg [Prunus persica]
          Length = 416

 Score =  374 bits (961), Expect = e-101
 Identities = 208/424 (49%), Positives = 284/424 (66%), Gaps = 4/424 (0%)
 Frame = -1

Query: 1386 MEELLESVPSSEQLDLETIRSRVQALSEVLRTSKE--FSELSPSESDKLLKECVIGLENR 1213
            MEE  + +PSSE LDL TI+ +V+ L E++ + ++   SELSPS+SD L++ C + L++R
Sbjct: 1    MEE--DPIPSSEPLDLNTIQRQVRELEEIIESCRQDDASELSPSDSDDLIRNCGLLLQSR 58

Query: 1212 IEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELER 1033
            +E+ +SE SD   L  ++ + Y+   ++ELN +EAE+ K+ N IE L  +  ED   L  
Sbjct: 59   VEQIVSECSDVGLLEDQEFEAYVGRFEQELNSVEAESTKVSNGIEDLIRTHGEDFNRLGT 118

Query: 1032 DLEGLNCSLKFIESQNRHKLEMGTDVAHSVPTGGSETFD--RTHEDNFKVLELDNQIEKN 859
            DL  L CSL F+E ++  K ++G DV +     G +  D    + D F++LEL+NQIEKN
Sbjct: 119  DLAQLKCSLDFVEEKDLEKAKLGADVDYH--KCGKDLLDPMNVNADKFELLELENQIEKN 176

Query: 858  KVTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKM 679
             + L SLQDL   LK  +   QIED +TGLKVI  EGNC+RLSL+T++P LE L   +K+
Sbjct: 177  NIILKSLQDLECTLKWLDNTEQIEDAVTGLKVIAFEGNCVRLSLRTYIPKLEDLFSPKKV 236

Query: 678  EYAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKSRQFCPSLAVLEMGSS 499
              A +P  V+HELLIE+ +GTM L+NVEIFPNDV+I +I+D+AKS +           SS
Sbjct: 237  GDATEPSEVNHELLIELLEGTMGLRNVEIFPNDVYINDILDAAKSLR----------KSS 286

Query: 498  LEWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWP 319
            L+W V K              V + NKSRHS EY D+DE ++AH+VGG++AFIK+PQ WP
Sbjct: 287  LQWFVTKVQDRIVLCTMRRLVVKNENKSRHSLEYLDKDETVVAHVVGGVDAFIKVPQGWP 346

Query: 318  MLTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMR 139
            +L+  LKLI LK+SD HS+ ISLSFLC V+E+ANSL V+IRQ L SFVDAIE ILV QM 
Sbjct: 347  LLSSPLKLIYLKSSDQHSKGISLSFLCTVQELANSLAVRIRQTLSSFVDAIEKILVEQMC 406

Query: 138  SELH 127
            SE+H
Sbjct: 407  SEIH 410


>ref|XP_002518043.1| conserved hypothetical protein [Ricinus communis]
            gi|223542639|gb|EEF44176.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 415

 Score =  373 bits (958), Expect = e-101
 Identities = 206/412 (50%), Positives = 283/412 (68%), Gaps = 4/412 (0%)
 Frame = -1

Query: 1347 LDLETIRSRVQALSEVLRTSKEFSELSPSESDKLLKECVIGLENRIEECMSEFSDFSALG 1168
            LDL +I   ++ L E+       +E+  S SD++L++C + LE+++++ MSE SDF+ LG
Sbjct: 5    LDLNSIICGIKDLEEIYSGCNGDTEMLSSHSDQVLEDCALHLESKVQQIMSECSDFNFLG 64

Query: 1167 IEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLKFIESQ 988
            IEDLD ++E+ KEEL+   +E  KI  EIE L+ + +ED T LE D+E L CSL FI S+
Sbjct: 65   IEDLDAFVEHLKEELSTTMSETAKISTEIEALNRNHMEDFTRLESDIEMLKCSLDFISSK 124

Query: 987  NRHKLEMGTDVAHSVPTGGSETFDRTHED-NFKVLELDNQIEKNKVTLNSLQDLCDILKR 811
            +   +E   +VA       ++     H D  F++ +LD+QI K+K+ L SLQD   + KR
Sbjct: 125  D---VEKEKEVACREDLYSTDA----HRDYEFEISKLDDQIAKSKMILKSLQDFDSVFKR 177

Query: 810  FEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAIDPPAVDHELLIE 631
             +AV QIE+ L+GLKVIE +G+CIRLSL+T+LP L+ ++   K E   +P  V+HELLIE
Sbjct: 178  VDAVEQIEEALSGLKVIEFDGSCIRLSLRTYLPKLDDVMCQHKTEDTAEPSEVNHELLIE 237

Query: 630  VFDGTMELKNVEIFPNDVFIGEIVDSAKS--RQFCPS-LAVLEMGSSLEWLVRKXXXXXX 460
            V  GTMELKNVEIFPND++I +IVD+AKS  ++F  S L   E  SSL WLVRK      
Sbjct: 238  VVSGTMELKNVEIFPNDIYISDIVDAAKSFRKEFLYSALTESETRSSLGWLVRKVQDRII 297

Query: 459  XXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTCALKLISLKN 280
                    V  +NKSR+SFEY DRDE ++AH+VGG++AFIKL Q WP+    LKLISLK+
Sbjct: 298  QFTLRRLVVKSSNKSRYSFEYLDRDETVVAHLVGGVDAFIKLSQGWPVSRSPLKLISLKS 357

Query: 279  SDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRSELHT 124
            S++HS+ ISLSFLC+VEE+ NSLD+Q+R NL+SFV+ IE +LV QMR ELH+
Sbjct: 358  SNHHSKEISLSFLCRVEEVVNSLDIQMRLNLLSFVEVIEKLLVEQMRIELHS 409


>ref|XP_007034270.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
            gi|508713299|gb|EOY05196.1| Uncharacterized protein
            isoform 4, partial [Theobroma cacao]
          Length = 372

 Score =  357 bits (917), Expect = 5e-96
 Identities = 190/367 (51%), Positives = 254/367 (69%), Gaps = 2/367 (0%)
 Frame = -1

Query: 1218 NRIEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTEL 1039
            +++++ + E+SD   LGIEDLD YL + KEELN +EAE+ KI NEIE LS + IE+S  L
Sbjct: 1    SKVKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNIL 60

Query: 1038 ERDLEGLNCSLKFIESQNRHKLEMGTDVAHSV-PTGGSETFDRTHEDNFKVLELDNQIEK 862
            E +LEGL  +L  I SQ    +E    +  S+     S       E  F+++EL++QIEK
Sbjct: 61   EGNLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEK 120

Query: 861  NKVTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQK 682
            N + L SLQDL  + KR + + QIED LTGLKVI  +GNCIRLSL+T++P LE LL  + 
Sbjct: 121  NNIILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKT 180

Query: 681  MEYAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMG 505
            +E   +P  ++HELL+E+ DGTME+KNVE+FPNDV++G+I+D+AKS RQ   +L V +  
Sbjct: 181  IEDISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQ 240

Query: 504  SSLEWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQS 325
            SSLEW V K              V   NKSRHSFEY +RDE ++AH+VGGI+AFIKL Q 
Sbjct: 241  SSLEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQG 300

Query: 324  WPMLTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQ 145
            WP+    LKL+S+K+SD+HSR ISLS LCK EE+ANSLD+ IRQNL +FVDA+E +L+ Q
Sbjct: 301  WPLSKSPLKLLSIKSSDHHSRGISLSLLCKAEEMANSLDMHIRQNLSAFVDAVEKLLLEQ 360

Query: 144  MRSELHT 124
            MR +L +
Sbjct: 361  MRLDLQS 367


>ref|XP_007034272.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508713301|gb|EOY05198.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 432

 Score =  357 bits (916), Expect = 7e-96
 Identities = 196/391 (50%), Positives = 261/391 (66%), Gaps = 4/391 (1%)
 Frame = -1

Query: 1386 MEELLESVPSSEQLDLETIRSRVQALSEVLRT--SKEFSELSPSESDKLLKECVIGLENR 1213
            M E +E   SSE LDL +IRSR+  LSE+ R   +K+  E     S+KLLK+C +  E++
Sbjct: 1    MAEPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESK 60

Query: 1212 IEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELER 1033
            +++ + E+SD   LGIEDLD YL + KEELN +EAE+ KI NEIE LS + IE+S  LE 
Sbjct: 61   VKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEG 120

Query: 1032 DLEGLNCSLKFIESQNRHKLEMGTDVAHSV-PTGGSETFDRTHEDNFKVLELDNQIEKNK 856
            +LEGL  +L  I SQ    +E    +  S+     S       E  F+++EL++QIEKN 
Sbjct: 121  NLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNN 180

Query: 855  VTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKME 676
            + L SLQDL  + KR + + QIED LTGLKVI  +GNCIRLSL+T++P LE LL  + +E
Sbjct: 181  IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240

Query: 675  YAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSS 499
               +P  ++HELL+E+ DGTME+KNVE+FPNDV++G+I+D+AKS RQ   +L V +  SS
Sbjct: 241  DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 300

Query: 498  LEWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWP 319
            LEW V K              V   NKSRHSFEY +RDE ++AH+VGGI+AFIKL Q WP
Sbjct: 301  LEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWP 360

Query: 318  MLTCALKLISLKNSDNHSRVISLSFLCKVEE 226
            +    LKL+S+K+SD+HSR ISLS LCK EE
Sbjct: 361  LSKSPLKLLSIKSSDHHSRGISLSLLCKAEE 391


>ref|XP_007034271.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508713300|gb|EOY05197.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 392

 Score =  356 bits (914), Expect = 1e-95
 Identities = 195/392 (49%), Positives = 261/392 (66%), Gaps = 4/392 (1%)
 Frame = -1

Query: 1386 MEELLESVPSSEQLDLETIRSRVQALSEVLRT--SKEFSELSPSESDKLLKECVIGLENR 1213
            M E +E   SSE LDL +IRSR+  LSE+ R   +K+  E     S+KLLK+C +  E++
Sbjct: 1    MAEPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESK 60

Query: 1212 IEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELER 1033
            +++ + E+SD   LGIEDLD YL + KEELN +EAE+ KI NEIE LS + IE+S  LE 
Sbjct: 61   VKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEG 120

Query: 1032 DLEGLNCSLKFIESQNRHKLEMGTDVAHSV-PTGGSETFDRTHEDNFKVLELDNQIEKNK 856
            +LEGL  +L  I SQ    +E    +  S+     S       E  F+++EL++QIEKN 
Sbjct: 121  NLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNN 180

Query: 855  VTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKME 676
            + L SLQDL  + KR + + QIED LTGLKVI  +GNCIRLSL+T++P LE LL  + +E
Sbjct: 181  IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240

Query: 675  YAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSS 499
               +P  ++HELL+E+ DGTME+KNVE+FPNDV++G+I+D+AKS RQ   +L V +  SS
Sbjct: 241  DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 300

Query: 498  LEWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWP 319
            LEW V K              V   NKSRHSFEY +RDE ++AH+VGGI+AFIKL Q WP
Sbjct: 301  LEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQGWP 360

Query: 318  MLTCALKLISLKNSDNHSRVISLSFLCKVEEI 223
            +    LKL+S+K+SD+HSR ISLS LCK E +
Sbjct: 361  LSKSPLKLLSIKSSDHHSRGISLSLLCKAERV 392


>ref|XP_006297761.1| hypothetical protein CARUB_v10013795mg [Capsella rubella]
            gi|482566470|gb|EOA30659.1| hypothetical protein
            CARUB_v10013795mg [Capsella rubella]
          Length = 420

 Score =  343 bits (880), Expect = 1e-91
 Identities = 188/410 (45%), Positives = 265/410 (64%), Gaps = 2/410 (0%)
 Frame = -1

Query: 1347 LDLETIRSRVQALSEVLRTSK-EFSELSPSESDKLLKECVIGLENRIEECMSEFSDFSAL 1171
            LDL+ IRSRV+ L  + R  K E  E   S+S+ L+++ V+  E ++ E + ++SD   L
Sbjct: 10   LDLQQIRSRVKELESIHRNCKYEPGESCTSDSENLVQDFVLQFETKVNEIVEDYSDVDIL 69

Query: 1170 GIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLKFIES 991
             +ED D YLE  ++EL+ +EAE+ K+  EIE LS S  EDS+ LERDLEGL  SL  + S
Sbjct: 70   DVEDSDAYLEYLRKELHSVEAESAKVSEEIERLSRSHAEDSSRLERDLEGLLLSLDSMSS 129

Query: 990  QNRHKLEMGTDVAHSVPTGGSETFDRTHEDNFKVLELDNQIEKNKVTLNSLQDLCDILKR 811
            Q+ +K +       S+     E  +   +D FK+ EL+NQ+E+ ++ L SL+DL  + KR
Sbjct: 130  QDVNKSKESPPSCSSM-----EVCEVNDDDKFKMFELENQMEEKRMILKSLEDLDSLRKR 184

Query: 810  FEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAIDPPAVDHELLIE 631
            F+A  Q+ED LTGLKV+E +GN IRL L+T++P L+ L    K E+   P  + HELLI 
Sbjct: 185  FDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIPELDGLPAQHKFEHTTKPSELIHELLIY 244

Query: 630  VFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSLEWLVRKXXXXXXXX 454
            + D T E+  +E+FPNDV+IG+I+++A S RQ     AVL+  SS++W+V K        
Sbjct: 245  LKDKTTEITKLEMFPNDVYIGDIIEAADSFRQVRLHSAVLDTRSSVQWVVAKVQDRIITT 304

Query: 453  XXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTCALKLISLKNSD 274
                  V  +   RH+F+Y D+DE ++AH+ GGI+AF+K+   WP+L   LKL SLKNSD
Sbjct: 305  TLRKYIVTSSKTMRHTFKYYDKDETIVAHIAGGIDAFLKVSDGWPLLNSPLKLASLKNSD 364

Query: 273  NHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRSELHT 124
            N S+ ISLS +CKVEE+ANSLD+Q RQNL  F+DAIE ILV Q R EL +
Sbjct: 365  NQSKGISLSLICKVEELANSLDLQTRQNLSGFIDAIEKILVHQTREELQS 414


>ref|XP_002885604.1| hypothetical protein ARALYDRAFT_342541 [Arabidopsis lyrata subsp.
            lyrata] gi|297331444|gb|EFH61863.1| hypothetical protein
            ARALYDRAFT_342541 [Arabidopsis lyrata subsp. lyrata]
          Length = 421

 Score =  339 bits (870), Expect = 2e-90
 Identities = 187/408 (45%), Positives = 266/408 (65%), Gaps = 2/408 (0%)
 Frame = -1

Query: 1347 LDLETIRSRVQALSEVLRTSK-EFSELSPSESDKLLKECVIGLENRIEECMSEFSDFSAL 1171
            LDL+ IRSRV+ L  + R  + E  E   S+S+ L+++ V+  E +++E + ++SD   L
Sbjct: 10   LDLQEIRSRVKELEFIHRNCRDEPGESCSSDSETLVQDFVLQFEPKVKEIVEDYSDVDLL 69

Query: 1170 GIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLKFIES 991
             +ED D YLE  ++EL  +EAE+ K+  EIE LS S  +DS+ LERDLEGL  SL  + S
Sbjct: 70   DVEDSDAYLEYLRKELQSVEAESAKVSEEIERLSKSHAQDSSRLERDLEGLLLSLDSMSS 129

Query: 990  QNRHKLEMGTDVAHSVPTGGSETFDRTHEDNFKVLELDNQIEKNKVTLNSLQDLCDILKR 811
            Q+  K +     + S+     E  +   +D FK+ EL+NQ+E+ +  L SL+DL  + KR
Sbjct: 130  QDVEKSKENQPSSSSM-----EVCEVNDDDKFKMFELENQMEEKRSILKSLEDLDSLRKR 184

Query: 810  FEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAIDPPAVDHELLIE 631
            F+A  Q+ED LTGLKV+E +GN IRL L+T++P L+SLL  QK E+  +P  + HELLI 
Sbjct: 185  FDAAEQVEDALTGLKVLEFDGNFIRLQLQTYIPKLDSLLGQQKFEHTTEPSELIHELLIY 244

Query: 630  VFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSLEWLVRKXXXXXXXX 454
            + D T E+   E+FPNDV+IG+I+++A S RQ     AVL+  SS++W+V K        
Sbjct: 245  LKDKTTEITKFEMFPNDVYIGDIIEAADSFRQVSLHSAVLDTRSSVQWVVAKVQDRIISS 304

Query: 453  XXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTCALKLISLKNSD 274
                  V  +   RH+FEY ++DE ++ H+ GGI+AF+K+   WP+L   LKL SLKNSD
Sbjct: 305  TLRKYLVTSSKTIRHTFEYYEKDETIVGHIAGGIDAFLKVSNGWPLLNTPLKLESLKNSD 364

Query: 273  NHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRSEL 130
            N S+ ISLS +CKVE++ANSLD+Q RQNL  F+DAIE ILV+Q R EL
Sbjct: 365  NQSKGISLSLICKVEDLANSLDLQTRQNLSGFMDAIEKILVQQTREEL 412


>ref|XP_004133985.1| PREDICTED: uncharacterized protein LOC101211137 [Cucumis sativus]
            gi|449527675|ref|XP_004170835.1| PREDICTED:
            uncharacterized protein LOC101229419 [Cucumis sativus]
          Length = 414

 Score =  330 bits (846), Expect = 9e-88
 Identities = 197/422 (46%), Positives = 269/422 (63%), Gaps = 3/422 (0%)
 Frame = -1

Query: 1386 MEELLESVPS-SEQLDLETIRSRVQALSEVLRTSKEFSELSPSESDKLLKECVIGLENRI 1210
            M E +E+ PS    LDL+ +RS ++ L   L  ++E S      S+KLL+EC + LE+RI
Sbjct: 2    MPESMEATPSVPPSLDLQAVRSELEELQRSLEENEE-STTDSLGSEKLLRECALHLESRI 60

Query: 1209 EECMSEFSDF-SALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELER 1033
            ++ +SE+S+  S LGI+DLD Y+E+ KEEL  +EAE+ KI NEIEVL  + IEDS +L+ 
Sbjct: 61   QQVLSEYSNVDSFLGIDDLDAYVEHMKEELVAVEAESSKISNEIEVLKRTNIEDSNKLKM 120

Query: 1032 DLEGLNCSLKFIESQNRHKLEMGTDVAHSVPTGGSETFDRTHEDN-FKVLELDNQIEKNK 856
            DLE L  SL    SQ+    E  T    S+            E N F+VLEL++QIEKNK
Sbjct: 121  DLEVLKLSLDRFPSQDP---EEATFNCSSMNGEDPMNVIVNRECNAFEVLELESQIEKNK 177

Query: 855  VTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKME 676
              L SLQ++ +I K  + + Q+E T+ G+KVI++  N IRLSL T +PN+E     Q++E
Sbjct: 178  KILKSLQEVDEIFKSLDVIEQVEGTIGGMKVIDVADNSIRLSLHTHIPNVEDFSTLQRLE 237

Query: 675  YAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKSRQFCPSLAVLEMGSSL 496
              I+   +DHEL+IEV DGTMELKN EIFP DV + +I++++KS             SSL
Sbjct: 238  GLIEKSELDHELIIEVLDGTMELKNAEIFPADVHLHDIINASKSIS----------NSSL 287

Query: 495  EWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPM 316
            EW VRK              V  ANKS HSFEY D+DE+++  M+GGI+A IK+ Q WP+
Sbjct: 288  EWFVRKVQDRIVLCTLRRFAVKSANKSCHSFEYLDQDEMIMCSMIGGIDACIKVSQGWPL 347

Query: 315  LTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRS 136
                LKLISLK+SD++++ +SLS +CKVE++ANSLD  IR+NL SF DA+E IL  QM  
Sbjct: 348  ADSPLKLISLKSSDHYTKGVSLSLICKVEKMANSLDAHIRRNLSSFADAVEKILKEQMHL 407

Query: 135  EL 130
            EL
Sbjct: 408  EL 409


>ref|XP_004247873.1| PREDICTED: uncharacterized protein LOC101244321 [Solanum
            lycopersicum]
          Length = 415

 Score =  329 bits (843), Expect = 2e-87
 Identities = 183/403 (45%), Positives = 259/403 (64%), Gaps = 2/403 (0%)
 Frame = -1

Query: 1344 DLETIRSRVQALSEVLRTSKEFSELSPSESDKLLKECVIGLENRIEECMSEFSDFSALGI 1165
            D +++R  +Q L ++ R+ +E  E    E  K L++C +  E+++E+ + + S+ +    
Sbjct: 8    DADSLRREIQELRDIQRSVEE-PEAFGLELKKSLEDCTLQFESKVEQLLCDASEVNFSSD 66

Query: 1164 EDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLKFIESQN 985
            +DLD +    K EL+  EA+N KI +EIE LS  ++E  ++L  ++EGL+C L+ IES  
Sbjct: 67   QDLDEFWNYLKNELSTEEAKNAKIADEIEGLSREYVEGYSKLVNEVEGLSCLLELIESLG 126

Query: 984  RHKLEMGTDVAHSVPTGGSETFDRTH-EDNFKVLELDNQIEKNKVTLNSLQDLCDILKRF 808
              +    T+   S P            E NFK+ EL NQ+EK+K+ L SL++L     RF
Sbjct: 127  IEQGRALTNFPCSTPGEDKGNLSSAPVEHNFKIFELGNQLEKSKLNLESLEELESTFNRF 186

Query: 807  EAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAIDPPAVDHELLIEV 628
            EA+ +IED  +GLK+++ EGN IRLSL+TF+PNLE+LL +Q +  A +PP  +HELLIE+
Sbjct: 187  EAIEKIEDAFSGLKIVQFEGNRIRLSLRTFIPNLENLLHNQTIGVA-EPPEQNHELLIEL 245

Query: 627  FDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSLEWLVRKXXXXXXXXX 451
             DGTMELK+VEIFPNDV I EI D+AKS RQ    + VLE  SSLEWLV++         
Sbjct: 246  VDGTMELKHVEIFPNDVSISEITDTAKSLRQVYFPVGVLENRSSLEWLVKRVQDRIILST 305

Query: 450  XXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTCALKLISLKNSDN 271
                 V  AN SRHSF+Y +R+E ++AHMVGGI+AF+KLPQ WP+    L L+SLK+S  
Sbjct: 306  LRRFLVKSANSSRHSFDYVEREETIVAHMVGGIDAFVKLPQGWPLTCSGLTLMSLKSSSQ 365

Query: 270  HSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQM 142
            +S+ ISL+ LCKV E ANSLD   RQ +  F D +E IL++QM
Sbjct: 366  YSQQISLTLLCKVAEAANSLDTNARQTISGFTDRVEEILMQQM 408


>gb|EXB89377.1| hypothetical protein L484_017343 [Morus notabilis]
          Length = 550

 Score =  320 bits (821), Expect = 7e-85
 Identities = 188/427 (44%), Positives = 263/427 (61%), Gaps = 8/427 (1%)
 Frame = -1

Query: 1386 MEELLESVPSSEQ---LDLETIRSRVQALSEVLRTSKEF-SELSPSESDKLLKECVIGLE 1219
            ME  +E VP S +   LDL+TIRSR + L E+L + ++  SEL  S+ +KL+K+C +  +
Sbjct: 134  MENAMEIVPPSSEHLDLDLDTIRSRAKELEEMLSSLEDNDSELFHSDLEKLVKDCALKFQ 193

Query: 1218 NRIEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTEL 1039
            +R+EE  SE+SD S L  +D D  LE+  EELNL+EAEN ++  EIE+L+ ++ EDS +L
Sbjct: 194  SRMEEIGSEWSDVSFLEDKDFDACLEHLGEELNLVEAENSRMSEEIEILTRTYAEDSNQL 253

Query: 1038 ERDLEGLNCSLKFIESQNRHKLEMGTDVAHSVPTGGSETFDRTHEDN----FKVLELDNQ 871
            E +LEGL  ++     Q+    ++G            + + R  ED       +LEL+N+
Sbjct: 254  EIELEGLKSAMDLTALQDLENAKLGA----------CDDYPRNTEDKQHLVLHLLELENE 303

Query: 870  IEKNKVTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLR 691
            I+K  + L SL+DL  I K F+A+ QIED LT +KVI LE NCIR SL+T++PNLES+L 
Sbjct: 304  IKKKNIILKSLEDLDGICKWFDAIEQIEDILTSVKVIALEENCIRFSLQTYIPNLESILS 363

Query: 690  HQKMEYAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKSRQFCPSLAVLE 511
             Q +E    P  V  ELLIE+ + T++ KN EIFPNDV+I  I ++AK    C       
Sbjct: 364  QQTIEAVNVPFEVKLELLIELLEWTLDQKNAEIFPNDVYINNISNAAKCFSKC------- 416

Query: 510  MGSSLEWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLP 331
               SL+W V K              V  ANKS +S EY D+DE+M+AH+ GG++AFIK+ 
Sbjct: 417  ---SLQWFVTKVQDRIVSCTMRQLVVKSANKSGYSLEYFDKDEVMVAHLAGGVDAFIKVS 473

Query: 330  QSWPMLTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILV 151
            Q WP+    LKL SLK+SD++++ I   FLCKVEE  NSL V I  NL SFVDA++ IL 
Sbjct: 474  QGWPLSNSPLKLTSLKSSDHNTKGIPSIFLCKVEERVNSLAVHICHNLSSFVDAVDKILT 533

Query: 150  RQMRSEL 130
             Q + E+
Sbjct: 534  EQKQLEI 540


>ref|NP_189033.1| uncharacterized protein [Arabidopsis thaliana]
            gi|1742965|emb|CAA70756.1| HAPp48,5 protein [Arabidopsis
            thaliana] gi|9294659|dbj|BAB03008.1| HAPp48,5 protein
            [Arabidopsis thaliana] gi|20259510|gb|AAM13875.1|
            putative HAPp48,5 protein [Arabidopsis thaliana]
            gi|21436469|gb|AAM51435.1| putative HAPp48,5 protein
            [Arabidopsis thaliana] gi|332643310|gb|AEE76831.1|
            uncharacterized protein AT3G23910 [Arabidopsis thaliana]
          Length = 421

 Score =  319 bits (818), Expect = 2e-84
 Identities = 178/420 (42%), Positives = 258/420 (61%), Gaps = 3/420 (0%)
 Frame = -1

Query: 1374 LESVPSSEQLDLETIRSRVQALSEVLRTSKE--FSELSPSESDKLLKECVIGLENRIEEC 1201
            +E       LDL+ IR RV+ L    R  +E      S      ++++ V+  E +++E 
Sbjct: 1    MEEETHDGSLDLQEIRRRVKELDFFPRNCREEPVESCSSDYETLVVQDFVLQFEPKVKEI 60

Query: 1200 MSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEG 1021
            + E+ D   L +ED D YLE  + EL  +EAE+ K+  EIE LS S  +DS+ L+RDLEG
Sbjct: 61   VEEYGDVDLLDVEDSDAYLEYLRNELQSVEAESAKVSEEIERLSQSHAQDSSRLQRDLEG 120

Query: 1020 LNCSLKFIESQNRHKLEMGTDVAHSVPTGGSETFDRTHEDNFKVLELDNQIEKNKVTLNS 841
            L  SL  + SQ+  K +     + S+     E  +   +D FK+ EL+NQ+E+ ++ L S
Sbjct: 121  LLLSLDSMSSQDVEKSKENQPSSSSM-----EVCEVIDDDKFKMFELENQMEEKRMILKS 175

Query: 840  LQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAIDP 661
            L+DL  + KRF+A  Q+ED LTGLKV+E +GN IRL L+T++  L+  L   K ++  +P
Sbjct: 176  LEDLDSLRKRFDAAEQVEDALTGLKVLEFDGNFIRLQLRTYIQKLDGFLGQHKFDHITEP 235

Query: 660  PAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSLEWLV 484
              + HELLI + D T E+   E+FPND++IG+I+++A S RQ     AVL+  SS++W+V
Sbjct: 236  SELIHELLIYLKDKTTEITKFEMFPNDIYIGDIIEAADSFRQVRLHSAVLDTRSSVQWVV 295

Query: 483  RKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTCA 304
             K              V  +   R++FEY D+DE ++AH+ GGI+AF+K+   WP+L   
Sbjct: 296  AKVQDKIISTTLRKYIVMSSKTIRYTFEYYDKDETIVAHIAGGIDAFLKVSDGWPLLNTP 355

Query: 303  LKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRSELHT 124
            LKL SLKNSDN S+ ISLS +CKVEE+ANSLD++ RQNL  F+DAIE ILV Q R EL +
Sbjct: 356  LKLASLKNSDNQSKGISLSLICKVEELANSLDLETRQNLSGFMDAIEKILVEQTREELQS 415


>ref|XP_006360976.1| PREDICTED: uncharacterized protein LOC102592291 [Solanum tuberosum]
          Length = 428

 Score =  319 bits (817), Expect = 2e-84
 Identities = 184/416 (44%), Positives = 257/416 (61%), Gaps = 15/416 (3%)
 Frame = -1

Query: 1344 DLETIRSRVQALSEVLRTSKEFSELSPSESDKLLKECVIGLENRIEECMSEFSDFSALGI 1165
            D+++ R  +Q L ++ R+ +E  E    E  K L++C +  E ++E+ + + S+ S    
Sbjct: 8    DVDSFRREIQELRDIQRSVEE-PEAFGLELKKSLEDCTLQFERKVEQILCDASEISFSSD 66

Query: 1164 EDLDT-------------YLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLE 1024
            +DL               + +  K EL+  EA N KI +EIE LS  ++E  ++L  ++E
Sbjct: 67   QDLGRKKAVHIFFFPPYEFWKYLKNELSTEEANNAKIADEIEGLSREYVEGYSKLVNEIE 126

Query: 1023 GLNCSLKFIESQNRHKLEMGTDVAHSVPTGGSETFDRTH-EDNFKVLELDNQIEKNKVTL 847
            GL+C L+ IES    +  + T+   S P            E NFKV EL NQ+EK+K+ L
Sbjct: 127  GLSCPLELIESLGLEQGRVLTNFPCSTPGEDKGNVSSAPVEQNFKVFELGNQLEKSKLNL 186

Query: 846  NSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAI 667
             SL++L     RFEA+ +IED  +GLK++E EGN IRLSL+TF+PNLE+LL +Q ++ A 
Sbjct: 187  KSLEELESTFNRFEAIEKIEDAFSGLKIVEFEGNRIRLSLRTFIPNLENLLHNQTIDVA- 245

Query: 666  DPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSLEW 490
            +PP  +HELLIE+ DGTMELK+VEIFPNDV I  I D+AKS RQ    + VLE  SSLEW
Sbjct: 246  EPPEQNHELLIELMDGTMELKHVEIFPNDVSISYITDTAKSLRQVYFPVGVLENRSSLEW 305

Query: 489  LVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLT 310
             V+               V  AN SRHSF+Y DR+E ++AHMVGGI+AFIKLPQ WP+ +
Sbjct: 306  FVKGVQDRIVLSTLRRFLVKSANSSRHSFDYVDREETIVAHMVGGIDAFIKLPQGWPLTS 365

Query: 309  CALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQM 142
              L L+SLK+S  +S+ ISL+ LCKV E+AN LD   RQ +  F D +E IL++QM
Sbjct: 366  SGLTLMSLKSSSQYSQQISLTLLCKVAEVANLLDTNERQTISGFTDRVEEILMQQM 421


>ref|XP_007034268.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|590656431|ref|XP_007034269.1| Uncharacterized protein
            isoform 2 [Theobroma cacao] gi|508713297|gb|EOY05194.1|
            Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508713298|gb|EOY05195.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 369

 Score =  314 bits (805), Expect = 5e-83
 Identities = 175/357 (49%), Positives = 235/357 (65%), Gaps = 4/357 (1%)
 Frame = -1

Query: 1386 MEELLESVPSSEQLDLETIRSRVQALSEVLRT--SKEFSELSPSESDKLLKECVIGLENR 1213
            M E +E   SSE LDL +IRSR+  LSE+ R   +K+  E     S+KLLK+C +  E++
Sbjct: 1    MAEPMEISSSSEALDLHSIRSRINELSEIHRIDKNKDEGEALSLNSEKLLKDCSLHFESK 60

Query: 1212 IEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELER 1033
            +++ + E+SD   LGIEDLD YL + KEELN +EAE+ KI NEIE LS + IE+S  LE 
Sbjct: 61   VKQIIEEYSDVGFLGIEDLDEYLAHLKEELNQVEAESAKISNEIEDLSRNHIEESNILEG 120

Query: 1032 DLEGLNCSLKFIESQNRHKLEMGTDVAHSV-PTGGSETFDRTHEDNFKVLELDNQIEKNK 856
            +LEGL  +L  I SQ    +E    +  S+     S       E  F+++EL++QIEKN 
Sbjct: 121  NLEGLKYALDSIASQGMEGVEEDPCLDSSMNDEDQSNLMHSNEEQKFEIMELESQIEKNN 180

Query: 855  VTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKME 676
            + L SLQDL  + KR + + QIED LTGLKVI  +GNCIRLSL+T++P LE LL  + +E
Sbjct: 181  IILKSLQDLDSMFKRLDTLEQIEDALTGLKVIGFDGNCIRLSLQTYIPKLEGLLCQKTIE 240

Query: 675  YAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSS 499
               +P  ++HELL+E+ DGTME+KNVE+FPNDV++G+I+D+AKS RQ   +L V +  SS
Sbjct: 241  DISEPSEMNHELLVEIVDGTMEIKNVEMFPNDVYLGDIIDAAKSFRQLSSNLTVQQTQSS 300

Query: 498  LEWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQ 328
            LEW V K              V   NKSRHSFEY +RDE ++AH+VGGI+AFIKL Q
Sbjct: 301  LEWFVGKVQDRIILSTLRRFIVKSTNKSRHSFEYLERDETIVAHLVGGIDAFIKLSQ 357


>gb|EXC33904.1| hypothetical protein L484_012794 [Morus notabilis]
          Length = 412

 Score =  313 bits (803), Expect = 9e-83
 Identities = 187/422 (44%), Positives = 261/422 (61%), Gaps = 3/422 (0%)
 Frame = -1

Query: 1386 MEELLESVP-SSEQLDLETIRSRVQALSEVLRTSKEF-SELSPSESDKLLKECVIGLENR 1213
            ME  +E VP SSE LDL+TIRSR + L E+L + ++  SEL  S+ +KL+K+C +  ++R
Sbjct: 1    MENAMEIVPPSSEHLDLDTIRSRAKELEEMLSSLEDNDSELFHSDLEKLVKDCALKFQSR 60

Query: 1212 IEECMSEFSDFSALGIEDLDTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELER 1033
            +EE  SE+SD S L  +  D  LE+  EELNL+EAEN  +  +IEVL+ ++ EDS +LE 
Sbjct: 61   MEEIGSEWSDVSFLEDKGFDACLEHLGEELNLVEAENSIMSEKIEVLTRTYAEDSNQLEI 120

Query: 1032 DLEGLNCSLKFIESQNRHKLEMGTDVAHSVPTGGSETFDRTHEDN-FKVLELDNQIEKNK 856
            +LEGL   +     Q+    ++G            + + R  ED    +LEL+ +I++  
Sbjct: 121  ELEGLKNVMDLTALQDLGNAKLGA----------CDDYPRNTEDKQHSLLELEKEIKQKN 170

Query: 855  VTLNSLQDLCDILKRFEAVRQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKME 676
            + L SL+DL  I K F+A+ QIED LTG+KVI LE NCIR SL+T++PNLES L  Q +E
Sbjct: 171  IILKSLEDLDGICKWFDAIEQIEDILTGVKVIALEENCIRFSLQTYIPNLESFLLQQTIE 230

Query: 675  YAIDPPAVDHELLIEVFDGTMELKNVEIFPNDVFIGEIVDSAKSRQFCPSLAVLEMGSSL 496
                P  V HELLIE+ + T++ KNVEIFPNDV++  I ++AK    C          SL
Sbjct: 231  AVNVPFEVKHELLIELLEWTLDQKNVEIFPNDVYLNNISNAAKDFSKC----------SL 280

Query: 495  EWLVRKXXXXXXXXXXXXXXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPM 316
            +W V K              V  AN S +S EY D+DE+M+AH+ GG++AFIK+ Q WP+
Sbjct: 281  QWFVTKVQDRIVSCTMRQLVVKSANTSGYSLEYFDKDEVMVAHLAGGVDAFIKVSQGWPL 340

Query: 315  LTCALKLISLKNSDNHSRVISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRS 136
                LKL SLK+SD++++ I   FL KV+E  NSL V I QNL SFVDA++ IL  Q + 
Sbjct: 341  SNSPLKLTSLKSSDHNTKGIPSIFLFKVKERVNSLAVHICQNLSSFVDAVDKILTEQKQL 400

Query: 135  EL 130
            E+
Sbjct: 401  EI 402


>ref|XP_006418827.1| hypothetical protein EUTSA_v10002763mg, partial [Eutrema salsugineum]
            gi|557096755|gb|ESQ37263.1| hypothetical protein
            EUTSA_v10002763mg, partial [Eutrema salsugineum]
          Length = 355

 Score =  310 bits (793), Expect = 1e-81
 Identities = 168/346 (48%), Positives = 229/346 (66%), Gaps = 2/346 (0%)
 Frame = -1

Query: 1155 DTYLENAKEELNLIEAENLKIYNEIEVLSGSFIEDSTELERDLEGLNCSLKFIESQNRHK 976
            D YLE  ++EL+ +EAE+ K+  EIE LS S  EDS+ L+RDLEGL  SL F+ SQ   K
Sbjct: 5    DAYLEYLRKELHSVEAESAKVSEEIERLSSSHAEDSSRLDRDLEGLLLSLDFLSSQEVQK 64

Query: 975  LEMGTDVAHSVPTGGSETF-DRTHEDNFKVLELDNQIEKNKVTLNSLQDLCDILKRFEAV 799
             +       S+    + T+ D   ++ FK+ EL+NQIE+ +  L SL++L  + KRF+A 
Sbjct: 65   SKENPPSTSSMERCDASTWIDVNDDEKFKMFELENQIEEKRRILKSLENLDSVCKRFDAA 124

Query: 798  RQIEDTLTGLKVIELEGNCIRLSLKTFLPNLESLLRHQKMEYAIDPPAVDHELLIEVFDG 619
             Q+ED LTGLKV+E +GN IRL L+T++P L+ LL   K+ +  +P  + HELLI++ D 
Sbjct: 125  EQVEDALTGLKVLEFDGNFIRLQLRTYIPKLDGLLGQHKLLHNTEPSELIHELLIDLKDK 184

Query: 618  TMELKNVEIFPNDVFIGEIVDSAKS-RQFCPSLAVLEMGSSLEWLVRKXXXXXXXXXXXX 442
            T E+  VE+ PNDV+IG+I D+A S RQ     A+L+  SSL+WLV K            
Sbjct: 185  TTEITKVEMLPNDVYIGDITDAADSFRQIRLHSALLDTRSSLQWLVAKVQERIITTNLRK 244

Query: 441  XXVNDANKSRHSFEYSDRDEIMIAHMVGGINAFIKLPQSWPMLTCALKLISLKNSDNHSR 262
              V  +   RH+FEY D+DE ++AH+ GGI+AF+K+   WP+L+  LKL SLKNSDN S 
Sbjct: 245  HIVKSSKTIRHTFEYYDKDETIVAHITGGIDAFLKVSVGWPLLSTPLKLTSLKNSDNQSN 304

Query: 261  VISLSFLCKVEEIANSLDVQIRQNLVSFVDAIEGILVRQMRSELHT 124
             ISLS +CKVEE+ANSLD+Q RQNL  F+DAIE ILV+Q R ELH+
Sbjct: 305  GISLSLICKVEELANSLDLQTRQNLSGFMDAIEKILVQQTREELHS 350

